To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜??矣??域??日??矣??筌?? 1110000110011111001111111000001001010111100010110101100000111111001111111110000111100001001111110011111110001000111001100011111100111111100100111111101000111111001111111110000111100001001111110011111111100010101000110011111100111111 e19f3f82578b583f3fe1e13f3f88e63f3f93fa3f3fe1e13f3fe2a33f3f
EUC-JP 癲?8宜??矣??域??日??矣??筌?? 1110001010100001001111111010001110111000101101011011100100111111001111111110001011100011001111110011111110110000111010000011111100111111110001101111110000111111001111111110001011100011001111110011111111100100101001010011111100111111 e2a13fa3b8b5b93f3fe2e33f3fb0e83f3fc6fc3f3fe2e33f3fe4a53f3f
UTF-8 癲쒕8宜루뎁矣섍강域뱄퐦日뗥꼧矣⑹뒞筌곗뼑 111001111001100110110010111011001001001010010101111011111011110010011000111001011010111010011100111010111010001110101000111010111000111010000001111001111001111110100011111011001000010010001101111010101011000010010101111001011001111110011111111010111011000110000100111011011001000010100110111001101001011110100101111010111001011110100101111010101011110010100111111001111001111110100011111000101001000110111001111010111001001010011110111001111010110110001100111010101011001110010111111010111011110010010001 e799b2ec9295efbc98e5ae9ceba3a8eb8e81e79fa3ec848deab095e59f9febb184ed90a6e697a5eb97a5eabca7e79fa3e291b9eb929ee7ad8ceab397ebbc91
UHC 癲쒕8宜루뎁矣섍강域뱄퐦日뗥꼧矣⑹뒞筌곗뼑 111011111010011010011100111010111010001110111000111010111111000110110111111001111011010110101010111010111111100010011000111010101011000010101101111001101011010010111001111011111011110110001111111011001110110110001011111001011000010010000100111010111111100010101001111011001000101010011010111011111010011110110000111011001001011010011001 efa69ceba3b8ebf1b7e7b5aaebf898eab0ade6b4b9efbd8feced8be58484ebf8a9ec8a9aefa7b0ec9699

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)