What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 筌??泣ワ????億??筌??泣ワ????億??E 111000101010001100111111001111111000101110000011100000111000111100111111001111110011111100111111100010011010110100111111001111111110001010100011001111110011111110001011100000111000001110001111001111110011111100111111001111111000100110101101001111110011111101000101 e2a33f3f8b83838f3f3f3f3f89ad3f3fe2a33f3f8b83838f3f3f3f3f89ad3f3f45
EUC-JP 筌??泣ワ????億??筌??泣ワ????億??E 111001001010010100111111001111111011010111100011101001011110111100111111001111110011111100111111101100101010111100111111001111111110010010100101001111110011111110110101111000111010010111101111001111110011111100111111001111111011001010101111001111110011111101000101 e4a53f3fb5e3a5ef3f3f3f3fb2af3f3fe4a53f3fb5e3a5ef3f3f3f3fb2af3f3f45
UTF-8 筌뚮뿦泣ワ쭫類ㅺ묘億됲냹筌뚮뿦泣ワ쭫類ㅺ묘億됲냼E 11100111101011011000110011101011100110101010111011101011101111111010011011100110101100111010001111100011100000111010111111101100101011011010101111101111101001111001000011100011100001011011101011101011101011001001100011100101100001001000010011101011100100001011001011101011100000111011100111100111101011011000110011101011100110101010111011101011101111111010011011100110101100111010001111100011100000111010111111101100101011011010101111101111101001111001000011100011100001011011101011101011101011001001100011100101100001001000010011101011100100001011001011101011100000111011110001000101 e7ad8ceb9aaeebbfa6e6b3a3e383afecadabefa790e385baebac98e58484eb90b2eb83b9e7ad8ceb9aaeebbfa6e6b3a3e383afecadabefa790e385baebac98e58484eb90b2eb83bc45
UHC 筌뚮뿦泣ワ쭫類ㅺ묘億됲냹筌뚮뿦泣ワ쭫類ㅺ묘億됲냼E 11101111101001111000110011101011100101111010011011101011111010001010101111101111101001111001111111101011101110101010010011101010101110011010011011100101111000101000100111101101100001101000100111101111101001111000110011101011100101111010011011101011111010001010101111101111101001111001111111101011101110101010010011101010101110011010011011100101111000101000100111101101100001101000110001000101 efa78ceb97a6ebe8abefa79febbaa4eab9a6e5e289ed8689efa78ceb97a6ebe8abefa79febbaa4eab9a6e5e289ed868c45

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
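
The conversion in the table above can be approximated with a short Python sketch. It is not the code behind this page. The `CHARSETS` mapping and the `encode_table` helper are hypothetical names, and the choice of Python codec for each label (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed: Unified Hangul Code = code page 949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("E"):
    print(label, bits, hex_string)
```

For the ASCII letter "E", every charset listed here yields the single byte 45 (binary 01000101), which matches the final byte of each row in the table.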