To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥??游??蹂≪? 10011010100010110011111100111111100111111110000000111111001111111110011011111000100000011110000100111111 9a8b3f3f9fe03f3fe6f881e13f
EUC-JP 嚥??游??蹂≪? 11010011111010110011111100111111110111101110001000111111001111111110110011111010101000101110001100111111 d3eb3f3fdee23f3fecfaa2e33f
UTF-8 嚥싲쉥游룝벧蹂≪쑂 111001011001101010100101111011001000101110110010111011001000100110100101111001101011100010111000111010111010001110011101111010111011001010100111111010001011100110000010111000101000100110101010111011001001000110000010 e59aa5ec8bb2ec89a5e6b8b8eba39debb2a7e8b982e289aaec9182
UHC 嚥싲쉥游룝벧蹂≪쑂 111001101011111110011010111010111011110110101011111010101111110110110111111001001011101010100110111010111011001110100001111011001001110010100010 e6bf9aebbdabeafdb7e4baa6ebb3a1ec9ca2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)