To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒸??荳←繒??? 10001111111101100011111100111111111001001011100010000001101010011111101110001111001111110011111100111111 8ff63f3fe4b881a9fb8f3f3f3f
EUC-JP 蒸?瀣荳←繒??? 10111110111110000011111110001111110010011011000111101000101110101010001010101011100011111101010011010100001111110011111100111111 bef83f8fc9b1e8baa2ab8fd4d43f3f3f
UTF-8 蒸렧瀣荳←繒목렰렮 111010001001001010111000111010111010000010100111111001111000000010100011111010001000110110110011111000101000011010010000111001111011100110010010111010111010101010101001111010111010000010110000111010111010000010101110 e892b8eba0a7e780a3e88db3e28690e7b992ebaaa9eba0b0eba0ae
UHC 蒸렧瀣荳←繒목렰렮 111100011111101010001110101101101111101010101110110101001110010110100001111001111111000111111001101110001111000110001110101111011000111010111011 f1fa8eb6faaed4e5a1e7f1f9b8f18ebd8ebb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)