To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 泣??堪薇迅? | 1000101110000011001111110011111110001010101011001110010101001110100100000111011000111111 | 8b833f3f8aace54e90763f |
| EUC-JP | 泣?檉堪薇迅? | 10110101111000110011111110001111110001011011101110110100101011101110100110101111101111111101011100111111 | b5e33f8fc5bbb4aee9afbfd73f |
| UTF-8 | 泣렧檉堪薇迅렒 | 111001101011001110100011111010111010000010100111111001101010101010001001111001011010000010101010111010001001011010000111111010001011111110000101111010111010000010010010 | e6b3a3eba0a7e6aa89e5a0aae89687e8bf85eba092 |
| UHC | 泣렧檉堪薇迅렒 | 1110101111101000100011101011011011101111111000001100101011101101110110101011100111100011111101101000111010100111 | ebe88eb6efe0caeddab9e3f68ea7 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
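
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the mapping from the charset names in the table to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, the helper names `encode_row` and `encode_table` are made up for the example, and Python's codecs may not agree with this tool on every character. Characters that a charset cannot represent are replaced with `?` (0x3f), which is the behavior the table shows.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (character as stored, binary bit string, hex string) for one codec."""
    # errors="replace" substitutes "?" for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survives in this charset.
    stored = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return stored, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (stored, bits, hexstr) in encode_table("泣").items():
        print(name, stored, bits, hexstr)
```

Running the script prints one row per charset for the single character 泣, in the same column order as the table above.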