To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 瘟??言?????B 111000011000100100111111001111111000110010111110001111110011111100111111001111110011111101000010 e1893f3f8cbe3f3f3f3f3f42
EUC-JP 瘟??言?????B 111000011110100100111111001111111011100011000000001111110011111100111111001111110011111101000010 e1e93f3fb8c03f3f3f3f3f42
UTF-8 瘟룩뇮言됭갬銳볠뜆B 11100111100110001001111111101011101000111010100111101011100001111010111011101000101010001000000011101011100100001010110111101010101100001010110011101001100010101011001111101011101100111010000011101011100111001000011001000010 e7989feba3a9eb87aee8a880eb90adeab0ace98ab3ebb3a0eb9c8642
UHC 瘟룩뇮言됭갬銳볠뜆B 11101000101100001011011111101000100001111001001111100101111010111000100111101000101100001011011111100111111001011001001111100110100011011000100101000010 e8b0b7e88793e5eb89e8b0b7e7e593e68d8942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)