Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???^z???^zB 0011111100111111001111110101111001111010001111110011111100111111010111100111101001000010 3f3f3f5e7a3f3f3f5e7a42
SJIS-WIN ???^z???^zB 0011111100111111001111110101111001111010001111110011111100111111010111100111101001000010 3f3f3f5e7a3f3f3f5e7a42
EUC-JP ???^z???^zB 0011111100111111001111110101111001111010001111110011111100111111010111100111101001000010 3f3f3f5e7a3f3f3f5e7a42
UTF-8 묳뫗뫒^z묳뫗뫒^zB 1110101110101100101100111110101110101011100101111110101110101011100100100101111001111010111010111010110010110011111010111010101110010111111010111010101110010010010111100111101001000010 ebacb3ebab97ebab925e7aebacb3ebab97ebab925e7a42
UHC 묳뫗뫒^z묳뫗뫒^zB 1001001001001010100100011011100110010001101101000101111001111010100100100100101010010001101110011001000110110100010111100111101001000010 924a91b991b45e7a924a91b991b45e7a42

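SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Neither can represent the Hangul characters in the input, so they appear as "?" (0x3f) in those rows, as they do for ISO-8859-1.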
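
The table above can be approximated offline with a minimal Python sketch. It is not the code behind this page: the function name `to_bit_strings` and the mapping from the page's charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?", which matches the 0x3f bytes shown in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "묳뫗뫒^z묳뫗뫒^zB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each output line corresponds to one table row: the charset label, the encoded bytes as a binary string, and the same bytes in hexadecimal.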