Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???I	00111111001111110011111101001001	3f3f3f49
SJIS-WIN	巐呰嚥I	11111010101101101001100111101101100110101000101101001001	fab699ed9a8b49
EUC-JP	巐呰嚥I	1000111110111011111110011101001011101111110100111110101101001001	8fbbf9d2efd3eb49
UTF-8	巐呰嚥I	11100101101101111001000011100101100100011011000011100101100110101010010101001001	e5b790e591b0e59aa549
UHC	??嚥I	0011111100111111111001101011111101001001	3f3fe6bf49

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)