Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????^	00111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f5e
SJIS-WIN	??五??須^	001111110011111110001100110111000011111100111111100100000111101101011110	3f3f8cdc3f3f907b5e
EUC-JP	??五??須^	001111110011111110111000110111100011111100111111101111111101110001011110	3f3fb8de3f3fbfdc5e
UTF-8	念렊五念렊須^	11101111101001101010001111101011101000001000101011100100101110101001010011101111101001101010001111101011101000001000101011101001101000001000100001011110	efa6a3eba08ae4ba94efa6a3eba08ae9a0885e
UHC	念렊五念렊須^	11100110111101101000111010100001111001111110100111100110111101101000111010100001111000101100111001011110	e6f68ea1e7e9e6f68ea1e2ce5e

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
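
A table row like the ones above can be approximated with a short Python sketch. This is not the tool's actual implementation: the codec names in CODECS are an assumed mapping from the table's labels to Python's built-in codecs (cp932 for SJIS-WIN, cp949 for UHC), and to_bits and encode_row are hypothetical helper names. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to be what the tool does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Encode text with codec; unencodable characters become '?'.

    Returns (text as it survives the round trip, bit string, hex string).
    """
    data = text.encode(codec, errors="replace")
    return data.decode(codec), to_bits(data), data.hex()


if __name__ == "__main__":
    sample = "五須^"  # a subset of the characters used in the table
    for label, codec in CODECS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa, sep="\t")
```

For 五 and 須, the hexadecimal values this produces for cp932, euc_jp, and utf-8 agree with the corresponding bytes in the table (8cdc and 907b, b8de and bfdc, e4ba94 and e9a088).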