Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	????綵	001111110011111100111111001111111110001101011100	3f3f3f3fe35c
EUC-JP	????綵	001111110011111100111111001111111110010110111101	3f3f3f3fe5bd
UTF-8	灐멸렊累綵	111001111000000110010000111010111010100110111000111010111010000010001010111011111010010110001111111001111011011010110101	e78190eba9b8eba08aefa58fe7b6b5
UHC	灐멸렊累綵	11111011101001011011100011101010100011101010000111010010111010011111001111110111	fba5b8ea8ea1d2e9f3f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)