Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	闇λ?爾	10001000110001011000001111001001001111111000111010100010	88c583c93f8ea2
EUC-JP	闇λĿ爾	101100001100011110100110110010111000111110101001101010011011110010100100	b0c7a6cb8fa9a9bca4
UTF-8	闇λĿ爾	11101001100101111000011111001110101110111100010010111111111001111000100010111110	e99787cebbc4bfe788be
UHC	闇λĿ爾	1110010011100001101001011110101110101000101010001110110010110011	e4e1a5eba8a8ecb3

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
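
The conversion in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the function name to_bits, the CHARSETS mapping, and the choice of Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions made for illustration, and Python's codecs may differ from the tool's converter for some characters. Unencodable characters are replaced with "?", which is how the ISO-8859-1 row ends up as 3f3f3f3f.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def print_table(text):
    """Print one tab-separated row per charset, in the table's layout."""
    print("Charset\tCharacter\tBit string (binary)\tBit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        binary, hexa = to_bits(text, codec)
        # Decode again to show what survives the round trip in that charset.
        shown = text.encode(codec, errors="replace").decode(codec)
        print(f"{label}\t{shown}\t{binary}\t{hexa}")


if __name__ == "__main__":
    print_table("闇λĿ爾")
```

For the sample input, the UTF-8 row should come out as e99787cebbc4bfe788be, matching the table. The other rows depend on the codec tables shipped with your Python build.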