Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	髮区ｪ榊建隨	1110100110011011100010111110011010101010100011011110010110001100100110101110011110101100	e99b8be6aa8de58c9ae7ac
EUC-JP	髮区ｪ榊建隨	111100011111101110110110111010001000111010101010101110101110011110110111111110101110111010101110	f1fbb6e88eaabae7b7faeeae
UTF-8	髮区ｪ榊建隨	111010011010101110101110111001011000110010111010111011111011110110101010111001101010011010001010111001011011101110111010111010011001101010101000	e9abaee58cbaefbdaae6a68ae5bbbae99aa8
UHC	髮???建隨	110110111010010100111111001111110011111111001011111011111110001011001011	dba53f3f3fcbefe2cb

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
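
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name `char_codes` is hypothetical, and the codec names are assumptions: Python's `cp932` and `cp949` are used as stand-ins for SJIS-WIN and UHC, and their mapping tables may differ from the page's converter in edge cases. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: closest Python equivalent
}


def char_codes(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" turns unencodable characters into "?"
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in char_codes("あ"):
        print(label, bits, hexstr, sep="\t")
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.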