Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	叔須?五	10001111011001101001000001111011001111111000110011011100	8f66907b3f8cdc
EUC-JP	叔須?五	10111101110001111011111111011100001111111011100011011110	bdc7bfdc3fb8de
UTF-8	叔須롆五	111001011000111110010100111010011010000010001000111010111010000110000110111001001011101010010100	e58f94e9a088eba186e4ba94
UHC	叔須롆五	1110001011010010111000101100111010001110110011001110011111101001	e2d2e2ce8ecce7e9

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
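
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper name charcode_rows and the mapping of the page's charset labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) are assumptions. A character that a charset cannot represent is replaced by "?" (byte 3f), which is why the Hangul syllable shows up as "?" in the Japanese charsets and every character does in ISO-8859-1.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def charcode_rows(text):
    """Return (label, shown text, bit string, hex string) per charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CHARSETS:
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = raw.decode(codec, errors="replace")
        # Eight binary digits per byte, concatenated.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in charcode_rows("叔須롆五"):
        print("\t".join(row))
```

Each output row has the same four tab-separated columns as the table: charset, character, binary bit string, hexadecimal bit string.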