Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	鬩崎淹B	11101001101010011000110111101000100111111011100101000010	e9a98de89fb942
EUC-JP	鬩崎淹B	11110010101010111011101011101010110111101011101101000010	f2abbaeadebb42
UTF-8	鬩崎淹B	11101001101011001010100111100101101101001000111011100110101101111011100101000010	e9aca9e5b48ee6b7b942
UHC	?崎淹B	001111111101000011111000111001011111010001000010	3fd0f8e5f442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)