Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??×???	001111110011111111010111001111110011111100111111	3f3fd73f3f3f
SJIS-WIN	櫻?×違??	100111110100111000111111100000010111111010001000111000010011111100111111	9f4e3f817e88e13f3f
EUC-JP	櫻?×違??	110111011010111100111111101000011101111110110000111000110011111100111111	ddaf3fa1dfb0e33f3f
UTF-8	櫻뗭×違덂죰	1110011010101011101110111110101110010111101011011100001110010111111010011000000110010101111010111000110110000010111011001010001110110000	e6abbbeb97adc397e98195eb8d82eca3b0
UHC	櫻뗭×違덂죰	111001011010000110001011111011001010000110111111111010101101111010001000111001011010000110001011	e5a18beca1bfeade88e5a18b

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
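
The conversion shown in the table can be approximated with a short Python sketch. It assumes that Python's built-in codecs `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows; the `CODECS` mapping and the `to_bits` helper are hypothetical names introduced here, and the page's own converter may handle some characters differently. Characters that a charset cannot represent are replaced with `?` (byte 3f).

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These pairings are an assumption, not taken from the page's converter.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\u6afb\u00d7"  # the characters 櫻 and × from the table
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bits(sample, codec)
        print(label, binary, hexadecimal, sep="\t")
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal one.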