Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???BF	0011111100111111001111110100001001000110	3f3f3f4246
SJIS-WIN	鈺ュ〉BF	1111101111000100100000111000010110000001011100100100001001000110	fbc4838581724246
EUC-JP	鈺ュ〉BF	100011111110001111010101101001011110010110100001110100110100001001000110	8fe3d5a5e5a1d34246
UTF-8	鈺ュ〉BF	1110100110001000101110101110001110000011101001011110001110000000100010010100001001000110	e988bae383a5e380894246
UHC	鈺ュ〉BF	1110100010101101101010111110010110100001101101010100001001000110	e8adabe5a1b54246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)