Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???A?	0011111100111111001111110100000100111111	3f3f3f413f
SJIS-WIN	薪??A薪	10010000011001000011111100111111010000011001000001100100	90643f3f419064
EUC-JP	薪??A薪	10111111110001010011111100111111010000011011111111000101	bfc53f3f41bfc5
UTF-8	薪딇뢞A薪	11101000100101101010101011101011100101001000011111101011101000101001111001000001111010001001011010101010	e896aaeb9487eba29e41e896aa
UHC	薪딇뢞A薪	111000111110111110001010111011011000111101011001010000011110001111101111	e3ef8aed8f5941e3ef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)