Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????H	0011111100111111001111110011111101001000	3f3f3f3f48
SJIS-WIN	???恃H	001111110011111100111111100111001001010001001000	3f3f3f9c9448
EUC-JP	???恃H	001111110011111100111111110101111111010001001000	3f3f3fd7f448
UTF-8	쑹렻쒔恃H	11101100100100011011100111101011101000001011101111101100100100101001010011100110100000011000001101001000	ec91b9eba0bbec9294e6818348
UHC	쑹렻쒔恃H	101111101010101110001110110000111011111010101101111000111011111001001000	beab8ec3beade3be48

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)