Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	?繹鴦嗽	00111111111000111000100011101001111100011001101001110101	3fe388e9f19a75
EUC-JP	?繹鴦嗽	00111111111001011110100011110010111100111101001111010110	3fe5e8f2f3d3d6
UTF-8	쎐繹鴦嗽	111011001000111010010000111001111011100110111001111010011011010010100110111001011001011110111101	ec8e90e7b9b9e9b4a6e597bd
UHC	쎐繹鴦嗽	1001101110110111111001101011101011100100111011001110000111110101	9bb7e6bae4ece1f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)