Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	懿ｌ?B	100111001111001010000010100011000011111101000010	9cf2828c3f42
EUC-JP	懿ｌ?B	110110001111010010100011111011000011111101000010	d8f4a3ec3f42
UTF-8	懿ｌ븺B	11100110100001111011111111101111101111011000110011101011101110001011101001000010	e687bfefbd8cebb8ba42
UHC	懿ｌ븺B	11101011111100111010001111101100100101011010001101000010	ebf3a3ec95a342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)