Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	鈬ｴ辜狗羔辜毅	11100111111000111011010011100111100000111000101111100111111000111011010011100111100000111000101101000010	e7e3b4e7838be7e3b4e7838b42
EUC-JP	鈬ｴ辜狗羔辜毅	1110111011100101100011101011010011101101111000111011011011101001111001101011011011101101111000111011010110100011	eee58eb4ede3b6e9e6b6ede3b5a3
UTF-8	鈬ｴ辜狗羔辜毅	111010011000100010101100111011111011110110110100111010001011111010011100111001111000101110010111111001111011111010010100111010001011111010011100111001101010111110000101	e988acefbdb4e8be9ce78b97e7be94e8be9ce6af85
UHC	??辜狗羔辜毅	001111110011111111001101110100001100111110110111110011011100010011001101110100001110101111110110	3f3fcdd0cfb7cdc4cdd0ebf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)