Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	??砥?雋麥?	00111111001111111001001101110101001111111110100010110010111010100110110100111111	3f3f93753fe8b2ea6d3f
EUC-JP	珽?砥?雋麥?	100011111100101111111110001111111100010111010110001111111111000010110100111100111100111000111111	8fcbfe3fc5d63ff0b4f3ce3f
UTF-8	珽렖砥렫雋麥륏	111001111000111110111101111010111010000010010110111001111010000010100101111010111010000010101011111010011001101110001011111010011011101010100101111010111010010110001111	e78fbdeba096e7a0a5eba0abe99b8be9baa5eba58f
UHC	珽렖砥렫雋麥륏	1110111111101010100011101010101111110010101100101000111010111001111100011110011011011000111010101011011111110111	efea8eabf2b28eb9f1e6d8eab7f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)