Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	?撤呈??逕糾	0011111110010011010100001001001011100110001111110011111111100111100101001000101110001010	3f935092e63f3fe7948b8a
EUC-JP	?撤呈??逕糾	0011111111000101101100011100010011101000001111110011111111101101111101001011010111101010	3fc5b1c4e83f3fedf4b5ea
UTF-8	뤋撤呈쨵샅逕糾	111010111010010010001011111001101001001010100100111001011001000110001000111011001010100010110101111011001000001110000101111010011000000010010101111001111011001110111110	eba48be692a4e59188eca8b5ec8385e98095e7b3be
UHC	뤋撤呈쨵샅逕糾	1000111110111011111101001100110011101111110100001010010010001111101110111111010011001100111011111101000010101100	8fbbf4ccefd0a48fbbf4ccefd0ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)