Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	???飮?┥	0011111100111111001111111001111101011010001111111000010010111100	3f3f3f9f5a3f84bc
EUC-JP	???飮?┥	0011111100111111001111111101110110111011001111111010100010111110	3f3f3fddbb3fa8be
UTF-8	閱뤿㉡飮댐┥	111010011001011010110001111010111010010010111111111000111000100110100001111010011010001110101110111010111000110010010000111000101001010010100101	e996b1eba4bfe389a1e9a3aeeb8c90e294a5
UHC	閱뤿㉡飮댐┥	111001101111001110001111111010111010100010110010111010111110011010110100111011111010011010111110	e6f38feba8b2ebe6b4efa6be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)