Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	??醍??訟	0011111100111111100100011110011100111111001111111000111111010111	3f3f91e73f3f8fd7
EUC-JP	??醍??訟	0011111100111111110000101110100100111111001111111011111011011001	3f3fc2e93f3fbed9
UTF-8	履렰醍당뤈訟	111011111010011110011111111010111010000010110000111010011000011010001101111010111000101110111001111010111010010010001000111010001010100010011111	efa79feba0b0e9868deb8bb9eba488e8a89f
UHC	履렰醍당뤈訟	111011001010101010001110101111011111000010110101101101001110011110001111101110001110000111101000	ecaa8ebdf0b5b4e78fb8e1e8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)