Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????ø	001111110011111100111111001111110011111111111000	3f3f3f3f3ff8
SJIS-WIN	鼇???ラ?	1110101010000111001111110011111100111111100000111000100100111111	ea873f3f3f83893f
EUC-JP	鼇???ラø	11110011111001110011111100111111001111111010010111101001100011111010100111001100	f3e73f3f3fa5e98fa9cc
UTF-8	鼇믣쨶略ラø	1110100110111100100001111110101110101111101000111110110010101000101101101110111110100101101101101110001110000011101010011100001110111000	e9bc87ebafa3eca8b6efa5b6e383a9c3b8
UHC	鼇믣쨶略ラø	111010001010100010010010111001011010010010010000111001011011001010101011111010011010100110101010	e8a892e5a490e5b2abe9a9aa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
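
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the function name `char_codes` is hypothetical, and the codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are Python's nearest equivalents, which may map a few characters differently from the converter. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is the likely source of the 3f runs in the table.

```python
def char_codes(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, displayed text, binary string, hex string) per charset.

    Assumption: unencodable characters are replaced with '?' (0x3f),
    mirroring the 3f bytes seen in the table above.
    """
    rows = []
    for name in charsets:
        # Encode the text; anything the charset lacks becomes b'?'.
        data = text.encode(name, errors="replace")
        # Eight bits per byte, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survives the round trip.
        shown = data.decode(name, errors="replace")
        rows.append((name, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, shown, bits, hexa in char_codes("ラø"):
        print(name, shown, bits, hexa, sep="\t")
```

Each output line is tab-separated in the same column order as the table: charset, character, binary bit string, hexadecimal bit string.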