Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??	0011111100111111	3f3f
SJIS-WIN	惜洩	10010000110010011000100101101011	90c9896b
EUC-JP	惜洩	11000000110010111011000111001100	c0cbb1cc
UTF-8	惜洩	111001101000001110011100111001101011010010101001	e6839ce6b4a9
UHC	惜洩	11100000101011011110000011011101	e0ade0dd

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-WIN, i.e. CP932) and on UNIX (EUC-JP).
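
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the charset labels above, and the helper names `to_bits` and `convert` are hypothetical. Characters a charset cannot represent are replaced with `?`, which would explain the `3f3f` result in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s (8 bits per byte)."""
    return "".join(f"{byte:08b}" for byte in data)


def convert(text: str) -> dict[str, tuple[str, str]]:
    """Encode text in each charset and return (binary, hexadecimal) strings."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        rows[label] = (to_bits(data), data.hex())
    return rows


if __name__ == "__main__":
    # Print one tab-separated row per charset, like the table above.
    for label, (bits, hex_string) in convert("惜洩").items():
        print(label, bits, hex_string, sep="\t")
```

Each byte is rendered as exactly eight binary digits and two hexadecimal digits, so the length of either string shows how many bytes a charset needs for the input: four bytes in the Japanese and Korean charsets, six in UTF-8.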