Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	謗袴誤潔	1110011010001110100011001101000110001100111010111000110010001001	e68e8cd18ceb8c89
EUC-JP	謗袴誤潔	1110101111101110101110001101001110111000111011011011011111101001	ebeeb8d3b8edb7e9
UTF-8	謗袴誤潔	111010001010110010010111111010001010001010110100111010001010101010100100111001101011110110010100	e8ac97e8a2b4e8aaa4e6bd94
UHC	謗袴誤潔	1101101110111111110011011100110111101000101001101100110010111110	dbbfcdcde8a6ccbe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)