Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	羞???鴉	11100011101101010011111100111111001111111110100111101011	e3b53f3f3fe9eb
EUC-JP	羞???鴉	11100110101101110011111100111111001111111111001011101101	e6b73f3f3ff2ed
UTF-8	羞곇땶둆鴉	111001111011111010011110111010101011001110000111111010111001010110110110111010111001000110000110111010011011010010001001	e7be9eeab387eb95b6eb9186e9b489
UHC	羞곇땶둆鴉	11100010101110101000000111001100100010111000110010001010010000101110010010111100	e2ba81cc8b8c8a42e4bc

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f).
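
The table can be approximated offline with a short Python sketch. This is not the converter's actual implementation: the codec names in CODECS (for example cp932 for SJIS-WIN and cp949 for UHC) and the helper names to_bytes, bit_string and hex_string are assumptions made for illustration. Python's codecs may differ from the converter's tables for some characters. Unrepresentable characters are replaced with "?" (0x3f), as in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bytes(text: str, charset: str) -> bytes:
    """Encode text; characters the charset cannot represent become '?' (0x3f)."""
    return text.encode(CODECS[charset], errors="replace")


def bit_string(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def hex_string(data: bytes) -> str:
    """Render each byte as two lowercase hexadecimal digits, concatenated."""
    return data.hex()


if __name__ == "__main__":
    sample = "\u7f9e\uacc7\ub576\ub446\u9d09"  # the five characters used in the table
    for charset in CODECS:
        raw = to_bytes(sample, charset)
        # Tab-separated: charset, binary bit string, hexadecimal bit string
        print(charset, bit_string(raw), hex_string(raw), sep="\t")
```

Passing errors="replace" to str.encode is what yields the 3f bytes. With the default errors="strict", Python would raise UnicodeEncodeError for the ISO-8859-1 row instead.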