Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	弔?苑?精	1001001010100010001111111000100110010001001111111001000010111000	92a23f89913f90b8
EUC-JP	弔?苑?精	1100010010100100001111111011000111110001001111111100000010111010	c4a43fb1f13fc0ba
UTF-8	弔렚苑렩精	111001011011110010010100111010111010000010011010111010001000101110010001111010111010000010101001111001111011001010111110	e5bc94eba09ae88b91eba0a9e7b2be
UHC	弔렚苑렩精	11110000110000001000111010101101111010101011110110001110101101111110111111110001	f0c08eadeabd8eb7eff1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)