Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	遽袴奇ｽｺ	1110011110101111100011001101000110001010111011111011110110111010	e7af8cd18aefbdba
EUC-JP	遽袴奇ｽｺ	11101110101100011011100011010011101101001111000110001110101111011000111010111010	eeb1b8d3b4f18ebd8eba
UTF-8	遽袴奇ｽｺ	111010011000000110111101111010001010001010110100111001011010010110000111111011111011110110111101111011111011110110111010	e981bde8a2b4e5a587efbdbdefbdba
UHC	遽袴奇??	1100101111101000110011011100110111010000111101000011111100111111	cbe8cdcdd0f43f3f

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
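
The conversion shown in the table can be sketched in Python. This is only an illustration, not the code behind this page: the function name bit_strings and the CODECS mapping are hypothetical, and the Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumed to be the closest equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 and UHC rows show.

```python
# Codec names are Python's nearest equivalents to the table's labels
# (an assumption; a few mappings may differ from the tool's own).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, padded with leading zeros.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Print one tab-separated row per charset for the sample string from the table.
for label, codec in CODECS.items():
    binary, hexadecimal = bit_strings("遽袴奇ｽｺ", codec)
    print(label, binary, hexadecimal, sep="\t")
```

Each row holds the same characters, but the byte count differs by charset: the half-width katakana take one byte each in SJIS-WIN, two in EUC-JP, and three in UTF-8.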