Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	蠏ｬ竭ｼ岺ｽ	111001011011010110101100111000101001000110111100111110101010110110111101	e5b5ace291bcfaadbd
EUC-JP	蠏ｬ竭ｼ岺ｽ	11101010101101111000111010101100111000111111000110001110101111001000111110111011101110001000111010111101	eab78eace3f18ebc8fbbb88ebd
UTF-8	蠏ｬ竭ｼ岺ｽ	111010001010000010001111111011111011110110101100111001111010101110101101111011111011110110111100111001011011001010111010111011111011110110111101	e8a08fefbdace7abadefbdbce5b2baefbdbd
UHC	??竭?岺?	0011111100111111110010101110011000111111110101101011100100111111	3f3fcae63fd6b93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)