Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???@h	0011111100111111001111110100000001101000	3f3f3f4068
SJIS-WIN	正脛憮@h	1001000010110011111000111111100010011100111000010100000001101000	90b3e3f89ce14068
EUC-JP	正脛憮@h	1100000010110101111001101111101011011000111000110100000001101000	c0b5e6fad8e34068
UTF-8	正脛憮@h	1110011010101101101000111110100010000100100110111110011010000110101011100100000001101000	e6ada3e8849be686ae4068
UHC	正脛憮@h	1110111111100001110011001110101111011001111001000100000001101000	efe1ccebd9e44068
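
The rows above can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function name `charcode_row` and the `CODECS` mapping are hypothetical, and the mapping from the table's labels to Python codec names (in particular UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?`, which is what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcode_row(text, label):
    """Return (character view, bit string, hex string) for one charset."""
    codec = CODECS[label]
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label in CODECS:
        shown, bits, hexed = charcode_row("正脛憮@h", label)
        print(label, shown, bits, hexed, sep="\t")
```

Under these assumptions the UTF-8 row comes out as e6ada3e8849be686ae4068 and the ISO-8859-1 row as 3f3f3f4068, matching the table. The ASCII characters `@` and `h` encode to the same bytes (40, 68) in every charset listed.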

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP) respectively.