Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	玉?／俺	10001011110010100011111110000001010111101000100110110100	8bca3f815e89b4
EUC-JP	玉?／俺	10110110110011000011111110100001101111111011001010110110	b6cc3fa1bfb2b6
UTF-8	玉뉒／俺	111001111000111010001001111010111000100110010010111011111011110010001111111001001011111110111010	e78e89eb8992efbc8fe4bfba
UHC	玉뉒／俺	1110100010101100100001111110011110100011101011111110010111101111	e8ac87e7a3afe5ef
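
The rows above can be approximated with a short Python sketch. It is not this page's own implementation. The helper name `describe` is hypothetical, and the sketch assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC. Characters a charset cannot represent are replaced with "?" (byte 3f), which is what the table appears to show. Mappings for some characters may differ from the converter used here.

```python
def describe(text: str, charset: str) -> tuple[str, str, str]:
    """Return (character as seen in the charset, binary bit string, hex bit string).

    `describe` is an illustrative helper, not part of this page's converter.
    """
    # Unrepresentable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(charset, errors="replace")
    # Decode again to show what survives a round trip through the charset.
    shown = raw.decode(charset, errors="replace")
    # Each byte becomes eight binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


# Python codec names assumed to match the charsets listed in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexa = describe("玉뉒／俺", codec)
        print(label, shown, bits, hexa, sep="\t")
```

The output is tab-separated in the same column order as the table, so the two can be compared row by row.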

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text. SJIS-WIN (= CP932) is common on Windows, and EUC-JP on UNIX.