Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	蔗ｸ鮨戎	11100100111100101011100011101001101111011000111101011110	e4f2b8e9bd8f5e
EUC-JP	蔗ｸ鮨戎	1110100011110100100011101011100011110010101111111011110110111111	e8f48eb8f2bfbdbf
UTF-8	蔗ｸ鮨戎	111010001001010010010111111011111011110110111000111010011010111010101000111001101000100010001110	e89497efbdb8e9aea8e6888e
UHC	蔗??戎	111011011011110100111111001111111110101111010100	edbd3f3febd4

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
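
The conversion in the table can be approximated with a short Python sketch. It is not this tool's actual implementation: the function names `describe` and `build_rows` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example, SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def build_rows(text):
    """Return one (label, bits, hex) tuple per charset in CODECS."""
    return [(label, *describe(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    for label, bits, hex_string in build_rows("A"):
        print(label, bits, hex_string, sep="\t")
```

For the legacy charsets, the bytes produced may differ from the table in some cases, because mapping tables vary between implementations.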