Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????^	00111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f5e
SJIS-WIN	牙??牙??^	100010011110010100111111001111111000100111100101001111110011111101011110	89e53f3f89e53f3f5e
EUC-JP	牙??牙??^	101100101110011100111111001111111011001011100111001111110011111101011110	b2e73f3fb2e73f3f5e
UTF-8	牙뚨떢牙뚨떢^	11100111100010011001100111101011100110101010100011101011100101101010001011100111100010011001100111101011100110101010100011101011100101101010001001011110	e78999eb9aa8eb96a2e78999eb9aa8eb96a25e
UHC	牙뚨떢牙뚨떢^	11100100101100111000110011100111100010111011011011100100101100111000110011100111100010111011011001011110	e4b38ce78bb6e4b38ce78bb65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)