Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???LB	0011111100111111001111110100110001000010	3f3f3f4c42
SJIS-WIN	谷歎続LB	1001001001001010100100100101011010010001101100010100110001000010	924a925691b14c42
EUC-JP	谷歎続LB	1100001110101011110000111011011111000010101100110100110001000010	c3abc3b7c2b34c42
UTF-8	谷歎続LB	1110100010110000101101111110011010101101100011101110011110110110100110100100110001000010	e8b0b7e6ad8ee7b69a4c42
UHC	谷歎?LB	11001101110110111111011110100111001111110100110001000010	cddbf7a73f4c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)