Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??\?	00111111001111110101110000111111	3f3f5c3f
SJIS-WIN	歎測\辰	10010010010101101001000110101010010111001001001001000011	925691aa5c9243
EUC-JP	歎測\辰	11000011101101111100001010101100010111001100001110100100	c3b7c2ac5cc3a4
UTF-8	歎測\辰	11100110101011011000111011100110101110001010110001011100111010001011111010110000	e6ad8ee6b8ac5ce8beb0
UHC	歎測\辰	11110111101001111111011010110100010111001111001011100011	f7a7f6b45cf2e3

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
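
The rows in the table can be approximated offline with a short Python sketch. This is not the converter's own implementation. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are Python's, picked here as assumed equivalents of the charset labels in the table, and the helper names encode_row and build_table are hypothetical. Characters a charset cannot represent are replaced with "?", which appears to be what happened in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (text as the codec sees it, binary bit string, hex bit string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), binary, raw.hex()


def build_table(text):
    """Build one (charset, character, binary, hex) row per charset."""
    return [(label, *encode_row(text, codec)) for label, codec in CODECS.items()]


if __name__ == "__main__":
    # The backslash is escaped in the Python literal; the input is 4 characters.
    for row in build_table("歎測\\辰"):
        print("\t".join(row))
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.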