Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	蟆溢棆鬟幄ｫ	1110010110110000100010001110110010011110101100001110100110100011100110111110100010101011	e5b088ec9eb0e9a39be8ab
EUC-JP	蟆溢棆鬟幄ｫ	111010101011001010110000111011101101110010110010111100101010010111010110111010101000111010101011	eab2b0eedcb2f2a5d6ea8eab
UTF-8	蟆溢棆鬟幄ｫ	111010001001111110000110111001101011101010100010111001101010001110000110111010011010110010011111111001011011100110000100111011111011110110101011	e89f86e6baa2e6a386e9ac9fe5b984efbdab
UHC	?溢??幄?	0011111111101100111011100011111100111111111001001100000100111111	3fecee3f3fe4c13f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)