Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	蹂ル?酉	11100110111110001000001110001011001111111001001111010001	e6f8838b3f93d1
EUC-JP	蹂ル?酉	11101100111110101010010111101011001111111100011011010011	ecfaa5eb3fc6d3
UTF-8	蹂ル뜎酉	111010001011100110000010111000111000001110101011111010111001110010001110111010011000010110001001	e8b982e383abeb9c8ee98589
UHC	蹂ル뜎酉	1110101110110011101010111110101110001101100100011110101110110111	ebb3abeb8d91ebb7

SJIS-WIN, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
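
For readers who want to reproduce a table like the one above offline, here is a minimal Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, as is the choice to substitute "?" for characters a charset cannot represent. The helper names encode_row and convert are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed characters, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced with "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Eight bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    return shown, bits, raw.hex()


def convert(text):
    """Build one row per charset, keyed by the charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


# Same input as the table above.
rows = convert("蹂ル뜎酉")
```

Each entry in rows holds the three value columns of the table for one charset. The ISO-8859-1 row collapses to four "?" bytes because none of the input characters exist in that charset.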