Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	??煩趾	001111110011111110010100110011111110011011100100	3f3f94cfe6e4
EUC-JP	獐?煩趾	1000111111001011101110100011111111001000110100011110110011100110	8fcbba3fc8d1ece6
UTF-8	獐곕煩趾	111001111000110110010000111010101011001110010101111001111000010110101001111010001011011010111110	e78d90eab395e785a9e8b6be
UHC	獐곕煩趾	1110110111101111101100001110101111011011111000011111001010111111	edefb0ebdbe1f2bf

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
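
A table like the one above can be approximated with Python's built-in codecs. This is only a minimal sketch, not the converter's actual implementation: the helper name `describe` is hypothetical, and the mapping of the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption. Python's conversion tables may differ from the tool's for some characters.

```python
def describe(text, codec):
    """Encode text with codec and return (displayed text, binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (byte 0x3f),
    which mirrors the "?" entries in the table.
    """
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)                    # what survives the round trip
    bits = "".join(f"{b:08b}" for b in raw)      # 8 bits per byte, zero-padded
    return shown, bits, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]

for label, codec in CHARSETS:
    shown, bits, hexa = describe("獐곕煩趾", codec)
    print(label, shown, bits, hexa, sep="\t")
```

For the UTF-8 row this yields the same hex string as the table, e78d90eab395e785a9e8b6be, and for ISO-8859-1 all four characters collapse to 3f3f3f3f. The other rows depend on each codec's conversion table and may not match the tool byte for byte.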