Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	艶?鵝??兀	100010011001000000111111111010100100000000111111001111111001100101011001	89903fea403f3f9959
EUC-JP	艶?鵝??兀	101100011111000000111111111100111010000100111111001111111101000110111010	b1f03ff3a13f3fd1ba
UTF-8	艶쵺鵝녶쁿兀	111010001000100110110110111011001011010110111010111010011011010110011101111010111000010110110110111011001000000110111111111001011000010110000000	e889b6ecb5bae9b59deb85b6ec81bfe58580
UHC	艶쵺鵝녶쁿兀	111001101111110110101101010101001110010010111101100001101110010110011000100001101110100010110100	e6fdad54e4bd86e59886e8b4

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
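
The conversion shown in the table can be roughly reproduced in a few lines of Python. This is only a sketch, not the code behind this page: the function name `describe` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is why the ISO-8859-1 row consists only of 3f bytes.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, charset):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    codec = CHARSETS[charset]
    # Unrepresentable characters become "?" (0x3f) instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode the bytes again to see what the charset actually preserved.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for name in CHARSETS:
        shown, bits, hex_string = describe("艶", name)
        print(name, shown, bits, hex_string, sep="\t")
```

For the single character 艶, the UTF-8 row comes out as e889b6 and the ISO-8859-1 row as 3f, matching the first bytes of the corresponding rows in the table.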