Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????????	001111110011111100111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f3f3f
SJIS-WIN	???竊??怨??	0011111100111111001111111110001010000110001111110011111110001001100001010011111100111111	3f3f3fe2863f3f89853f3f
EUC-JP	???竊??怨??	0011111100111111001111111110001111100110001111110011111110110001111001010011111100111111	3f3f3fe3e63f3fb1e53f3f
UTF-8	嶺뚭낮竊뚥빳怨⑹꽑	111011111010011010101011111010111001101010101101111010111000001010101110111001111010101110001010111010111001101010100101111010111011100110110011111001101000000010101000111000101001000110111001111010101011110110010001	efa6abeb9aadeb82aee7ab8aeb9aa5ebb9b3e680a8e291b9eabd91
UHC	嶺뚭낮竊뚥빳怨⑹꽑	111001111010110110001100111010101011001110110111111011111011110010001100111001001011101110100101111010101011001110101001111011001000010010100000	e7ad8ceab3b7efbc8ce4bba5eab3a9ec84a0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)