Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	薰ｯ逍ｾ蜀	1111101110011110101011111110011110010110101111101110010110000110	fb9eafe796bee586
EUC-JP	?ｯ逍ｾ蜀	001111111000111010101111111011011111011010001110101111101110100111100110	3f8eafedf68ebee9e6
UTF-8	薰ｯ逍ｾ蜀	111010001001011010110000111011111011110110101111111010011000000010001101111011111011110110111110111010001001110010000000	e896b0efbdafe9808defbdbee89c80
UHC	薰?逍?蜀	1111110110111001001111111110000111001110001111111111010110111001	fdb93fe1ce3ff5b9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)