Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	莢苦じ	111001001011000010001011111010101000001010110110	e4b08bea82b6
EUC-JP	莢苦じ	111010001011001010110110111011001010010010111000	e8b2b6eca4b8
UTF-8	莢苦じ	111010001000111010100010111010001000101110100110111000111000000110011000	e88ea2e88ba6e38198
UHC	莢苦じ	111110101111100011001101110010001010101010111000	faf8cdc8aab8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)