Character and Charcode - Check how a computer recognizes characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	阡包ｾ擾ｾ柯	11101000100101001001010111101111101111101000111111101111101111101001111001101000	e89495efbe8fefbe9e68
EUC-JP	阡包ｾ擾ｾ柯	111011111111010011001010111100011000111010111110101111101111000110001110101111101101101111001001	eff4caf18ebebef18ebedbc9
UTF-8	阡包ｾ擾ｾ柯	111010011001100010100001111001011000110010000101111011111011110110111110111001101001001110111110111011111011110110111110111001101001111110101111	e998a1e58c85efbdbee693beefbdbee69faf
UHC	阡包?擾?柯	11110100110001101111100011010000001111111110100011110110001111111100101010101111	f4c6f8d03fe8f63fcaaf

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
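
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `charcode_rows` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the ISO-8859-1 and UHC rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def charcode_rows(text, charsets=CHARSETS):
    """Return (label, character, binary string, hex string) for each charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Each byte is written as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survived the round trip.
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


# Print one tab-separated row per charset, in the same column order as the table.
for row in charcode_rows("A"):
    print("\t".join(row))
```

Passing a longer string, such as the six characters used in the table, yields one row per charset in the same way. Byte sequences for the legacy charsets depend on each codec's mapping tables, so small differences from the table are possible for rarely used characters.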