Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	杖?第肯?宰	10001111111100010011111110010001111001101000110101101101001111111000110111001001	8ff13f91e68d6d3f8dc9
EUC-JP	杖?第肯?宰	10111110111100110011111111000010111010001011100111001110001111111011101011001011	bef33fc2e8b9ce3fbacb
UTF-8	杖렱第肯걋宰	111001101001110110010110111010111010000010110001111001111010110010101100111010001000001010101111111010101011000110001011111001011010111010110000	e69d96eba0b1e7acace882afeab18be5aeb0
UHC	杖렱第肯걋宰	111011011110100010001110101111101111000010101111110100001110100110110000110000001110111010100101	ede88ebef0afd0e9b0c0eea5

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
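
The conversion in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the codec names (latin-1, cp932, euc_jp, utf-8, cp949) are Python's and are only assumed to correspond to the charset labels above, and the helper name describe is made up for illustration. A character that a charset cannot represent is replaced with "?" (byte 3f), which is what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (displayed text, bit string, hex string) for text in codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what actually survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "杖렱第肯걋宰"
    for label, codec in CHARSETS.items():
        shown, bits, hex_string = describe(sample, codec)
        print(label, shown, bits, hex_string, sep="\t")
```

Running the script prints one tab-separated row per charset, in the same column order as the table. The bit string is always eight times as long as the byte count, and the hexadecimal string is twice as long.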