Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????Z	001111110011111100111111001111110011111101011010	3f3f3f3f3f5a
SJIS-WIN	遏ｭ蟄俶據Z	11100111100111111010110111100101101011011001100011100110100111011001111101011010	e79fade5ad98e69d9f5a
EUC-JP	遏ｭ蟄俶據Z	1110111010100001100011101010110111101010101011111101000011101000110110101010000101011010	eea18eadeaafd0e8daa15a
UTF-8	遏ｭ蟄俶據Z	11101001100000011000111111101111101111011010110111101000100111111000010011100100101111111011011011100110100100111001101001011010	e9818fefbdade89f84e4bfb6e6939a5a
UHC	??蟄?據Z	0011111100111111111101101101111000111111110010111110000001011010	3f3ff6de3fcbe05a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)