Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?W?W?W?W	0011111101010111001111110101011100111111010101110011111101010111	3f573f573f573f57
SJIS-WIN	叩W叩W叩W叩W	100100100100000001010111100100100100000001010111100100100100000001010111100100100100000001010111	924057924057924057924057
EUC-JP	叩W叩W叩W叩W	110000111010000101010111110000111010000101010111110000111010000101010111110000111010000101010111	c3a157c3a157c3a157c3a157
UTF-8	叩W叩W叩W叩W	11100101100011111010100101010111111001011000111110101001010101111110010110001111101010010101011111100101100011111010100101010111	e58fa957e58fa957e58fa957e58fa957
UHC	叩W叩W叩W叩W	110011011011000001010111110011011011000001010111110011011011000001010111110011011011000001010111	cdb057cdb057cdb057cdb057

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)