Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??T??B	001111110011111101010100001111110011111101000010	3f3f543f3f42
SJIS-WIN	??T??B	001111110011111101010100001111110011111101000010	3f3f543f3f42
EUC-JP	??T??B	001111110011111101010100001111110011111101000010	3f3f543f3f42
UTF-8	횂혫T횄쨘B	1110110110011010100000101110110110011000101010110101010011101101100110101000010011101100101010001001100001000010	ed9a82ed98ab54ed9a84eca89842
UHC	횂혫T횄쨘B	11000011100000101100001010010011010101001100001110000011110000101011101001000010	c382c29354c383c2ba42

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP) respectively.
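
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the `CODECS` mapping and the `describe` and `convert` helpers are hypothetical names, and it assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` codecs correspond to the charsets listed above. Characters a charset cannot represent are replaced with `?` (byte 3f), which is one plausible reading of the `??T??B` rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The pairing (e.g. UHC -> cp949) is an assumption of this sketch.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Encode text with codec and return (hex string, bit string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    hex_string = raw.hex()
    # Each byte becomes eight binary digits, most significant bit first.
    bit_string = "".join(f"{byte:08b}" for byte in raw)
    return hex_string, bit_string


def convert(text):
    """Return {charset label: (hex string, bit string)} for every codec."""
    return {label: describe(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one tab-separated row per charset, in the table's column order.
    sample = "T"
    for label, (hex_string, bit_string) in convert(sample).items():
        print(label, sample, bit_string, hex_string, sep="\t")
```

An ASCII character such as `T` produces the same single byte (54) in every row, since all five charsets agree on the ASCII range; differences appear only for characters outside it.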