Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??T??	0011111100111111010101000011111100111111	3f3f543f3f
SJIS-WIN	韵自T韵自	111010001110111110001110101010010101010011101000111011111000111010101001	e8ef8ea954e8ef8ea9
EUC-JP	韵自T韵自	111100001111000110111100101010110101010011110000111100011011110010101011	f0f1bcab54f0f1bcab
UTF-8	韵自T韵自	11101001100111111011010111101000100001111010101001010100111010011001111110110101111010001000011110101010	e99fb5e887aa54e99fb5e887aa
UHC	?自T?自	00111111111011011011101101010100001111111110110110111011	3fedbb543fedbb
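
The rows above can be reproduced approximately with a short Python sketch. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name describe; this is not the code behind the page. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (displayed text, bit string, hex string) for text in codec."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip.
    shown = raw.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = describe("韵自T韵自", codec)
        print(label, shown, bits, hexed, sep="\t")
```

Each output line has the same four tab-separated columns as the table: charset, character, binary bit string, and hexadecimal bit string.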

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, SJIS-WIN (= CP932) on Windows and EUC-JP on UNIX.