Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?T?TB	0011111101010100001111110101010001000010	3f543f5442
SJIS-WIN	奪T奪TB	10010010010001000101010010010010010001000101010001000010	92445492445442
EUC-JP	奪T奪TB	11000011101001010101010011000011101001010101010001000010	c3a554c3a55442
UTF-8	奪T奪TB	111001011010010110101010010101001110010110100101101010100101010001000010	e5a5aa54e5a5aa5442
UHC	奪T奪TB	11110111101011000101010011110111101011000101010001000010	f7ac54f7ac5442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)