Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????T	0011111100111111001111110011111101010100	3f3f3f3f54
SJIS-WIN	棚蔵臓短T	100100100100100110010001101000001001000110011111100100100101101001010100	924991a0919f925a54
EUC-JP	棚蔵臓短T	110000111010101011000010101000101100001010100001110000111011101101010100	c3aac2a2c2a1c3bb54
UTF-8	棚蔵臓短T	11100110101000111001101011101000100101001011010111101000100001111001001111100111100111111010110101010100	e6a39ae894b5e88793e79fad54
UHC	棚??短T	11011101110111000011111100111111110100111010110101010100	dddc3f3fd3ad54

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)