Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???A	00111111001111110011111101000001	3f3f3f41
SJIS-WIN	麗本癈A	10010111111011011001011001111011111000011001001101000001	97ed967be19341
EUC-JP	麗本癈A	11001110111011111100101111011100111000011111001101000001	ceefcbdce1f341
UTF-8	麗本癈A	11101001101110101001011111100110100111001010110011100111100110011000100001000001	e9ba97e69cace7998841
UHC	麗本?A	110101011111001011011100111000100011111101000001	d5f2dce23f41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)