Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	淨瀧紋鵄	1001111111000100100100011110101110010110111001001110100111110101	9fc491eb96e4e9f5
EUC-JP	淨瀧紋鵄	1101111011000110110000101110110111001100111001101111001011110111	dec6c2edcce6f2f7
UTF-8	淨瀧紋鵄	111001101011011110101000111001111000000010100111111001111011010010001011111010011011010110000100	e6b7a8e780a7e7b48be9b584
UHC	淨瀧紋?	11101111111001001101011011101001110110101010001100111111	efe4d6e9daa33f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)