Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???G	00111111001111110011111101000111	3f3f3f47
SJIS-WIN	潟諄緖G	10001010100000111110011001111000111110111000111001000111	8a83e678fb8e47
EUC-JP	潟諄?G	101100111110001111101011110110010011111101000111	b3e3ebd93f47
UTF-8	潟諄緖G	11100110101111011001111111101000101010111000010011100111101101111001011001000111	e6bd9fe8ab84e7b79647
UHC	潟諄緖G	11100000101100111110001011110100110111111111110101000111	e0b3e2f4dffd47

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)