Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???G	00111111001111110011111101000111	3f3f3f47
SJIS-WIN	唯?炬G	100101110100001000111111111000000111100001000111	97423fe07847
EUC-JP	唯?炬G	110011011010001100111111110111111101100101000111	cda33fdfd947
UTF-8	唯미炬G	11100101100101001010111111101011101011111011100011100111100000101010110001000111	e594afebafb8e782ac47
UHC	唯미炬G	11101010111001101011100111001100110010111110001101000111	eae6b9cccbe347

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)