Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	要ι?松?	1001011101110110100000111100011100111111100011111011110000111111	977683c73f8fbc3f
EUC-JP	要ι?松?	1100110111010111101001101100100100111111101111101011111000111111	cdd7a6c93fbebe3f
UTF-8	要ι댋松잹	1110100010100110100000011100111010111001111010111000110010001011111001101001110110111110111011001001111010111001	e8a681ceb9eb8c8be69dbeec9eb9
UHC	要ι댋松잹	11101001101010011010010111101001100010001011010011100001111001101010000001000010	e9a9a5e988b4e1e6a042

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
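
The table above can be approximated with a short Python sketch. This is not the converter's own code: the helper names `encode_row` and `charcode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what produces the 3f bytes in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Charset labels follow the table; the codec names on the right are
# Python's standard codecs and are assumed to be close equivalents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec)
    # Render every byte as eight binary digits, concatenated without spaces.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def charcode_table(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexed) in charcode_table("要ι댋松잹").items():
        print(label, shown, bits, hexed, sep="\t")
```

Running the script prints one tab-separated row per charset in the same column order as the table, so the output can be compared against it line by line.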