Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	胄傷羌	111000111111001010001111100111011110001110110011	e3f28f9de3b3
EUC-JP	胄傷羌	111001101111010010111101111111011110011010110101	e6f4bdfde6b5
UTF-8	胄傷羌	111010001000001110000100111001011000001010110111111001111011111010001100	e88384e582b7e7be8c
UHC	胄傷羌	111100011011000011011111101111111100101110110110	f1b0dfbfcbb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)