Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??i??iB	00111111001111110110100100111111001111110110100101000010	3f3f693f3f6942
SJIS-WIN	謝移i謝移iB	1000111011010011100010001101101001101001100011101101001110001000110110100110100101000010	8ed388da698ed388da6942
EUC-JP	謝移i謝移iB	1011110011010101101100001101110001101001101111001101010110110000110111000110100101000010	bcd5b0dc69bcd5b0dc6942
UTF-8	謝移i謝移iB	111010001010110010011101111001111010011110111011011010011110100010101100100111011110011110100111101110110110100101000010	e8ac9de7a7bb69e8ac9de7a7bb6942
UHC	謝移i謝移iB	1101111011110011111011001011100101101001110111101111001111101100101110010110100101000010	def3ecb969def3ecb96942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)