Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????H	00111111001111110011111100111111001111110011111101001000	3f3f3f3f3f3f48
SJIS-WIN	ｾｭ自ｾ､竺H	101111101010110110001110101010011011111010100100100011101011000101001000	bead8ea9bea48eb148
EUC-JP	ｾｭ自ｾ､竺H	10001110101111101000111010101101101111001010101110001110101111101000111010100100101111001011001101001000	8ebe8eadbcab8ebe8ea4bcb348
UTF-8	ｾｭ自ｾ､竺H	11101111101111011011111011101111101111011010110111101000100001111010101011101111101111011011111011101111101111011010010011100111101010111011101001001000	efbdbeefbdade887aaefbdbeefbda4e7abba48
UHC	??自??竺H	001111110011111111101101101110110011111100111111111101011110011101001000	3f3fedbb3f3ff5e748

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)