What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 é™·äº¥äº 1110100110011001101101111110010010111010101001011110010010111010 e999b7e4baa5e4ba
SJIS-WIN ?????¥?? 001111110011111100111111001111110011111110000001100011110011111100111111 3f3f3f3f3f818f3f3f
EUC-JP é??äº?äº 100011111010101110110001001111110011111110001111101010111010001110001111101000101110101100111111100011111010101110100011100011111010001011101011 8fabb13f3f8faba38fa2eb3f8faba38fa2eb
UTF-8 é™·äº¥äº 11000011101010011100001010011001110000101011011111000011101001001100001010111010110000101010010111000011101001001100001010111010 c3a9c299c2b7c3a4c2bac2a5c3a4c2ba
UHC ??·?º??º 0011111100111111101000011010010000111111101010001010110000111111001111111010100010101100 3f3fa1a43fa8ac3f3fa8ac

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
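
The kind of conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), and the exact bytes may differ from the table because each library has its own mapping tables.

```python
def to_bit_strings(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Encode `text` in each charset and return {charset: (binary, hex)}."""
    rows = {}
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[name] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for charset, (binary, hexa) in to_bit_strings("é").items():
        print(charset, binary, hexa)
```

For example, `to_bit_strings("é")["utf-8"]` gives `("1100001110101001", "c3a9")`, which matches the first two bytes of the UTF-8 row above.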