What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 »óìªQ»óìªS»óìªE»óìªX 100011111011101111110011111011001010101001010001100011111011101111110011111011001010101001010011100011111011101111110011111011001010101001000101100011111011101111110011111011001010101001011000 8fbbf3ecaa518fbbf3ecaa538fbbf3ecaa458fbbf3ecaa58
SJIS-WIN ?????Q?????S?????E?????X 001111110011111100111111001111110011111101010001001111110011111100111111001111110011111101010011001111110011111100111111001111110011111101000101001111110011111100111111001111110011111101011000 3f3f3f3f3f513f3f3f3f3f533f3f3f3f3f453f3f3f3f3f58
EUC-JP ??óìªQ??óìªS??óìªE??óìªX 001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001010001001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001010011001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001000101001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001011000 3f3f8fabd18fabc08fa2ec513f3f8fabd18fabc08fa2ec533f3f8fabd18fabc08fa2ec453f3f8fabd18fabc08fa2ec58
UTF-8 »óìªQ»óìªS»óìªE»óìªX 1100001010001111110000101011101111000011101100111100001110101100110000101010101001010001110000101000111111000010101110111100001110110011110000111010110011000010101010100101001111000010100011111100001010111011110000111011001111000011101011001100001010101010010001011100001010001111110000101011101111000011101100111100001110101100110000101010101001011000 c28fc2bbc3b3c3acc2aa51c28fc2bbc3b3c3acc2aa53c28fc2bbc3b3c3acc2aa45c28fc2bbc3b3c3acc2aa58
UHC ????ªQ????ªS????ªE????ªX 00111111001111110011111100111111101010001010001101010001001111110011111100111111001111111010100010100011010100110011111100111111001111110011111110101000101000110100010100111111001111110011111100111111101010001010001101011000 3f3f3f3fa8a3513f3f3f3fa8a3533f3f3f3fa8a3453f3f3f3fa8a358
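The conversion shown in the table can be approximated with a minimal Python sketch. The helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names are assumptions made for illustration, not part of the original tool. The sample string is the six code points that the UTF-8 row decodes to (U+008F, U+00BB, U+00F3, U+00EC, U+00AA, "Q"). Unencodable characters are replaced with "?" (0x3f), which resembles the 3f bytes in the table, but results for the Japanese and Korean charsets may differ from the table because converters use different mapping tables.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed correspondence between the table's labels and Python codec names
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    # First six code points of the input, recovered from the UTF-8 row
    sample = "\u008f\u00bb\u00f3\u00ec\u00aaQ"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("A", "utf-8")` returns `("01000001", "41")`.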

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
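As a small illustration of how these two charsets differ, the sketch below encodes the same hiragana character with each. It assumes Python's `cp932` and `euc_jp` codecs as stand-ins for SJIS-Win and EUC-JP. The sample character is chosen here for illustration and is not taken from the table.

```python
# Encode one hiragana character with both Japanese codecs.
text = "\u3042"  # HIRAGANA LETTER A

sjis_hex = text.encode("cp932").hex()    # Windows-style Shift_JIS
eucjp_hex = text.encode("euc_jp").hex()  # UNIX-style EUC-JP

print("SJIS-Win:", sjis_hex)  # 82a0
print("EUC-JP:  ", eucjp_hex)  # a4a2
```

Both results are two bytes long, but the byte values differ, so text must be decoded with the same charset that was used to encode it.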