To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 »óìªA»óìªZ»óìªE»óìªL 100011111011101111110011111011001010101001000001100011111011101111110011111011001010101001011010100011111011101111110011111011001010101001000101100011111011101111110011111011001010101001001100 8fbbf3ecaa418fbbf3ecaa5a8fbbf3ecaa458fbbf3ecaa4c
SJIS-WIN ?????A?????Z?????E?????L 001111110011111100111111001111110011111101000001001111110011111100111111001111110011111101011010001111110011111100111111001111110011111101000101001111110011111100111111001111110011111101001100 3f3f3f3f3f413f3f3f3f3f5a3f3f3f3f3f453f3f3f3f3f4c
EUC-JP ??óìªA??óìªZ??óìªE??óìªL 001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001000001001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001011010001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001000101001111110011111110001111101010111101000110001111101010111100000010001111101000101110110001001100 3f3f8fabd18fabc08fa2ec413f3f8fabd18fabc08fa2ec5a3f3f8fabd18fabc08fa2ec453f3f8fabd18fabc08fa2ec4c
UTF-8 »óìªA»óìªZ»óìªE»óìªL 1100001010001111110000101011101111000011101100111100001110101100110000101010101001000001110000101000111111000010101110111100001110110011110000111010110011000010101010100101101011000010100011111100001010111011110000111011001111000011101011001100001010101010010001011100001010001111110000101011101111000011101100111100001110101100110000101010101001001100 c28fc2bbc3b3c3acc2aa41c28fc2bbc3b3c3acc2aa5ac28fc2bbc3b3c3acc2aa45c28fc2bbc3b3c3acc2aa4c
UHC ????ªA????ªZ????ªE????ªL 00111111001111110011111100111111101010001010001101000001001111110011111100111111001111111010100010100011010110100011111100111111001111110011111110101000101000110100010100111111001111110011111100111111101010001010001101001100 3f3f3f3fa8a3413f3f3f3fa8a35a3f3f3f3fa8a3453f3f3f3fa8a34c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)