To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???諭ら?矣??艶l?而?????吾 0011111100111111001111111001011101000000100000101110011100111111111000011110000100111111001111111000100110010000100000101000110000111111100011101010011100111111001111110011111100111111001111111000110011100001 3f3f3f974082e73fe1e13f3f8990828c3f8ea73f3f3f3f3f8ce1
EUC-JP ???諭ら?矣??艶l?而??洹??吾 00111111001111110011111111001101101000011010010011101001001111111110001011100011001111110011111110110001111100001010001111101100001111111011110010101001001111110011111110001111110001111011101000111111001111111011100011100011 3f3f3fcda1a4e93fe2e33f3fb1f0a3ec3fbca93f3f8fc7ba3f3fb8e3
UTF-8 閱묐갭諭ら걬矣ㅻ룥艶l꼷而숂솻洹섎쳷吾 111010011001011010110001111010111010110010010000111010101011000010101101111010001010101110101101111000111000001010001001111010101011000110101100111001111001111110100011111000111000010110111011111010111010001110100101111010001000100110110110111011111011110110001100111010101011110010110111111010001000000010001100111011001000100010000010111011001000011010111011111001101011010010111001111011001000010010001110111011001011001110110111111001011001000010111110 e996b1ebac90eab0ade8abade38289eab1ace79fa3e385bbeba3a5e889b6efbd8ceabcb7e8808cec8882ec86bbe6b4b9ec848eecb3b7e590be
UHC 閱묐갭諭ら걬矣ㅻ룥艶l꼷而숂솻洹섎쳷吾 1110011011110011100100011110101110110000101110001110101110110001101010101110100110000001100101011110101111111000101001001110101110001111100111101110011011111101101000111110110010000100100011111110110010111011100110011110011110011001101100001110101010110111100110001110101110101011100110101110011111101110 e6f391ebb0b8ebb1aae98195ebf8a4eb8f9ee6fda3ec848fecbb99e799b0eab798ebab9ae7ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)