To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 娃??庸?????D娃??庸?????D^ 10001000101000010011111100111111100101110110011000111111001111110011111100111111001111110100010010001000101000010011111100111111100101110110011000111111001111110011111100111111001111110100010001011110 88a13f3f97663f3f3f3f3f4488a13f3f97663f3f3f3f3f445e
EUC-JP 娃??庸?????D娃??庸?????D^ 10110000101000110011111100111111110011011100011100111111001111110011111100111111001111110100010010110000101000110011111100111111110011011100011100111111001111110011111100111111001111110100010001011110 b0a33f3fcdc73f3f3f3f3f44b0a33f3fcdc73f3f3f3f3f445e
UTF-8 娃좊젙庸롳쪓溜쒕졁D娃좊젙庸롳쪓溜쒕졁D^ 111001011010100010000011111011001010001010001010111011001010000010011001111001011011101010111000111010111010000110110011111011001010101010010011111011111010011110001011111011001001001010010101111011001010000110000001010001001110010110101000100000111110110010100010100010101110110010100000100110011110010110111010101110001110101110100001101100111110110010101010100100111110111110100111100010111110110010010010100101011110110010100001100000010100010001011110 e5a883eca28aeca099e5bab8eba1b3ecaa93efa78bec9295eca18144e5a883eca28aeca099e5bab8eba1b3ecaa93efa78bec9295eca181445e
UHC 娃좊젙庸롳쪓溜쒕졁D娃좊젙庸롳쪓溜쒕졁D^ 111010001101111110100000111010111010000010010101111010011011110010001110111011111010010110001101111010101111111010011100111010111010000010110010010001001110100011011111101000001110101110100000100101011110100110111100100011101110111110100101100011011110101011111110100111001110101110100000101100100100010001011110 e8dfa0eba095e9bc8eefa58deafe9ceba0b244e8dfa0eba095e9bc8eefa58deafe9ceba0b2445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)