To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????柔??筌??誼??怨??厭??釗 00111111001111110011111100111111001111110011111110001111010111110011111100111111111000101010001100111111001111111000101101100010001111110011111110001001100001010011111100111111100010010111110100111111001111111111101110111011 3f3f3f3f3f3f8f5f3f3fe2a33f3f8b623f3f89853f3f897d3f3ffbbb
EUC-JP ??????柔??筌??誼??怨??厭??釗 0011111100111111001111110011111100111111001111111011110111000000001111110011111111100100101001010011111100111111101101011100001100111111001111111011000111100101001111110011111110110001110111100011111100111111100011111110001110100110 3f3f3f3f3f3fbdc03f3fe4a53f3fb5c33f3fb1e53f3fb1de3f3f8fe3a6
UTF-8 閱뤿툖留㏝씣柔곗춷筌뚯슦誼놅쫫怨뚯뫒厭묐뿦釗 111010011001011010110001111010111010010010111111111011011000100010010110111011111010011110001101111000111000111110011101111011001001010010100011111001101001111110010100111010101011001110010111111011001011011010110111111001111010110110001100111010111001101010101111111011001000101010100110111010001010101010111100111010111000011010000101111011001010101110101011111001101000000010101000111010111001101010101111111010111010101110010010111001011000111010101101111010111010110010010000111010111011111110100110111010011000011110010111 e996b1eba4bfed8896efa78de38f9dec94a3e69f94eab397ecb6b7e7ad8ceb9aafec8aa6e8aabceb8685ecababe680a8eb9aafebab92e58eadebac90ebbfa6e98797
UHC 閱뤿툖留㏝씣柔곗춷筌뚯슦誼놅쫫怨뚯뫒厭묐뿦釗 1110011011110011100011111110101110111000100011011110101110100111101001111110100110011101101101111110101011110101101100001110110010101101100100111110111110100111100011001110110010011010101100001110101111111110100001101110111110100110100001001110101010110011100011001110110010010001101101001110011011110100100100011110101110010111101001101110000111110010 e6f38febb88deba7a7e99db7eaf5b0ecad93efa78cec9ab0ebfe86efa684eab38cec91b4e6f491eb97a6e1f2

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
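
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation: it assumes the table's labels map to Python's codec names as listed in `CHARSETS` (in particular, that SJIS-WIN corresponds to `cp932` and UHC to `cp949`), and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?`, which is byte `3f`; that would account for the runs of `3f` in the rows above.

```python
# Mapping from the table's charset labels to Python codec names.
# This mapping is an assumption made for illustration.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

Each byte is formatted as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table.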