To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???撓η?亥??跡???譯??f?跡 0011111100111111001111111001110110011010100000111100010100111111100010001110010100111111001111111001000011010101001111110011111100111111111001101010000100111111001111111000001010000110001111111001000011010101 3f3f3f9d9a83c53f88e53f3f90d53f3f3fe6a13f3f82863f90d5
EUC-JP 縕??撓η?亥??跡?縕?譯??f?跡 100011111101010011000010001111110011111111011001111110101010011011000111001111111011000011100111001111110011111111000000110101110011111110001111110101001100001000111111111011001010001100111111001111111010001111100110001111111100000011010111 8fd4c23f3fd9faa6c73fb0e73f3fc0d73f8fd4c23feca33f3fa3e63fc0d7
UTF-8 縕됵슴撓η슢亥썽츝跡뼁縕됳譯욥뽪f퀗跡 1110011110111000100101011110101110010000101101011110110010001010101101001110011010010010100100111100111010110111111011001000101010100010111001001011101010100101111011001000110110111101111011001011100010011101111010001011011110100001111010111011110010000001111001111011100010010101111010111001000010110011111010001010110110101111111011001001101010100101111010111011110110101010111011111011110110000110111011011000000010010111111010001011011110100001 e7b895eb90b5ec8ab4e69293ceb7ec8aa2e4baa5ec8dbdecb89de8b7a1ebbc81e7b895eb90b3e8adafec9aa5ebbdaaefbd86ed8097e8b7a1
UHC 縕됵슴撓η슢亥썽츝跡뼁縕됳譯욥뽪f퀗跡 1110100010110010100010011110111110111101101111111110100011110101101001011110011110011010101011101111101010100100101111011110100110101110100101101110111011100110101110111011111111101000101100101000100111101110111001101011101110111111111010011001011011100110101000111110011010110011100011001110111011100110 e8b289efbdbfe8f5a5e79aaefaa4bde9ae96eee6bbbfe8b289eee6bbbfe996e6a3e6b38ceee6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
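
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the `CHARSETS` mapping are hypothetical, and the choice of Python codec for each label (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption that may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survived the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_table("あ"):
        print(label, shown, bits, hexa)
```

Each row holds the charset label, the text as it reads after a round trip through that charset, and the encoded bytes written as a binary string and as a hexadecimal string.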