To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN ???誼??貫誼?z???誼??貫誼?zB 001111110011111100111111100010110110001000111111001111111000101011010001100010110110001000111111011110100011111100111111001111111000101101100010001111110011111110001010110100011000101101100010001111110111101001000010 3f3f3f8b623f3f8ad18b623f7a3f3f3f8b623f3f8ad18b623f7a42
EUC-JP ???誼??貫誼?z???誼??貫誼?zB 001111110011111100111111101101011100001100111111001111111011010011010011101101011100001100111111011110100011111100111111001111111011010111000011001111110011111110110100110100111011010111000011001111110111101001000010 3f3f3fb5c33f3fb4d3b5c33f7a3f3f3fb5c33f3fb4d3b5c33f7a42
UTF-8 捻곗옓誼숁굝貫誼팤z捻곗옓誼숁굝貫誼팤zB 111011111010011010100100111010101011001110010111111011001001100010010011111010001010101010111100111011001000100010000001111010101011010110011101111010001011001010101011111010001010101010111100111011011000110010100100011110101110111110100110101001001110101010110011100101111110110010011000100100111110100010101010101111001110110010001000100000011110101010110101100111011110100010110010101010111110100010101010101111001110110110001100101001000111101001000010 efa6a4eab397ec9893e8aabcec8881eab59de8b2abe8aabced8ca47aefa6a4eab397ec9893e8aabcec8881eab59de8b2abe8aabced8ca47a42
UHC 捻곗옓誼숁굝貫誼팤z捻곗옓誼숁굝貫誼팤zB 111001101111011110110000111011001001111010011001111010111111111010011001111001101000001010000101110011101011101111101011111111101011101101100001011110101110011011110111101100001110110010011110100110011110101111111110100110011110011010000010100001011100111010111011111010111111111010111011011000010111101001000010 e6f7b0ec9e99ebfe99e68285cebbebfebb617ae6f7b0ec9e99ebfe99e68285cebbebfebb617a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)