To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥?????儀??碍??異?┃???蘂??異 1001101010001011001111110011111100111111001111110011111110001011010101100011111100111111100010100101011000111111001111111000100011011001001111111000010010101011001111110011111100111111111001010100000100111111001111111000100011011001 9a8b3f3f3f3f3f8b563f3f8a563f3f88d93f84ab3f3f3fe5413f3f88d9
EUC-JP 嚥??瑗??儀??碍??異?┃洹??蘂??異 110100111110101100111111001111111000111111001100110000000011111100111111101101011011011100111111001111111011001110110111001111110011111110110000110110110011111110101000101011011000111111000111101110100011111100111111111010011010001000111111001111111011000011011011 d3eb3f3f8fccc03f3fb5b73f3fb3b73f3fb0db3fa8ad8fc7ba3f3fe9a23f3fb0db
UTF-8 嚥싳쉸瑗룡끽儀숈춷碍⑹룇異룬┃洹섎쳛蘂뚳퐞異 111001011001101010100101111011001000101110110011111011001000100110111000111001111001000110010111111010111010001110100001111010111000000110111101111001011000010010000000111011001000100010001000111011001011011010110111111001111010001010001101111000101001000110111001111010111010001110000111111001111001010110110000111010111010001110101100111000101001010010000011111001101011010010111001111011001000010010001110111011001011001110011011111010001001100010000010111010111001101010110011111011011001000010011110111001111001010110110000 e59aa5ec8bb3ec89b8e79197eba3a1eb81bde58480ec8888ecb6b7e7a28de291b9eba387e795b0eba3ace29483e6b4b9ec848eecb39be89882eb9ab3ed909ee795b0
UHC 嚥싳쉸瑗룡끽儀숈춷碍⑹룇異룬┃洹섎쳛蘂뚳퐞異 1110011010111111100110101110110010011010100011101110101010111100101101111110011010110011101000111110101111110000100110011110110010101101100100111110010011110100101010011110110010001111100001101110110010110110101101111110100110100110101011011110101010110111100110001110101110101011100000011110011111011110100011001110111110111101100001111110110010110110 e6bf9aec9a8eeabcb7e6b3a3ebf099ecad93e4f4a9ec8f86ecb6b7e9a6adeab798ebab81e7de8cefbd87ecb6

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
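
The same kind of conversion can be sketched offline in Python. This is only a minimal sketch and not the code behind this page: the helper name `encode_table` is hypothetical, and the mapping from the charset labels above to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table above shows for ISO-8859-1.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For the input "あ", the UTF-8 row is e38182 and the ISO-8859-1 row is 3f, because Latin-1 has no mapping for that character and it is replaced with "?".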