To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 掖⑤?壓?????壤ル?掖⑤?壓?????壤ル?B 1001110101110100100001110100010000111111100110101101100000111111001111110011111100111111001111111001101011011111100000111000101100111111100111010111010010000111010001000011111110011010110110000011111100111111001111110011111100111111100110101101111110000011100010110011111101000010 9d7487443f9ad83f3f3f3f3f9adf838b3f9d7487443f9ad83f3f3f3f3f9adf838b3f42
EUC-JP 掖??壓?????壤ル?掖??壓?????壤ル?B 110110011101010100111111001111111101010011011010001111110011111100111111001111110011111111010100111000011010010111101011001111111101100111010101001111110011111111010100110110100011111100111111001111110011111100111111110101001110000110100101111010110011111101000010 d9d53f3fd4da3f3f3f3f3fd4e1a5eb3fd9d53f3fd4da3f3f3f3f3fd4e1a5eb3f42
UTF-8 掖⑤젙壓꾩컮溜곕젽壤ル에掖⑤젙壓꾩컮溜곕젽壤ル에B 11100110100011101001011011100010100100011010010011101100101000001001100111100101101000111001001111101010101111101010100111101100101110111010111011101111101001111000101111101010101100111001010111101100101000001011110111100101101000111010010011100011100000111010101111101100100101111001000011100110100011101001011011100010100100011010010011101100101000001001100111100101101000111001001111101010101111101010100111101100101110111010111011101111101001111000101111101010101100111001010111101100101000001011110111100101101000111010010011100011100000111010101111101100100101111001000001000010 e68e96e291a4eca099e5a393eabea9ecbbaeefa78beab395eca0bde5a3a4e383abec9790e68e96e291a4eca099e5a393eabea9ecbbaeefa78beab395eca0bde5a3a4e383abec979042
UHC 掖⑤젙壓꾩컮溜곕젽壤ル에掖⑤젙壓꾩컮溜곕젽壤ル에B 11100100111110101010100011101011101000001001010111100100111000101000010011101100101100001001010011101010111111101011000011101011101000001010111111100101101111011010101111101011101111111010000111100100111110101010100011101011101000001001010111100100111000101000010011101100101100001001010011101010111111101011000011101011101000001010111111100101101111011010101111101011101111111010000101000010 e4faa8eba095e4e284ecb094eafeb0eba0afe5bdabebbfa1e4faa8eba095e4e284ecb094eafeb0eba0afe5bdabebbfa142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)