To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮?ぜ淫??汚?????柔レ?沃 0011111100111111001111111110100001001010001111111000001010111010100010001111101000111111001111111000100110011000001111110011111100111111001111110011111110001111010111111000001110001100001111111001011110000000 3f3f3fe84a3f82ba88fa3f3f89983f3f3f3f3f8f5f838c3f9780
EUC-JP ???鍮?ぜ淫??汚??堉??柔レ?沃 00111111001111110011111111101111101010110011111110100100101111001011000011111100001111110011111110110001111110000011111100111111100011111011011111111101001111110011111110111101110000001010010111101100001111111100110111100000 3f3f3fefab3fa4bcb0fc3f3fb1f83f3f8fb7fd3f3fbdc0a5ec3fcde0
UTF-8 捻꿸낯鍮뽬ぜ淫딆땡汚살늾堉ㅷ춯柔レ뵯沃 111011111010011010100100111010101011111110111000111010111000001010101111111010011000110110101110111010111011110110101100111000111000000110011100111001101011011110101011111010111001010010000110111010111001010110100001111001101011000110011010111011001000001010110100111010111000101010111110111001011010000010001001111000111000010110110111111011001011011010101111111001101001111110010100111000111000001110101100111010111011010110101111111001101011001010000011 efa6a4eabfb8eb82afe98daeebbdace3819ce6b7abeb9486eb95a1e6b19aec82b4eb8abee5a089e385b7ecb6afe69f94e383acebb5afe6b283
UHC 捻꿸낯鍮뽬ぜ淫딆땡汚살늾堉ㅷ춯柔レ뵯沃 1110011011110111101100101110101010110011101110001110101110111001100101101110100010101010101111001110101111100010100010101110110010110110101011111110011111111101101110111110110010001000100001111110101110111100101001001110011110101101100011001110101011110101101010111110110010010100101011011110100010101010 e6f7b2eab3b8ebb996e8aabcebe28aecb6afe7fdbbec8887ebbca4e7ad8ceaf5abec94ade8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)