To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????d}???????????d{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101100100011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011001000111101101011110 3f3f3f3f3f3f3f3f3f3f3f647d3f3f3f3f3f3f3f3f3f3f3f647b5e
SJIS-WIN 賊朧?楮?┳樗??杵?d}賊朧?楮?┳樗??杵?d{^ 100100011010111110011110010011110011111110011110101110000011111110000100101100011001001010010100001111110011111110001011011011100011111101100100011111011001000110101111100111100100111100111111100111101011100000111111100001001011000110010010100101000011111100111111100010110110111000111111011001000111101101011110 91af9e4f3f9eb83f84b192943f3f8b6e3f647d91af9e4f3f9eb83f84b192943f3f8b6e3f647b5e
EUC-JP 賊朧?楮?┳樗??杵?d}賊朧?楮?┳樗??杵?d{^ 110000101011000111011011101100000011111111011100101110100011111110101000101100111100001111110100001111110011111110110101110011110011111101100100011111011100001010110001110110111011000000111111110111001011101000111111101010001011001111000011111101000011111100111111101101011100111100111111011001000111101101011110 c2b1dbb03fdcba3fa8b3c3f43f3fb5cf3f647dc2b1dbb03fdcba3fa8b3c3f43f3fb5cf3f647b5e
UTF-8 賊朧떫楮잴┳樗흘깰杵렪d}賊朧떫楮잴┳樗흘깰杵렪d{^ 1110100010110011100010101110011010011100101001111110101110010110101010111110011010100101101011101110110010011110101101001110001010010100101100111110011010101000100101111110110110011101100110001110101010111001101100001110011010011101101101011110101110100000101010100110010001111101111010001011001110001010111001101001110010100111111010111001011010101011111001101010010110101110111011001001111010110100111000101001010010110011111001101010100010010111111011011001110110011000111010101011100110110000111001101001110110110101111010111010000010101010011001000111101101011110 e8b38ae69ca7eb96abe6a5aeec9eb4e294b3e6a897ed9d98eab9b0e69db5eba0aa647de8b38ae69ca7eb96abe6a5aeec9eb4e294b3e6a897ed9d98eab9b0e69db5eba0aa647b5e
UHC 賊朧떫楮잴┳樗흘깰杵렪d}賊朧떫楮잴┳樗흘깰杵렪d{^ 11101110111001001101011011101000101101101011010111101110101111111100000011101010101001101011001111101110110000001100100011101010101100011111110111101110101111101000111010111000011001000111110111101110111001001101011011101000101101101011010111101110101111111100000011101010101001101011001111101110110000001100100011101010101100011111110111101110101111101000111010111000011001000111101101011110 eee4d6e8b6b5eebfc0eaa6b3eec0c8eab1fdeebe8eb8647deee4d6e8b6b5eebfc0eaa6b3eec0c8eab1fdeebe8eb8647b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)