What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????矣??沃??而??循?????弛 001111110011111100111111001111110011111100111111111000011110000100111111001111111001011110000000001111110011111110001110101001110011111100111111100011110111101000111111001111110011111100111111001111111001001001101111 3f3f3f3f3f3fe1e13f3f97803f3f8ea73f3f8f7a3f3f3f3f3f926f
EUC-JP ???沅??矣??沃??而??循??獒??弛 00111111001111110011111110001111110001101110100100111111001111111110001011100011001111110011111111001101111000000011111100111111101111001010100100111111001111111011110111011011001111110011111110001111110010111011101100111111001111111100001111010000 3f3f3f8fc6e93f3fe2e33f3fcde03f3fbca93f3fbddb3f3f8fcbbb3f3fc3d0
UTF-8 蓮잙슣沅뉔걬矣멥럹沃섅끉而숁듉循뗪턃獒뺣맮弛 111011111010011010011001111011001001111010011001111011001000101010100011111001101011001010000101111010111000100110010100111010101011000110101100111001111001111110100011111010111010100110100101111010111001111110111001111001101011001010000011111011001000010010000101111010111000000110001001111010001000000010001100111011001000100010000001111010111001001110001001111001011011111010101010111010111001011110101010111011011000010010000011111001111000110110010010111010111011101010100011111010111010011110101110111001011011110010011011 efa699ec9e99ec8aa3e6b285eb8994eab1ace79fa3eba9a5eb9fb9e6b283ec8485eb8189e8808cec8881eb9389e5beaaeb97aaed8483e78d92ebbaa3eba7aee5bc9b
UHC 蓮잙슣沅뉔걬矣멥럹沃섅끉而숁듉循뗪턃獒뺣맮弛 1110011011100101100111111110101110011010101011111110101010110110100001111110100110000001100101011110101111111000101110001110001110001110100110001110100010101010100110001110001110000101101111001110110010111011100110011110011010001010101111001110001011100000100010111110101010110101100111111110100010100011100101011110101110010000101101011110110010101100 e6e59feb9aafeab687e98195ebf8b8e38e98e8aa98e385bcecbb99e68abce2e08beab59fe8a395eb90b5ecac

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
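
The kind of conversion shown in the table can be roughly reproduced with a short Python sketch. This is not the converter's own implementation, which is not shown on the page. The function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the codec choices (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions about which Python codecs correspond most closely to the charsets listed. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "矣"  # one of the characters that appears in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("矣", "utf-8")` returns the hexadecimal string `e79fa3`, the same byte sequence that appears for that character in the UTF-8 row.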