To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D 0011111100111111001111110011111100111111001111110011111100111111001111110100010000111111001111110011111100111111001111110011111100111111001111110011111101000100 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f44
SJIS-WIN 庄??奉∝而冪??D庄??奉∝而冪??D 100011111010111100111111001111111001010111110010100000011110010110001110101001111001100101110000001111110011111101000100100011111010111100111111001111111001010111110010100000011110010110001110101001111001100101110000001111110011111101000100 8faf3f3f95f281e58ea799703f3f448faf3f3f95f281e58ea799703f3f44
EUC-JP 庄??奉∝而冪??D庄??奉∝而冪??D 101111101011000100111111001111111100101011110100101000101110011110111100101010011101000111010001001111110011111101000100101111101011000100111111001111111100101011110100101000101110011110111100101010011101000111010001001111110011111101000100 beb13f3fcaf4a2e7bca9d1d13f3f44beb13f3fcaf4a2e7bca9d1d13f3f44
UTF-8 庄얏렫奉∝而冪렰렓D庄얏렫奉∝而冪렰렓D 1110010110111010100001001110110010010110100011111110101110100000101010111110010110100101100010011110001010001000100111011110100010000000100011001110010110000110101010101110101110100000101100001110101110100000100100110100010011100101101110101000010011101100100101101000111111101011101000001010101111100101101001011000100111100010100010001001110111101000100000001000110011100101100001101010101011101011101000001011000011101011101000001001001101000100 e5ba84ec968feba0abe5a589e2889de8808ce586aaeba0b0eba09344e5ba84ec968feba0abe5a589e2889de8808ce586aaeba0b0eba09344
UHC 庄얏렫奉∝而冪렰렓D庄얏렫奉∝而冪렰렓D 1110110111100100101111101110011010001110101110011101110011100101101000011111000011101100101110111101100011110001100011101011110110001110101010000100010011101101111001001011111011100110100011101011100111011100111001011010000111110000111011001011101111011000111100011000111010111101100011101010100001000100 ede4bee68eb9dce5a1f0ecbbd8f18ebd8ea844ede4bee68eb9dce5a1f0ecbbd8f18ebd8ea844

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)