To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????C?????????U 0011111100111111001111110011111100111111001111110011111100111111001111110100001100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f55
SJIS-WIN 庄??奉∝而冪??C庄??奉∝而冪??U 100011111010111100111111001111111001010111110010100000011110010110001110101001111001100101110000001111110011111101000011100011111010111100111111001111111001010111110010100000011110010110001110101001111001100101110000001111110011111101010101 8faf3f3f95f281e58ea799703f3f438faf3f3f95f281e58ea799703f3f55
EUC-JP 庄??奉∝而冪??C庄??奉∝而冪??U 101111101011000100111111001111111100101011110100101000101110011110111100101010011101000111010001001111110011111101000011101111101011000100111111001111111100101011110100101000101110011110111100101010011101000111010001001111110011111101010101 beb13f3fcaf4a2e7bca9d1d13f3f43beb13f3fcaf4a2e7bca9d1d13f3f55
UTF-8 庄얏렫奉∝而冪렰렓C庄얏렫奉∝而冪렰렓U 1110010110111010100001001110110010010110100011111110101110100000101010111110010110100101100010011110001010001000100111011110100010000000100011001110010110000110101010101110101110100000101100001110101110100000100100110100001111100101101110101000010011101100100101101000111111101011101000001010101111100101101001011000100111100010100010001001110111101000100000001000110011100101100001101010101011101011101000001011000011101011101000001001001101010101 e5ba84ec968feba0abe5a589e2889de8808ce586aaeba0b0eba09343e5ba84ec968feba0abe5a589e2889de8808ce586aaeba0b0eba09355
UHC 庄얏렫奉∝而冪렰렓C庄얏렫奉∝而冪렰렓U 1110110111100100101111101110011010001110101110011101110011100101101000011111000011101100101110111101100011110001100011101011110110001110101010000100001111101101111001001011111011100110100011101011100111011100111001011010000111110000111011001011101111011000111100011000111010111101100011101010100001010101 ede4bee68eb9dce5a1f0ecbbd8f18ebd8ea843ede4bee68eb9dce5a1f0ecbbd8f18ebd8ea855

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)