What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遭???沚基??傳???遭???沚基??傳???B 100100011001100000111111001111110011111110011111100011011000101011101110001111110011111110011001010000100011111100111111001111111001000110011000001111110011111100111111100111111000110110001010111011100011111100111111100110010100001000111111001111110011111101000010 91983f3f3f9f8d8aee3f3f99423f3f3f91983f3f3f9f8d8aee3f3f99423f3f3f42
EUC-JP 遭???沚基??傳???遭???沚基??傳???B 110000011111100000111111001111110011111111011101111011011011010011110000001111110011111111010001101000110011111100111111001111111100000111111000001111110011111100111111110111011110110110110100111100000011111100111111110100011010001100111111001111110011111101000010 c1f83f3f3fddedb4f03f3fd1a33f3f3fc1f83f3f3fddedb4f03f3fd1a33f3f3f42
UTF-8 遭ㆁ렰렒沚基렰렓傳뀜렰렟遭ㆁ렰렒沚基렰렓傳뀜렰렟B 11101001100000011010110111100011100001101000000111101011101000001011000011101011101000001001001011100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001001111100101100000101011001111101011100000001001110011101011101000001011000011101011101000001001111111101001100000011010110111100011100001101000000111101011101000001011000011101011101000001001001011100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001001111100101100000101011001111101011100000001001110011101011101000001011000011101011101000001001111101000010 e981ade38681eba0b0eba092e6b29ae59fbaeba0b0eba093e582b3eb809ceba0b0eba09fe981ade38681eba0b0eba092e6b29ae59fbaeba0b0eba093e582b3eb809ceba0b0eba09f42
UHC 遭ㆁ렰렒沚基렰렓傳뀜렰렟遭ㆁ렰렒沚基렰렓傳뀜렰렟B 11110000111001001010010011110001100011101011110110001110101001111111001010101111110100001111000110001110101111011000111010101000111011101110111010110010111100011000111010111101100011101011000011110000111001001010010011110001100011101011110110001110101001111111001010101111110100001111000110001110101111011000111010101000111011101110111010110010111100011000111010111101100011101011000001000010 f0e4a4f18ebd8ea7f2afd0f18ebd8ea8eeeeb2f18ebd8eb0f0e4a4f18ebd8ea7f2afd0f18ebd8ea8eeeeb2f18ebd8eb042

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f) in the table above.
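
The conversion shown in the table can be roughly reproduced with a short Python sketch. This is not the converter's actual implementation. The codec names in CODECS are assumptions: Python's cp932 and cp949 are used as stand-ins for SJIS-WIN and UHC, and euc_jp for EUC-JP, so edge-case mappings may differ from the page. The helper names (encode_bytes, to_hex, to_bits) are hypothetical. Unmappable characters are replaced with "?", one per character, which may not match how the page counts replacements.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# cp932 and cp949 are approximations of SJIS-WIN and UHC, not exact matches.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bytes(text, codec):
    """Encode text, substituting '?' for characters the codec cannot represent."""
    return text.encode(codec, errors="replace")


def to_hex(text, codec):
    """Return the encoded bytes as a lowercase hexadecimal string."""
    return encode_bytes(text, codec).hex()


def to_bits(text, codec):
    """Return the encoded bytes as a binary string, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in encode_bytes(text, codec))


if __name__ == "__main__":
    sample = "B"  # any short string can be substituted here
    for label, codec in CODECS.items():
        print(label, to_bits(sample, codec), to_hex(sample, codec))
```

Calling to_hex and to_bits with a different string and codec name gives the two right-hand columns of one table row.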