What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 蘖??旬??矣?&D蘖??旬??矣?&D^ 1001111101010000001111110011111110001111011110110011111100111111111000011110000100111111100000011001010101000100100111110101000000111111001111111000111101111011001111110011111111100001111000010011111110000001100101010100010001011110 9f503f3f8f7b3f3fe1e13f8195449f503f3f8f7b3f3fe1e13f8195445e
EUC-JP 蘖??旬??矣?&D蘖??旬??矣?&D^ 1101110110110001001111110011111110111101110111000011111100111111111000101110001100111111101000011111010101000100110111011011000100111111001111111011110111011100001111110011111111100010111000110011111110100001111101010100010001011110 ddb13f3fbddc3f3fe2e33fa1f544ddb13f3fbddc3f3fe2e33fa1f5445e
UTF-8 蘖뽮퉭旬녔뇻矣⑸&D蘖뽮퉭旬녔뇻矣⑸&D^ 111010001001100010010110111010111011110110101110111011011000100110101101111001101001011110101100111010111000010110010100111010111000011110111011111001111001111110100011111000101001000110111000111011111011110010000110010001001110100010011000100101101110101110111101101011101110110110001001101011011110011010010111101011001110101110000101100101001110101110000111101110111110011110011111101000111110001010010001101110001110111110111100100001100100010001011110 e89896ebbdaeed89ade697aceb8594eb87bbe79fa3e291b8efbc8644e89896ebbdaeed89ade697aceb8594eb87bbe79fa3e291b8efbc86445e
UHC 蘖뽮퉭旬녔뇻矣⑸&D蘖뽮퉭旬녔뇻矣⑸&D^ 111001011110111010010110111010101011100110000101111000101110001010110011111001101011010010100111111010111111100010101001111010111010001110100110010001001110010111101110100101101110101010111001100001011110001011100010101100111110011010110100101001111110101111111000101010011110101110100011101001100100010001011110 e5ee96eab985e2e2b3e6b4a7ebf8a9eba3a644e5ee96eab985e2e2b3e6b4a7ebf8a9eba3a6445e
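The rows above can be approximated offline with a short script. The following is a minimal sketch, not the code behind this page: the function name `encode_table` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codec tables may differ from the converter's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the codec lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("D^"):
        print(label, bits, hexa)
```

Pure ASCII input such as `D^` yields the same bytes (`445e`) in all five charsets, which matches the tail of every row in the table.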

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).