To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????~?????????B 0011111100111111001111110011111100111111001111110011111100111111001111110111111000111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??????μ?~筌??????μ?B 111000101010001100111111001111110011111100111111001111110011111110000011110010100011111101111110111000101010001100111111001111110011111100111111001111110011111110000011110010100011111101000010 e2a33f3f3f3f3f3f83ca3f7ee2a33f3f3f3f3f3f83ca3f42
EUC-JP 筌??飡??洹μ?~筌??彛??洹μ?B 1110010010100101001111110011111110001111111010001100100000111111001111111000111111000111101110101010011011001100001111110111111011100100101001010011111100111111100011111011110011111010001111110011111110001111110001111011101010100110110011000011111101000010 e4a53f3f8fe8c83f3f8fc7baa6cc3f7ee4a53f3f8fbcfa3f3f8fc7baa6cc3f42
UTF-8 筌듦램飡볩㏊洹μ췅~筌듦램彛좑㏊洹μ컣B 111001111010110110001100111010111001001110100110111010111001111010101000111010011010001110100001111010111011001110101001111000111000111110001010111001101011010010111001110011101011110011101100101101111000010101111110111001111010110110001100111010111001001110100110111010111001111010101000111001011011110110011011111011001010001010010001111000111000111110001010111001101011010010111001110011101011110011101100101110111010001101000010 e7ad8ceb93a6eb9ea8e9a3a1ebb3a9e38f8ae6b4b9cebcecb7857ee7ad8ceb93a6eb9ea8e5bd9beca291e38f8ae6b4b9cebcecbba342
UHC 筌듦램飡볩㏊洹μ췅~筌듦램彛좑㏊洹μ컣B 1110111110100111101101011110101010110111101001011110000111100010100100111110111110100111101101011110101010110111101001011110110010101101101000000111111011101111101001111011010111101010101101111010010111101100101011011010000011101111101001111011010111101010101101111010010111101100101100001000111001000010 efa7b5eab7a5e1e293efa7b5eab7a5ecada07eefa7b5eab7a5ecada0efa7b5eab7a5ecb08e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)