To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 繹????濡??沃???????濡??嗚??^ 11100011100010000011111100111111001111110011111110010100010001110011111100111111100101111000000000111111001111110011111100111111001111110011111100111111100101000100011100111111001111111001101001101010001111110011111101011110 e3883f3f3f3f94473f3f97803f3f3f3f3f3f3f94473f3f9a6a3f3f5e
EUC-JP 繹????濡??沃???????濡??嗚??^ 11100101111010000011111100111111001111110011111111000111101010000011111100111111110011011110000000111111001111110011111100111111001111110011111100111111110001111010100000111111001111111101001111001011001111110011111101011110 e5e83f3f3f3fc7a83f3fcde03f3f3f3f3f3f3fc7a83f3fd3cb3f3f5e
UTF-8 繹먯옿留뢐濡딅젵沃뚨쭅溜⑹옿留뢐濡딅젵嗚몃젌^ 11100111101110011011100111101011101010001010111111101100100110001011111111101111101001111000110111101011101000101001000011100110101111111010000111101011100101001000010111101100101000001011010111100110101100101000001111101011100110101010100011101100101011011000010111101111101001111000101111100010100100011011100111101100100110001011111111101111101001111000110111101011101000101001000011100110101111111010000111101011100101001000010111101100101000001011010111100101100101111001101011101011101010101000001111101100101000001000110001011110 e7b9b9eba8afec98bfefa78deba290e6bfa1eb9485eca0b5e6b283eb9aa8ecad85efa78be291b9ec98bfefa78deba290e6bfa1eb9485eca0b5e5979aebaa83eca08c5e
UHC 繹먯옿留뢐濡딅젵沃뚨쭅溜⑹옿留뢐濡딅젵嗚몃젌^ 111001101011101010010000111011001001111010110100111010111010011110001111010010111110101110100001100010101110101110100000101010011110100010101010100011001110011110100111100000011110101011111110101010011110110010011110101101001110101110100111100011110100101111101011101000011000101011101011101000001010100111100111111100001011100011101011101000001000110101011110 e6ba90ec9eb4eba78f4beba18aeba0a9e8aa8ce7a781eafea9ec9eb4eba78f4beba18aeba0a9e7f0b8eba08d5e
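The same kind of table can be reproduced offline. The following is a minimal sketch, not the converter's actual implementation: the codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are assumptions about which Python codecs correspond to the labels above, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is how the runs of 3f in the table arise.

```python
# Map the table's charset labels to Python codec names.
# These pairings are assumptions; the original tool may use different tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("^").items():
        print(label, bits, hex_string)
```

For the ASCII character `^`, every charset yields the single byte 5e (binary 01011110), which matches the final byte of each row in the table above.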

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).