To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 軟??譯??筌λ?D軟??譯??筌λ?D^ 1001001111101110001111110011111111100110101000010011111100111111111000101010001110000011110010010011111101000100100100111110111000111111001111111110011010100001001111110011111111100010101000111000001111001001001111110100010001011110 93ee3f3fe6a13f3fe2a383c93f4493ee3f3fe6a13f3fe2a383c93f445e
EUC-JP 軟??譯??筌λ?D軟??譯??筌λ?D^ 1100011011110000001111110011111111101100101000110011111100111111111001001010010110100110110010110011111101000100110001101111000000111111001111111110110010100011001111110011111111100100101001011010011011001011001111110100010001011110 c6f03f3feca33f3fe4a5a6cb3f44c6f03f3feca33f3fe4a5a6cb3f445e
UTF-8 軟쒕젍譯볥젘筌λ졃D軟쒕젍譯볥젘筌λ졃D^ 11101000101110111001111111101100100100101001010111101100101000001000110111101000101011011010111111101011101100111010010111101100101000001001100011100111101011011000110011001110101110111110110010100001100000110100010011101000101110111001111111101100100100101001010111101100101000001000110111101000101011011010111111101011101100111010010111101100101000001001100011100111101011011000110011001110101110111110110010100001100000110100010001011110 e8bb9fec9295eca08de8adafebb3a5eca098e7ad8ccebbeca18344e8bb9fec9295eca08de8adafebb3a5eca098e7ad8ccebbeca183445e
UHC 軟쒕젍譯볥젘筌λ졃D軟쒕젍譯볥젘筌λ졃D^ 111001101110001110011100111010111010000010001110111001101011101110010011111010111010000010010100111011111010011110100101111010111010000010110100010001001110011011100011100111001110101110100000100011101110011010111011100100111110101110100000100101001110111110100111101001011110101110100000101101000100010001011110 e6e39ceba08ee6bb93eba094efa7a5eba0b444e6e39ceba08ee6bb93eba094efa7a5eba0b4445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)