To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???淫??蔭⑥??η????淫??蔭⑥??η?^ 001111110011111100111111100010001111101000111111001111111000100011111100100001110100010100111111001111111000001111000101001111110011111100111111001111111000100011111010001111110011111110001000111111001000011101000101001111110011111110000011110001010011111101011110 3f3f3f88fa3f3f88fc87453f3f83c53f3f3f3f88fa3f3f88fc87453f3f83c53f5e
EUC-JP ???淫??蔭???ηđ???淫??蔭???η?^ 001111110011111100111111101100001111110000111111001111111011000011111110001111110011111100111111101001101100011110001111101010011100001000111111001111110011111110110000111111000011111100111111101100001111111000111111001111110011111110100110110001110011111101011110 3f3f3fb0fc3f3fb0fe3f3f3fa6c78fa9c23f3f3fb0fc3f3fb0fe3f3f3fa6c73f5e
UTF-8 溜깅젡淫㏃꺎蔭⑥븥琉ηđ溜깅젡淫㏃꺎蔭⑥븥琉η뙄^ 11101111101001111000101111101010101110011000010111101100101000001010000111100110101101111010101111100011100011111000001111101010101110101000111011101000100101001010110111100010100100011010010111101011101110001010010111101111101001111000110011001110101101111100010010010001111011111010011110001011111010101011100110000101111011001010000010100001111001101011011110101011111000111000111110000011111010101011101010001110111010001001010010101101111000101001000110100101111010111011100010100101111011111010011110001100110011101011011111101011100110011000010001011110 efa78beab985eca0a1e6b7abe38f83eaba8ee894ade291a5ebb8a5efa78cceb7c491efa78beab985eca0a1e6b7abe38f83eaba8ee894ade291a5ebb8a5efa78cceb7eb99845e
UHC 溜깅젡淫㏃꺎蔭⑥븥琉ηđ溜깅젡淫㏃꺎蔭⑥븥琉η뙄^ 11101010111111101011000111101011101000001001101011101011111000101010011111101100100000111011010011101011111000111010100011101100100101011000111011101011101001001010010111100111101010011010001011101010111111101011000111101011101000001001101011101011111000101010011111101100100000111011010011101011111000111010100011101100100101011000111011101011101001001010010111100111100011001000101001011110 eafeb1eba09aebe2a7ec83b4ebe3a8ec958eeba4a5e7a9a2eafeb1eba09aebe2a7ec83b4ebe3a8ec958eeba4a5e78c8a5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
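
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. It assumes Python's standard codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) and it assumes that characters a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table. The names `CHARSETS` and `encode_to_bits` are made up for illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f);
    this replacement policy is an assumption, not confirmed by the page.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ^"
    for label, codec in CHARSETS.items():
        binary, hexa = encode_to_bits(sample, codec)
        print(label, sample, binary, hexa)
```

For example, `encode_to_bits("^", "utf-8")` returns `("01011110", "5e")`, the same trailing byte that ends every row of the table, because `^` is an ASCII character and all five charsets encode it identically.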