To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 陷抵ス・郢橸スキ螂」霑手岷隴ス郢冶アャ螂」鮓戎 11101000100111001001001011101111101111011010010111100111101110011001111011101111101111011011011111100101101001011010001111101000101111111000111011101000100110111010111111101000101011011011110111100111101110011001011011101000101100011010110011100101101001011010001111101001101101101000111101011110 e89c92efbda5e7b99eefbdb7e5a5a3e8bf8ee89bafe8adbde7b996e8b1ace5a5a3e9b68f5e
EUC-JP 陷抵ス・郢橸スキ螂」霑手岷隴ス郢冶アャ螂」鮓戎 11101111111111001100010011110001100011101011110110001110101001011110111010111011110111001111000110001110101111011000111010110111111010101010011110001110101000111111000011000001101111001110101011010110101100011111000010101111100011101011110111101110101110111100110011101010100011101011000110001110101011001110101010100111100011101010001111110010101110001011110110111111 effcc4f18ebd8ea5eebbdcf18ebd8eb7eaa78ea3f0c1bcead6b1f0af8ebdeebbccea8eb18eaceaa78ea3f2b8bdbf
UTF-8 陷抵ス・郢橸スキ螂」霑手岷隴ス郢冶アャ螂」鮓戎 111010011001100110110111111001101000101010110101111011111011110110111101111011111011110110100101111010011000001110100010111001101010100110111000111011111011110110111101111011111011110110110111111010001001111010000010111011111011110110100011111010011001110010010001111001101000100110001011111001011011001010110111111010011001101010110100111011111011110110111101111010011000001110100010111001011000011010110110111011111011110110110001111011111011110110101100111010001001111010000010111011111011110110100011111010011010111010010011111001101000100010001110 e999b7e68ab5efbdbdefbda5e983a2e6a9b8efbdbdefbdb7e89e82efbda3e99c91e6898be5b2b7e99ab4efbdbde983a2e586b6efbdb1efbdace89e82efbda3e9ae93e6888e
UHC 陷抵??????螂?霑手岷???冶??螂??戎 1111100111101000111011101011110100111111001111110011111100111111001111110011111111010101110011000011111111101111110001011110001010100010110110101011111000111111001111110011111111100101101001110011111100111111110101011100110000111111001111111110101111010100 f9e8eebd3f3f3f3f3f3fd5cc3fefc5e2a2dabe3f3f3fe5a73f3fd5cc3f3febd4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)