To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 螂ェ蛛エ蟆願ェー驕懈純遶ェ謐牙援隱ー驕懈純B 11100101101001011010101011100101100000011011010011100101101100001000101011101000101010101011000011101001100000011001110011100110100011111000001111100111101010111010101011100110100011011000100111100101100010011000011111101000101010101011000011101001100000011001110011100110100011111000001101000010 e5a5aae581b4e5b08ae8aab0e9819ce68f83e7abaae68d89e58987e8aab0e9819ce68f8342
EUC-JP 螂ェ蛛エ蟆願ェー驕懈純遶ェ謐牙援隱ー驕懈純B 11101010101001111000111010101010111010011110000110001110101101001110101010110010101101001110101010001110101010101000111010110000111100011110000111011000111010001011110111100011111011101010110110001110101010101110101111101101101100101110011110110001111001111111000010101100100011101011000011110001111000011101100011101000101111011110001101000010 eaa78eaae9e18eb4eab2b4ea8eaa8eb0f1e1d8e8bde3eead8eaaebedb2e7b1e7f0ac8eb0f1e1d8e8bde342
UTF-8 螂ェ蛛エ蟆願ェー驕懈純遶ェ謐牙援隱ー驕懈純B 11101000100111101000001011101111101111011010101011101000100110111001101111101111101111011011010011101000100111111000011011101001101000011001100011101111101111011010101011101111101111011011000011101001101010011001010111100110100001111000100011100111101101001001010011101001100000011011011011101111101111011010101011101000101011001001000011100111100010011001100111100110100011111011010011101001100110101011000111101111101111011011000011101001101010011001010111100110100001111000100011100111101101001001010001000010 e89e82efbdaae89b9befbdb4e89f86e9a198efbdaaefbdb0e9a995e68788e7b494e981b6efbdaae8ac90e78999e68fb4e99ab1efbdb0e9a995e68788e7b49442
UHC 螂?蛛??願??驕懈純??謐牙援隱?驕懈純B 1101010111001100001111111111000111001000001111110011111111101010110000110011111100111111110011101111011011111010101010111110001011101101001111110011111111011010110011011110010010110011111010101011010111101011110111110011111111001110111101101111101010101011111000101110110101000010 d5cc3ff1c83f3feac33f3fcef6faabe2ed3f3fdacde4b3eab5ebdf3fcef6faabe2ed42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)