To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????i???????????iB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN セュ竺セュ式セュ竺爾捨iセュ竺セュ式セュ竺爾捨iB 1011111010101101100011101011000110111110101011011000111010101110101111101010110110001110101100011000111010100010100011101100110001101001101111101010110110001110101100011011111010101101100011101010111010111110101011011000111010110001100011101010001010001110110011000110100101000010 bead8eb1bead8eaebead8eb18ea28ecc69bead8eb1bead8eaebead8eb18ea28ecc6942
EUC-JP セュ竺セュ式セュ竺爾捨iセュ竺セュ式セュ竺爾捨iB 1000111010111110100011101010110110111100101100111000111010111110100011101010110110111100101100001000111010111110100011101010110110111100101100111011110010100100101111001100111001101001100011101011111010001110101011011011110010110011100011101011111010001110101011011011110010110000100011101011111010001110101011011011110010110011101111001010010010111100110011100110100101000010 8ebe8eadbcb38ebe8eadbcb08ebe8eadbcb3bca4bcce698ebe8eadbcb38ebe8eadbcb08ebe8eadbcb3bca4bcce6942
UTF-8 セュ竺セュ式セュ竺爾捨iセュ竺セュ式セュ竺爾捨iB 111011111011110110111110111011111011110110101101111001111010101110111010111011111011110110111110111011111011110110101101111001011011110010001111111011111011110110111110111011111011110110101101111001111010101110111010111001111000100010111110111001101000110110101000011010011110111110111101101111101110111110111101101011011110011110101011101110101110111110111101101111101110111110111101101011011110010110111100100011111110111110111101101111101110111110111101101011011110011110101011101110101110011110001000101111101110011010001101101010000110100101000010 efbdbeefbdade7abbaefbdbeefbdade5bc8fefbdbeefbdade7abbae788bee68da869efbdbeefbdade7abbaefbdbeefbdade5bc8fefbdbeefbdade7abbae788bee68da86942
UHC ??竺??式??竺爾捨i??竺??式??竺爾捨iB 0011111100111111111101011110011100111111001111111110001111010010001111110011111111110101111001111110110010110011110111101101011101101001001111110011111111110101111001110011111100111111111000111101001000111111001111111111010111100111111011001011001111011110110101110110100101000010 3f3ff5e73f3fe3d23f3ff5e7ecb3ded7693f3ff5e73f3fe3d23f3ff5e7ecb3ded76942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)