What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蹄蓬繆辛橋?禎?n}蹄蓬繆辛橋?禎?n{^ 100100101111101110010110010010001110001101111000100100000110100010001011101101000011111110010010111101010011111101101110011111011001001011111011100101100100100011100011011110001001000001101000100010111011010000111111100100101111010100111111011011100111101101011110 92fb9648e37890688bb43f92f53f6e7d92fb9648e37890688bb43f92f53f6e7b5e
EUC-JP 蹄蓬繆辛橋?禎?n}蹄蓬繆辛橋?禎?n{^ 110001001111110111001011101010011110010111011001101111111100100110110110101101100011111111000100111101110011111101101110011111011100010011111101110010111010100111100101110110011011111111001001101101101011011000111111110001001111011100111111011011100111101101011110 c4fdcba9e5d9bfc9b6b63fc4f73f6e7dc4fdcba9e5d9bfc9b6b63fc4f73f6e7b5e
UTF-8 蹄蓬繆辛橋ㄳ禎렔n}蹄蓬繆辛橋ㄳ禎렔n{^ 1110100010111001100001001110100010010011101011001110011110111001100001101110100010111110100110111110011010101001100010111110001110000100101100111110011110100110100011101110101110100000100101000110111001111101111010001011100110000100111010001001001110101100111001111011100110000110111010001011111010011011111001101010100110001011111000111000010010110011111001111010011010001110111010111010000010010100011011100111101101011110 e8b984e893ace7b986e8be9be6a98be384b3e7a68eeba0946e7de8b984e893ace7b986e8be9be6a98be384b3e7a68eeba0946e7b5e
UHC 蹄蓬繆辛橋ㄳ禎렔n}蹄蓬繆辛橋ㄳ禎렔n{^ 11110000101101001101110011101111110110011111000011100011111101001100111011101001101001001010001111101111111011101000111010101001011011100111110111110000101101001101110011101111110110011111000011100011111101001100111011101001101001001010001111101111111011101000111010101001011011100111101101011110 f0b4dcefd9f0e3f4cee9a4a3efee8ea96e7df0b4dcefd9f0e3f4cee9a4a3efee8ea96e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
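
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_to_bits` is hypothetical, and the codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the ISO-8859-1 row does.

```python
def encode_to_bits(text, charset):
    """Encode text in the given charset; return (binary string, hex string).

    Unencodable characters are replaced with '?' (0x3f) rather than
    raising an error, so every charset yields some output.
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Table label -> Python codec name (assumed equivalents, see note above).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "n}"  # ASCII-only sample; encodes identically in every charset here
    for label, codec in CHARSETS.items():
        binary, hex_string = encode_to_bits(sample, codec)
        print(label, sample, binary, hex_string)
```

For example, `encode_to_bits("n}", "iso-8859-1")` yields the hexadecimal string `6e7d`, matching the `6e7d` bytes visible in every row of the table. Multi-byte results for the legacy charsets may differ slightly between implementations, since vendor mapping tables are not identical.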