To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?莎??彦?指??◇?莎??彦?指??●^ 0011111111100100101100110011111100111111100101010100011000111111100011100111011100111111001111111000000110011110001111111110010010110011001111110011111110010101010001100011111110001110011101110011111100111111100000011001110001011110 3fe4b33f3f95463f8e773f3f819e3fe4b33f3f95463f8e773f3f819c5e
EUC-JP 蔣莎??彦?指??◇蔣莎??彦?指??●^ 100011111101100110110110111010001011010100111111001111111100100110100111001111111011101111011000001111110011111110100001111111101000111111011001101101101110100010110101001111110011111111001001101001110011111110111011110110000011111100111111101000011111110001011110 8fd9b6e8b53f3fc9a73fbbd83f3fa1fe8fd9b6e8b53f3fc9a73fbbd83f3fa1fc5e
UTF-8 蔣莎렍렊彦렗指편렮◇蔣莎렍렊彦렗指편렮●^ 11101000100101001010001111101000100011101000111011101011101000001000110111101011101000001000101011100101101111011010011011101011101000001001011111100110100011001000011111101101100011101011100011101011101000001010111011100010100101111000011111101000100101001010001111101000100011101000111011101011101000001000110111101011101000001000101011100101101111011010011011101011101000001001011111100110100011001000011111101101100011101011100011101011101000001010111011100010100101111000111101011110 e894a3e88e8eeba08deba08ae5bda6eba097e68c87ed8eb8eba0aee29787e894a3e88e8eeba08deba08ae5bda6eba097e68c87ed8eb8eba0aee2978f5e
UHC 蔣莎렍렊彦렗指편렮◇蔣莎렍렊彦렗指편렮●^ 1110110111111000110111101110110110001110101000111000111010100001111001011110100110001110101011001111001010100110110001101110110110001110101110111010000111011110111011011111100011011110111011011000111010100011100011101010000111100101111010011000111010101100111100101010011011000110111011011000111010111011101000011101110001011110 edf8deed8ea38ea1e5e98eacf2a6c6ed8ebba1deedf8deed8ea38ea1e5e98eacf2a6c6ed8ebba1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)