To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 裁?址??莎???伯裁?址??莎???伯B 1000110111011001001111111001101010101100001111110011111111100100101100110011111100111111001111111001010010001100100011011101100100111111100110101010110000111111001111111110010010110011001111110011111100111111100101001000110001000010 8dd93f9aac3f3fe4b33f3f3f948c8dd93f9aac3f3fe4b33f3f3f948c42
EUC-JP 裁?址??莎???伯裁?址??莎???伯B 1011101011011011001111111101010010101110001111110011111111101000101101010011111100111111001111111100011111101100101110101101101100111111110101001010111000111111001111111110100010110101001111110011111100111111110001111110110001000010 badb3fd4ae3f3fe8b53f3f3fc7ecbadb3fd4ae3f3fe8b53f3f3fc7ec42
UTF-8 裁렗址쇘렯莎렊잴렕伯裁렗址쇘렯莎렊잴렕伯B 11101000101000111000000111101011101000001001011111100101100111011000000011101100100001111001100011101011101000001010111111101000100011101000111011101011101000001000101011101100100111101011010011101011101000001001010111100100101111001010111111101000101000111000000111101011101000001001011111100101100111011000000011101100100001111001100011101011101000001010111111101000100011101000111011101011101000001000101011101100100111101011010011101011101000001001010111100100101111001010111101000010 e8a381eba097e59d80ec8798eba0afe88e8eeba08aec9eb4eba095e4bcafe8a381eba097e59d80ec8798eba0afe88e8eeba08aec9eb4eba095e4bcaf42
UHC 裁렗址쇘렯莎렊잴렕伯裁렗址쇘렯莎렊잴렕伯B 1110111010101110100011101010110011110010101000111011110011100111100011101011110011011110111011011000111010100001110000001110101010001110101010101101101111010111111011101010111010001110101011001111001010100011101111001110011110001110101111001101111011101101100011101010000111000000111010101000111010101010110110111101011101000010 eeae8eacf2a3bce78ebcdeed8ea1c0ea8eaadbd7eeae8eacf2a3bce78ebcdeed8ea1c0ea8eaadbd742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)