Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セュ竺爾捨セ、承ンセュ竺爾捨セ、承ン^ 101111101010110110001110101100011000111010100010100011101100110010111110101001001000111110110011110111011011111010101101100011101011000110001110101000101000111011001100101111101010010010001111101100111101110101011110 bead8eb18ea28eccbea48fb3ddbead8eb18ea28eccbea48fb3dd5e
EUC-JP セュ竺爾捨セ、承ンセュ竺爾捨セ、承ン^ 10001110101111101000111010101101101111001011001110111100101001001011110011001110100011101011111010001110101001001011111010110101100011101101110110001110101111101000111010101101101111001011001110111100101001001011110011001110100011101011111010001110101001001011111010110101100011101101110101011110 8ebe8eadbcb3bca4bcce8ebe8ea4beb58edd8ebe8eadbcb3bca4bcce8ebe8ea4beb58edd5e
UTF-8 セュ竺爾捨セ、承ンセュ竺爾捨セ、承ン^ 11101111101111011011111011101111101111011010110111100111101010111011101011100111100010001011111011100110100011011010100011101111101111011011111011101111101111011010010011100110100010011011111111101111101111101001110111101111101111011011111011101111101111011010110111100111101010111011101011100111100010001011111011100110100011011010100011101111101111011011111011101111101111011010010011100110100010011011111111101111101111101001110101011110 efbdbeefbdade7abbae788bee68da8efbdbeefbda4e689bfefbe9defbdbeefbdade7abbae788bee68da8efbdbeefbda4e689bfefbe9d5e
UHC ??竺爾捨??承???竺爾捨??承?^ 001111110011111111110101111001111110110010110011110111101101011100111111001111111110001110101111001111110011111100111111111101011110011111101100101100111101111011010111001111110011111111100011101011110011111101011110 3f3ff5e7ecb3ded73f3fe3af3f3f3ff5e7ecb3ded73f3fe3af3f5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
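
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the codec mapping are assumptions (in particular, Python's `cp932` stands in for SJIS-WIN and `cp949` for UHC, and their mapping tables may differ from this tool's for a few characters). Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: cp932 approximates SJIS-WIN
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: cp949 approximates UHC
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, decoded text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("あ^"):
        print(label, shown, bits, hexstr)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.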