What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 枝葛?兪蝎???潘?竭愴陰蝎??毅?^ 1000111001111101100010101000101100111111100110010110000011100101100110010011111100111111001111111110000001001110001111111110001010010001100111001100011010001001010000011110010110011001001111110011111110001011010000100011111101011110 8e7d8a8b3f9960e5993f3f3fe04e3fe2919cc68941e5993f3f8b423f5e
EUC-JP 枝葛?兪蝎???潘?竭愴陰蝎??毅?^ 1011101111011110101100111110101100111111110100011100000111101001111110010011111100111111001111111101111110101111001111111110001111110001110110001100100010110001101000101110100111111001001111110011111110110101101000110011111101011110 bbdeb3eb3fd1c1e9f93f3f3fdfaf3fe3f1d8c8b1a2e9f93f3fb5a33f5e
UTF-8 枝葛료兪蝎렗렢溜潘렔竭愴陰蝎렗렢毅렰^ 11100110100111101001110111101000100100011001101111101011101000111000110011100101100001011010101011101000100111011000111011101011101000001001011111101011101000001010001011101111101001111000101111100110101111011001100011101011101000001001010011100111101010111010110111100110100001001011010011101001100110011011000011101000100111011000111011101011101000001001011111101011101000001010001011100110101011111000010111101011101000001011000001011110 e69e9de8919beba38ce585aae89d8eeba097eba0a2efa78be6bd98eba094e7abade684b4e999b0e89d8eeba097eba0a2e6af85eba0b05e
UHC 枝葛료兪蝎렗렢溜潘렔竭愴陰蝎렗렢毅렰^ 11110010101010111100101011100111101101111110000111101010111001001100101011101001100011101010110010001110101100111110101011111110110110101110101110001110101010011100101011100110111100111110000111101011111001001100101011101001100011101010110010001110101100111110101111110110100011101011110101011110 f2abcae7b7e1eae4cae98eac8eb3eafedaeb8ea9cae6f3e1ebe4cae98eac8eb3ebf68ebd5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
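
The same kind of conversion can be sketched in Python. This is only an illustration, not the tool's actual implementation, which is not shown on this page. The mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the names `CHARSETS` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes b"?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("枝^"):
        print(label, bits, hexstr)
```

Because Python's codecs may differ in edge cases from the converter used here (for example in vendor extension characters), the output for unusual characters is not guaranteed to match the table byte for byte.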