What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 枝葛?陰蝎??誼潘?葛?陰蝎??毅?^ 1000111001111101100010101000101100111111100010010100000111100101100110010011111100111111100010110110001011100000010011100011111110001010100010110011111110001001010000011110010110011001001111110011111110001011010000100011111101011110 8e7d8a8b3f8941e5993f3f8b62e04e3f8a8b3f8941e5993f3f8b423f5e
EUC-JP 枝葛?陰蝎??誼潘?葛?陰蝎??毅?^ 1011101111011110101100111110101100111111101100011010001011101001111110010011111100111111101101011100001111011111101011110011111110110011111010110011111110110001101000101110100111111001001111110011111110110101101000110011111101011110 bbdeb3eb3fb1a2e9f93f3fb5c3dfaf3fb3eb3fb1a2e9f93f3fb5a33f5e
UTF-8 枝葛료陰蝎렗렢誼潘렔葛료陰蝎렗렢毅렰^ 11100110100111101001110111101000100100011001101111101011101000111000110011101001100110011011000011101000100111011000111011101011101000001001011111101011101000001010001011101000101010101011110011100110101111011001100011101011101000001001010011101000100100011001101111101011101000111000110011101001100110011011000011101000100111011000111011101011101000001001011111101011101000001010001011100110101011111000010111101011101000001011000001011110 e69e9de8919beba38ce999b0e89d8eeba097eba0a2e8aabce6bd98eba094e8919beba38ce999b0e89d8eeba097eba0a2e6af85eba0b05e
UHC 枝葛료陰蝎렗렢誼潘렔葛료陰蝎렗렢毅렰^ 11110010101010111100101011100111101101111110000111101011111001001100101011101001100011101010110010001110101100111110101111111110110110101110101110001110101010011100101011100111101101111110000111101011111001001100101011101001100011101010110010001110101100111110101111110110100011101011110101011110 f2abcae7b7e1ebe4cae98eac8eb3ebfedaeb8ea9cae7b7e1ebe4cae98eac8eb3ebf68ebd5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
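
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: it assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `convert` are made up for illustration.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("枝^").items():
        print(label, bits, hexstr)
```

Passing `errors="replace"` keeps the conversion from raising an exception on unencodable input, at the cost of losing the original character, so a "?" in the output does not necessarily mean the input contained a question mark.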