To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訷夭が邁醍ー皮泪薰ィ訷夭が邁醍ー皮泪薰ョ 111110111010010010011010111011101000001010101010111001111011000010010001111001111011000010010100111001111001111110100011111110111001111010101000111110111010010010011010111011101000001010101010111001111011000010010001111001111011000010010100111001111001111110100011111110111001111010101110 fba49aee82aae7b091e7b094e79fa3fb9ea8fba49aee82aae7b091e7b094e79fa3fb9eae
EUC-JP 訷夭が邁醍ー皮泪?ィ訷夭が邁醍ー皮泪?ョ 10001111110111011101010011010100111100001010010010101100111011101011001011000010111010011000111010110000110010001110100111011110101001010011111110001110101010001000111111011101110101001101010011110000101001001010110011101110101100101100001011101001100011101011000011001000111010011101111010100101001111111000111010101110 8fddd4d4f0a4aceeb2c2e98eb0c8e9dea53f8ea88fddd4d4f0a4aceeb2c2e98eb0c8e9dea53f8eae
UTF-8 訷夭が邁醍ー皮泪薰ィ訷夭が邁醍ー皮泪薰ョ 111010001010100010110111111001011010010010101101111000111000000110001100111010011000001010000001111010011000011010001101111011111011110110110000111001111001101010101110111001101011001110101010111010001001011010110000111011111011110110101000111010001010100010110111111001011010010010101101111000111000000110001100111010011000001010000001111010011000011010001101111011111011110110110000111001111001101010101110111001101011001110101010111010001001011010110000111011111011110110101110 e8a8b7e5a4ade3818ce98281e9868defbdb0e79aaee6b3aae896b0efbda8e8a8b7e5a4ade3818ce98281e9868defbdb0e79aaee6b3aae896b0efbdae
UHC ?夭が邁醍?皮?薰??夭が邁醍?皮?薰? 0011111111101000111011001010101010101100110110001110010011110000101101010011111111111001101010110011111111111101101110010011111100111111111010001110110010101010101011001101100011100100111100001011010100111111111110011010101100111111111111011011100100111111 3fe8ecaaacd8e4f0b53ff9ab3ffdb93f3fe8ecaaacd8e4f0b53ff9ab3ffdb93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)