To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????c^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110001101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f635e
SJIS-WIN 乙??沚?乙??沚?乙??沚?乙??沚?c^ 100010011011001100111111001111111001111110001101001111111000100110110011001111110011111110011111100011010011111110001001101100110011111100111111100111111000110100111111100010011011001100111111001111111001111110001101001111110110001101011110 89b33f3f9f8d3f89b33f3f9f8d3f89b33f3f9f8d3f89b33f3f9f8d3f635e
EUC-JP 乙??沚?乙??沚?乙??沚?乙??沚?c^ 101100101011010100111111001111111101110111101101001111111011001010110101001111110011111111011101111011010011111110110010101101010011111100111111110111011110110100111111101100101011010100111111001111111101110111101101001111110110001101011110 b2b53f3fdded3fb2b53f3fdded3fb2b53f3fdded3fb2b53f3fdded3f635e
UTF-8 乙재헬沚빱乙재헬沚빪乙재헬沚빻乙재헬沚빵c^ 1110010010111001100110011110110010011110101011001110110110010111101011001110011010110010100110101110101110111001101100011110010010111001100110011110110010011110101011001110110110010111101011001110011010110010100110101110101110111001101010101110010010111001100110011110110010011110101011001110110110010111101011001110011010110010100110101110101110111001101110111110010010111001100110011110110010011110101011001110110110010111101011001110011010110010100110101110101110111001101101010110001101011110 e4b999ec9eaced97ace6b29aebb9b1e4b999ec9eaced97ace6b29aebb9aae4b999ec9eaced97ace6b29aebb9bbe4b999ec9eaced97ace6b29aebb9b5635e
UHC 乙재헬沚빱乙재헬沚빪乙재헬沚빻乙재헬沚빵c^ 111010111110000011000000111001111100011111101111111100101010111110111011101001001110101111100000110000001110011111000111111011111111001010101111101110111010001011101011111000001100000011100111110001111110111111110010101011111011101110101000111010111110000011000000111001111100011111101111111100101010111110111011101001110110001101011110 ebe0c0e7c7eff2afbba4ebe0c0e7c7eff2afbba2ebe0c0e7c7eff2afbba8ebe0c0e7c7eff2afbba7635e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)