To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 魏??沚?魏??沚?魏??沚?魏??沚?^ 1110100110110000001111110011111110011111100011010011111111101001101100000011111100111111100111111000110100111111111010011011000000111111001111111001111110001101001111111110100110110000001111110011111110011111100011010011111101011110 e9b03f3f9f8d3fe9b03f3f9f8d3fe9b03f3f9f8d3fe9b03f3f9f8d3f5e
EUC-JP 魏??沚?魏??沚?魏??沚?魏??沚?^ 1111001010110010001111110011111111011101111011010011111111110010101100100011111100111111110111011110110100111111111100101011001000111111001111111101110111101101001111111111001010110010001111110011111111011101111011010011111101011110 f2b23f3fdded3ff2b23f3fdded3ff2b23f3fdded3ff2b23f3fdded3f5e
UTF-8 魏재헬沚뺄魏재헬沚뺑魏재헬沚뺄魏재헬沚뺑^ 11101001101011011000111111101100100111101010110011101101100101111010110011100110101100101001101011101011101110101000010011101001101011011000111111101100100111101010110011101101100101111010110011100110101100101001101011101011101110101001000111101001101011011000111111101100100111101010110011101101100101111010110011100110101100101001101011101011101110101000010011101001101011011000111111101100100111101010110011101101100101111010110011100110101100101001101011101011101110101001000101011110 e9ad8fec9eaced97ace6b29aebba84e9ad8fec9eaced97ace6b29aebba91e9ad8fec9eaced97ace6b29aebba84e9ad8fec9eaced97ace6b29aebba915e
UHC 魏재헬沚뺄魏재헬沚뺑魏재헬沚뺄魏재헬沚뺑^ 1110101011100000110000001110011111000111111011111111001010101111101110111010110011101010111000001100000011100111110001111110111111110010101011111011101110110001111010101110000011000000111001111100011111101111111100101010111110111011101011001110101011100000110000001110011111000111111011111111001010101111101110111011000101011110 eae0c0e7c7eff2afbbaceae0c0e7c7eff2afbbb1eae0c0e7c7eff2afbbaceae0c0e7c7eff2afbbb15e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (0x3f) in the results.
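
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names (`encode_row`, `encode_table`) are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed stand-ins that may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print only the ASCII columns so the output is safe on any console.
    for label, (_shown, bits, hexstr) in encode_table("魏^").items():
        print(label, bits, hexstr)
```

For the input "魏", the UTF-8 row of this sketch yields the hex string e9ad8f, which agrees with the first three bytes of the UTF-8 row in the table, while the ISO-8859-1 row yields 3f because that charset has no such character.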