To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 魏??縡紗???趾?魏??縡紗???趾?^ 1110100110110000001111110011111111100011011100011000111011010001001111110011111100111111111001101110010000111111111010011011000000111111001111111110001101110001100011101101000100111111001111110011111111100110111001000011111101011110 e9b03f3fe3718ed13f3f3fe6e43fe9b03f3fe3718ed13f3f3fe6e43f5e
EUC-JP 魏??縡紗???趾?魏??縡紗???趾?^ 1111001010110010001111110011111111100101110100101011110011010011001111110011111100111111111011001110011000111111111100101011001000111111001111111110010111010010101111001101001100111111001111110011111111101100111001100011111101011110 f2b23f3fe5d2bcd33f3f3fece63ff2b23f3fe5d2bcd33f3f3fece63f5e
UTF-8 魏재횐縡紗歷林렲趾쌨魏재횐縡紗歷林렲趾쌤^ 11101001101011011000111111101100100111101010110011101101100110101001000011100111101110001010000111100111101101001001011111100110101011011011011111101111101001111011010011101011101000001011001011101000101101101011111011101100100011001010100011101001101011011000111111101100100111101010110011101101100110101001000011100111101110001010000111100111101101001001011111100110101011011011011111101111101001111011010011101011101000001011001011101000101101101011111011101100100011001010010001011110 e9ad8fec9eaced9a90e7b8a1e7b497e6adb7efa7b4eba0b2e8b6beec8ca8e9ad8fec9eaced9a90e7b8a1e7b497e6adb7efa7b4eba0b2e8b6beec8ca45e
UHC 魏재횐縡紗歷林렲趾쌨魏재횐縡紗歷林렲趾쌤^ 1110101011100000110000001110011111001000101110101110111010101101110111101110100111010101111101101110110011110111100011101011111111110010101111111011110111011110111010101110000011000000111001111100100010111010111011101010110111011110111010011101010111110110111011001111011110001110101111111111001010111111101111011101110001011110 eae0c0e7c8baeeaddee9d5f6ecf78ebff2bfbddeeae0c0e7c8baeeaddee9d5f6ecf78ebff2bfbddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)