What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??猷ラ?循k+愿?4誼?+鉛??猷リ? 10010111010100010011111100111111100101110101000110000011100010010011111110001111011110101000001010001011100000010111101110011100110000110011111110000010010100111000101101100010001111111000000101111011100010011001010000111111001111111001011101010001100000111000101000111111 97513f3f975183893f8f7a828b817b9cc33f82538b623f817b89943f3f9751838a3f
EUC-JP 猷??猷ラ?循k+愿?4誼?+鉛??猷リ? 11001101101100100011111100111111110011011011001010100101111010010011111110111101110110111010001111101011101000011101110011011000110001010011111110100011101101001011010111000011001111111010000111011100101100011111010000111111001111111100110110110010101001011110101000111111 cdb23f3fcdb2a5e93fbddba3eba1dcd8c53fa3b4b5c33fa1dcb1f43f3fcdb2a5ea3f
UTF-8 猷띔물猷ラ레循k+愿꾨4誼덈+鉛녴궟猷リ큵 111001111000110010110111111010111001110110010100111010111010110010111100111001111000110010110111111000111000001110101001111010111010000010001000111001011011111010101010111011111011110110001011111011111011110010001011111001101000010010111111111010101011111010101000111011111011110010010100111010001010101010111100111010111000110110001000111011111011110010001011111010011000100110011011111010111000010110110100111010101011011010011111111001111000110010110111111000111000001110101010111011011000000110110101 e78cb7eb9d94ebacbce78cb7e383a9eba088e5beaaefbd8befbc8be684bfeabea8efbc94e8aabceb8d88efbc8be9899beb85b4eab69fe78cb7e383aaed81b5
UHC 猷띔물猷ラ레循k+愿꾨4誼덈+鉛녴궟猷リ큵 111010111010001110110110111010101011100110110000111010111010001110101011111010011011011110111001111000101110000010100011111010111010001110101011111010101011010010000100111010111010001110110100111010111111111010001000111010111010001110101011111001101110011110000110111000111000001010110010111010111010001110101011111010101011010010000100 eba3b6eab9b0eba3abe9b7b9e2e0a3eba3abeab484eba3b4ebfe88eba3abe6e786e382b2eba3abeab484
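
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The codec names are assumptions: `cp932` stands in for SJIS-WIN, `euc_jp` for EUC-JP, and `cp949` for UHC, and the helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 row, though other edge cases may differ from the page's converter.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 approximates SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 approximates UHC
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For the single character "あ", this sketch yields `e38182` in UTF-8, `82a0` in SJIS-WIN, `a4a2` in EUC-JP, and `3f` in ISO-8859-1, since that charset has no Japanese characters.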

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).