To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??訟?6韋??猷??巽?6韋?+恁 1001011101010001001111110011111110001111110101110011111110000010010101011110100011101000001111110011111110010111010100010011111100111111100100100100011000111111100000100101010111101000111010000011111110000001011110111001110010001100 97513f3f8fd73f8255e8e83f3f97513f3f92463f8255e8e83f817b9c8c
EUC-JP 猷??訟?6韋??猷??巽?6韋?+恁 1100110110110010001111110011111110111110110110010011111110100011101101101111000011101010001111110011111111001101101100100011111100111111110000111010011100111111101000111011011011110000111010100011111110100001110111001101011111101100 cdb23f3fbed93fa3b6f0ea3f3fcdb23f3fc3a73fa3b6f0ea3fa1dcd7ec
UTF-8 猷띔물訟귣6韋쀢뒄猷뜯뀦巽숇6韋쀫+恁 111001111000110010110111111010111001110110010100111010111010110010111100111010001010100010011111111010101011011110100011111011111011110010010110111010011001111110001011111011001000000010100010111010111001001010000100111001111000110010110111111010111001110010101111111010111000000010100110111001011011011110111101111011001000100010000111111011111011110010010110111010011001111110001011111011001000000010101011111011111011110010001011111001101000000110000001 e78cb7eb9d94ebacbce8a89feab7a3efbc96e99f8bec80a2eb9284e78cb7eb9cafeb80a6e5b7bdec8887efbc96e99f8bec80abefbc8be68181
UHC 猷띔물訟귣6韋쀢뒄猷뜯뀦巽숇6韋쀫+恁 1110101110100011101101101110101010111001101100001110000111101000100000101110101110100011101101101110101011011111100101111110001010001010100000101110101110100011101101101110001010000101100111011110000111011110100110011110101110100011101101101110101011011111100101111110101110100011101010111110110011110110 eba3b6eab9b0e1e882eba3b6eadf97e28a82eba3b6e2859de1de99eba3b6eadf97eba3abecf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)