To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??巡?6椅??猷??循?6???恂?+ 100101110101000100111111001111111000111110000100001111111000001001010101100010001101011000111111001111111001011101010001001111110011111110001111011110100011111110000010010101010011111100111111001111111001110010010110001111111000000101111011 97513f3f8f843f825588d63f3f97513f3f8f7a3f82553f3f3f9c963f817b
EUC-JP 猷??巡?6椅??猷??循?6???恂?+ 110011011011001000111111001111111011110111100100001111111010001110110110101100001101100000111111001111111100110110110010001111110011111110111101110110110011111110100011101101100011111100111111001111111101011111110110001111111010000111011100 cdb23f3fbde43fa3b6b0d83f3fcdb23f3fbddb3fa3b63f3f3fd7f63fa1dc
UTF-8 猷띠썳巡볥6椅뚢뼦猷듯굛循용6吏뽪썭恂귣+ 111001111000110010110111111010111001110110100000111011001000110110110011111001011011011110100001111010111011001110100101111011111011110010010110111001101010010010000101111010111001101010100010111010111011110010100110111001111000110010110111111010111001001110101111111010101011010110011011111001011011111010101010111011001001101010101001111011111011110010010110111011111010011110011110111010111011110110101010111011001000110110101101111001101000000110000010111010101011011110100011111011111011110010001011 e78cb7eb9da0ec8db3e5b7a1ebb3a5efbc96e6a485eb9aa2ebbca6e78cb7eb93afeab59be5beaaec9aa9efbc96efa79eebbdaaec8dade68182eab7a3efbc8b
UHC 猷띠썳巡볥6椅뚢뼦猷듯굛循용6吏뽪썭恂귣+ 111010111010001110110110111011001001101110100001111000101101111010010011111010111010001110110110111010111111010110001100111000101001011010101001111010111010001110110101111011011000001010000011111000101110000010111111111010111010001110110110111011001010011110010110111001101001101110011101111000101110000110000010111010111010001110101011 eba3b6ec9ba1e2de93eba3b6ebf58ce296a9eba3b5ed8283e2e0bfeba3b6eca796e69b9de2e182eba3ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)