To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 猷??烏?猷??烏?[猷??烏?猷??烏?[^ 10010111010100010011111100111111100010010100011100111111100101110101000100111111001111111000100101000111001111110101101110010111010100010011111100111111100010010100011100111111100101110101000100111111001111111000100101000111001111110101101101011110 97513f3f89473f97513f3f89473f5b97513f3f89473f97513f3f89473f5b5e
EUC-JP 猷??烏?猷??烏?[猷??烏?猷??烏?[^ 11001101101100100011111100111111101100011010100000111111110011011011001000111111001111111011000110101000001111110101101111001101101100100011111100111111101100011010100000111111110011011011001000111111001111111011000110101000001111110101101101011110 cdb23f3fb1a83fcdb23f3fb1a83f5bcdb23f3fb1a83fcdb23f3fb1a83f5b5e
UTF-8 猷듭뿉烏쌻猷듭뿉烏쌻[猷듭뿉烏쌻猷듭뿉烏쌻[^ 111001111000110010110111111010111001001110101101111010111011111110001001111001111000001110001111111011001000110010111011111001111000110010110111111010111001001110101101111010111011111110001001111001111000001110001111111011001000110010111011010110111110011110001100101101111110101110010011101011011110101110111111100010011110011110000011100011111110110010001100101110111110011110001100101101111110101110010011101011011110101110111111100010011110011110000011100011111110110010001100101110110101101101011110 e78cb7eb93adebbf89e7838fec8cbbe78cb7eb93adebbf89e7838fec8cbb5be78cb7eb93adebbf89e7838fec8cbbe78cb7eb93adebbf89e7838fec8cbb5b5e
UHC 猷듭뿉烏쌻猷듭뿉烏쌻[猷듭뿉烏쌻猷듭뿉烏쌻[^ 11101011101000111011010111101100100101111001000011101000101000011001101101101001111010111010001110110101111011001001011110010000111010001010000110011011011010010101101111101011101000111011010111101100100101111001000011101000101000011001101101101001111010111010001110110101111011001001011110010000111010001010000110011011011010010101101101011110 eba3b5ec9790e8a19b69eba3b5ec9790e8a19b695beba3b5ec9790e8a19b69eba3b5ec9790e8a19b695b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
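
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` is hypothetical, and the codec names `latin-1`, `cp932`, `euc_jp`, and `cp949` are assumed to be the closest Python equivalents of ISO-8859-1, SJIS-WIN, EUC-JP, and UHC; individual characters may map differently than they do here. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the table above.

```python
def encode_table(text, charsets=("latin-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits and join them into one bit string
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("あ"):
        print(name, bits, hex_string)
```

Each row holds the charset name, the bit string, and the hexadecimal string, in the same order as the columns of the table.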