To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??掖??恂?+??6???猷??永 1001011101010001001111110011111110011101011101000011111100111111100111001001011000111111100000010111101100111111001111111000001001010101001111110011111100111111100101110101000100111111001111111000100101101001 97513f3f9d743f3f9c963f817b3f3f82553f3f3f97513f3f8969
EUC-JP 猷??掖??恂?+??6蓀??猷??永 11001101101100100011111100111111110110011101010100111111001111111101011111110110001111111010000111011100001111110011111110100011101101101000111111011000111110000011111100111111110011011011001000111111001111111011000111001010 cdb23f3fd9d53f3fd7f63fa1dc3f3fa3b68fd8f83f3fcdb23f3fb1ca
UTF-8 猷띕걹掖경썭恂곷+廬볥6蓀꺿댘猷띕걹永 111001111000110010110111111010111001110110010101111010101011000110111001111001101000111010010110111010101011001010111101111011001000110110101101111001101000000110000010111010101011001110110111111011111011110010001011111011111010011010000010111010111011001110100101111011111011110010010110111010001001001110000000111010101011101010111111111010111000110010011000111001111000110010110111111010111001110110010101111010101011000110111001111001101011000010111000 e78cb7eb9d95eab1b9e68e96eab2bdec8dade68182eab3b7efbc8befa682ebb3a5efbc96e89380eababfeb8c98e78cb7eb9d95eab1b9e6b0b8
UHC 猷띕걹掖경썭恂곷+廬볥6蓀꺿댘猷띕걹永 1110101110100011101101101110101110000001100111011110010011111010101100001110011010011011100111011110001011100001100000011110101110100011101010111110010111111110100100111110101110100011101101101110000111100000100000111110001010001000101111001110101110100011101101101110101110000001100111011110011110110101 eba3b6eb819de4fab0e69b9de2e181eba3abe5fe93eba3b6e1e083e288bceba3b6eb819de7b5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)