To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 猷??猷ラ?殉?+俉?6率?+冗??猷ヨ?^ 10010111010100010011111100111111100101110101000110000011100010010011111110001111011111010011111110000001011110111111101001100001001111111000001001010101100101111010011000111111100000010111101110001111111001110011111100111111100101110101000110000011100010000011111101011110 97513f3f975183893f8f7d3f817bfa613f825597a63f817b8fe73f3f975183883f5e
EUC-JP 猷??猷ラ?殉?+俉?6率?+冗??猷ヨ?^ 1100110110110010001111110011111111001101101100101010010111101001001111111011110111011110001111111010000111011100100011111011000110111011001111111010001110110110110011101010100000111111101000011101110010111110111010010011111100111111110011011011001010100101111010000011111101011110 cdb23f3fcdb2a5e93fbdde3fa1dc8fb1bb3fa3b6cea83fa1dcbee93f3fcdb2a5e83f5e
UTF-8 猷뜯뀦猷ラ레殉먮+俉뤿6率앸+冗밤닇猷ヨ뜌^ 11100111100011001011011111101011100111001010111111101011100000001010011011100111100011001011011111100011100000111010100111101011101000001000100011100110101011101000100111101011101010001010111011101111101111001000101111100100101111111000100111101011101001001011111111101111101111001001011011100111100011101000011111101100100101011011100011101111101111001000101111100101100001101001011111101011101100001010010011101011100010111000011111100111100011001011011111100011100000111010100011101011100111001000110001011110 e78cb7eb9cafeb80a6e78cb7e383a9eba088e6ae89eba8aeefbc8be4bf89eba4bfefbc96e78e87ec95b8efbc8be58697ebb0a4eb8b87e78cb7e383a8eb9c8c5e
UHC 猷뜯뀦猷ラ레殉먮+俉뤿6率앸+冗밤닇猷ヨ뜌^ 11101011101000111011011011100010100001011001110111101011101000111010101111101001101101111011100111100010111001101001000011101011101000111010101111100111111010111000111111101011101000111011011011100001111000111001110111101011101000111010101111101001101101111011100111100011100010001001000011101011101000111010101111101000100011011000111101011110 eba3b6e2859deba3abe9b7b9e2e690eba3abe7eb8feba3b6e1e39deba3abe9b7b9e38890eba3abe88d8f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)