What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????L???????????L^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100110001011110 3f3f3f3f3f3f3f3f3f3f3f4c3f3f3f3f3f3f3f3f3f3f3f4c5e
SJIS-WIN 猷??癌γ?巡?5節?L猷??癌γ?巡?5節?L^ 10010111010100010011111100111111100010101110000010000011110000010011111110001111100001000011111110000010010101001001000011011111001111110100110010010111010100010011111100111111100010101110000010000011110000010011111110001111100001000011111110000010010101001001000011011111001111110100110001011110 97513f3f8ae083c13f8f843f825490df3f4c97513f3f8ae083c13f8f843f825490df3f4c5e
EUC-JP 猷??癌γ?巡?5節?L猷??癌γ?巡?5節?L^ 11001101101100100011111100111111101101001110001010100110110000110011111110111101111001000011111110100011101101011100000011100001001111110100110011001101101100100011111100111111101101001110001010100110110000110011111110111101111001000011111110100011101101011100000011100001001111110100110001011110 cdb23f3fb4e2a6c33fbde43fa3b5c0e13f4ccdb23f3fb4e2a6c33fbde43fa3b5c0e13f4c5e
UTF-8 猷듐걗癌γ걚巡볥5節쒽L猷듐걗癌γ걚巡볥5節쒽L^ 11100111100011001011011111101011100100111001000011101010101100011001011111100111100110011000110011001110101100111110101010110001100110101110010110110111101000011110101110110011101001011110111110111100100101011110011110101111100000001110110010010010101111010100110011100111100011001011011111101011100100111001000011101010101100011001011111100111100110011000110011001110101100111110101010110001100110101110010110110111101000011110101110110011101001011110111110111100100101011110011110101111100000001110110010010010101111010100110001011110 e78cb7eb9390eab197e7998cceb3eab19ae5b7a1ebb3a5efbc95e7af80ec92bd4ce78cb7eb9390eab197e7998cceb3eab19ae5b7a1ebb3a5efbc95e7af80ec92bd4c5e
UHC 猷듐걗癌γ걚巡볥5節쒽L猷듐걗癌γ걚巡볥5節쒽L^ 1110101110100011101101011110001110000001100000101110010011011111101001011110001110000001100001001110001011011110100100111110101110100011101101011110111110111101100111010101001001001100111010111010001110110101111000111000000110000010111001001101111110100101111000111000000110000100111000101101111010010011111010111010001110110101111011111011110110011101010100100100110001011110 eba3b5e38182e4dfa5e38184e2de93eba3b5efbd9d524ceba3b5e38182e4dfa5e38184e2de93eba3b5efbd9d524c5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
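
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the codec names in CHARSETS are assumptions about which Python codecs correspond to the charsets listed above (cp932 for SJIS-WIN, cp949 for UHC), and errors="replace" is assumed to mirror the "?" (3f) substitution that appears in the table for characters a charset cannot represent. The names CHARSETS, to_bit_strings, and convert are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("L^").items():
        print(label, binary, hexadecimal)
```

For ASCII-only input such as "L^", every charset listed here produces the same bytes (4c5e), which matches the trailing bytes of each row in the table. Results for non-ASCII input may differ slightly from this page where the Python codecs and the page's charset definitions disagree.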