What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 猷??松ル6誼??z猷??松ル6誼??zB 10010111010100010011111100111111100011111011110010000011100010111000001001010101100010110110001000111111001111110111101010010111010100010011111100111111100011111011110010000011100010111000001001010101100010110110001000111111001111110111101001000010 97513f3f8fbc838b82558b623f3f7a97513f3f8fbc838b82558b623f3f7a42
EUC-JP 猷??松ル6誼??z猷??松ル6誼??zB 11001101101100100011111100111111101111101011111010100101111010111010001110110110101101011100001100111111001111110111101011001101101100100011111100111111101111101011111010100101111010111010001110110110101101011100001100111111001111110111101001000010 cdb23f3fbebea5eba3b6b5c33f3f7acdb23f3fbebea5eba3b6b5c33f3f7a42
UTF-8 猷띔물松ル6誼댿뼦z猷띔물松ル6誼댿뼦zB 111001111000110010110111111010111001110110010100111010111010110010111100111001101001110110111110111000111000001110101011111011111011110010010110111010001010101010111100111010111000110010111111111010111011110010100110011110101110011110001100101101111110101110011101100101001110101110101100101111001110011010011101101111101110001110000011101010111110111110111100100101101110100010101010101111001110101110001100101111111110101110111100101001100111101001000010 e78cb7eb9d94ebacbce69dbee383abefbc96e8aabceb8cbfebbca67ae78cb7eb9d94ebacbce69dbee383abefbc96e8aabceb8cbfebbca67a42
UHC 猷띔물松ル6誼댿뼦z猷띔물松ル6誼댿뼦zB 111010111010001110110110111010101011100110110000111000011110011010101011111010111010001110110110111010111111111010001000111000101001011010101001011110101110101110100011101101101110101010111001101100001110000111100110101010111110101110100011101101101110101111111110100010001110001010010110101010010111101001000010 eba3b6eab9b0e1e6abeba3b6ebfe88e296a97aeba3b6eab9b0e1e6abeba3b6ebfe88e296a97a42

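SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is encoded as "?" (3f), as the ISO-8859-1 row shows.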
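
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the page's own implementation. The codec names on the right of `CHARSETS` are assumptions about which Python codecs correspond to the table's labels (`cp932` for SJIS-WIN, `cp949` for UHC), and `encode_row` and `encode_table` are hypothetical helper names. Unencodable characters are replaced with "?", which mirrors the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("松ルzB").items():
        print(label, bits, hex_string)
```

For example, `encode_row("zB", "utf-8")` returns `("0111101001000010", "7a42")`, which matches the last two bytes of every row in the table, since "z" and "B" are plain ASCII in all five charsets.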