To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n?????????nB 001111110011111100111111001111110011111100111111001111110011111100111111011011100011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f6e3f3f3f3f3f3f3f3f3f6e42
SJIS-WIN 猷????6???n猷????6???nB 10010111010100010011111100111111001111110011111110000010010101010011111100111111001111110110111010010111010100010011111100111111001111110011111110000010010101010011111100111111001111110110111001000010 97513f3f3f3f82553f3f3f6e97513f3f3f3f82553f3f3f6e42
EUC-JP 猷????6???n猷????6???nB 11001101101100100011111100111111001111110011111110100011101101100011111100111111001111110110111011001101101100100011111100111111001111110011111110100011101101100011111100111111001111110110111001000010 cdb23f3f3f3fa3b63f3f3f6ecdb23f3f3f3fa3b63f3f3f6e42
UTF-8 猷띕뇾輦됰6劉잌뮁n猷띕뇾輦됰6劉잌뮁nB 111001111000110010110111111010111001110110010101111010111000011110111110111011111010011010011000111010111001000010110000111011111011110010010110111011111010011110000111111011001001111010001100111010111010111010000001011011101110011110001100101101111110101110011101100101011110101110000111101111101110111110100110100110001110101110010000101100001110111110111100100101101110111110100111100001111110110010011110100011001110101110101110100000010110111001000010 e78cb7eb9d95eb87beefa698eb90b0efbc96efa787ec9e8cebae816ee78cb7eb9d95eb87beefa698eb90b0efbc96efa787ec9e8cebae816e42
UHC 猷띕뇾輦됰6劉잌뮁n猷띕뇾輦됰6劉잌뮁nB 111010111010001110110110111010111000011110011111111001101110010010001001111010111010001110110110111010101110010110011111111001011001001010010000011011101110101110100011101101101110101110000111100111111110011011100100100010011110101110100011101101101110101011100101100111111110010110010010100100000110111001000010 eba3b6eb879fe6e489eba3b6eae59fe592906eeba3b6eb879fe6e489eba3b6eae59fe592906e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)