What bit string does a character (or several characters) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 猷??嗽?6義??烏?6椅??猷??擾よ?^ 1001011101010001001111110011111110011010011101010011111110000010010101011000101101100000001111110011111110001001010001110011111110000010010101011000100011010110001111110011111110010111010100010011111100111111100011111110111110000010111001100011111101011110 97513f3f9a753f82558b603f3f89473f825588d63f3f97513f3f8fef82e63f5e
EUC-JP 猷??嗽?6義??烏?6椅??猷??擾よ?^ 1100110110110010001111110011111111010011110101100011111110100011101101101011010111000001001111110011111110110001101010000011111110100011101101101011000011011000001111110011111111001101101100100011111100111111101111101111000110100100111010000011111101011110 cdb23f3fd3d63fa3b6b5c13f3fb1a83fa3b6b0d83f3fcdb23f3fbef1a4e83f5e
UTF-8 猷띠뿄嗽뉖6義⑶삫烏쏅6椅뚢뾿猷띠썿擾よ붃^ 11100111100011001011011111101011100111011010000011101011101111111000010011100101100101111011110111101011100010011001011011101111101111001001011011100111101111101010100111100010100100011011011011101100100000101010101111100111100000111000111111101100100011111000010111101111101111001001011011100110101001001000010111101011100110101010001011101011101111101011111111100111100011001011011111101011100111011010000011101100100011011011111111100110100100111011111011100011100000101000100011101011101101101000001101011110 e78cb7eb9da0ebbf84e597bdeb8996efbc96e7bea9e291b6ec82abe7838fec8f85efbc96e6a485eb9aa2ebbebfe78cb7eb9da0ec8dbfe693bee38288ebb6835e
UHC 猷띠뿄嗽뉖6義⑶삫烏쏅6椅뚢뾿猷띠썿擾よ붃^ 11101011101000111011011011101100100101111000110011100001111101011000011111101011101000111011011011101011111110011010100111101001100110001010101011101000101000011001101111101011101000111011011011101011111101011000110011100010100101111000011111101011101000111011011011101100100110111010100111101000111101101010101011101000100101001011111101011110 eba3b6ec978ce1f587eba3b6ebf9a9e998aae8a19beba3b6ebf58ce29787eba3b6ec9ba9e8f6aae894bf5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
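
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the names CHARSETS, to_bit_strings, and conversion_table are hypothetical, and the mapping from the labels above to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. Unencodable characters are replaced with "?", which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Return one (label, binary, hexadecimal) row per charset."""
    return [(label, *to_bit_strings(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("よ^"):
        print(label, binary, hexadecimal)
```

For the input "よ^", this sketch gives e382885e in UTF-8, 82e65e in CP932, and a4e85e in EUC-JP, which is consistent with the byte values for "よ" and "^" in the rows above. In ISO-8859-1, "よ" cannot be represented and becomes 3f.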