To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 掖⑤?掖⑤?茹k?掖⑤?掖⑤?茹k?B 10011101011101001000011101000100001111111001110101110100100001110100010000111111111001001010010110000010100010110011111110011101011101001000011101000100001111111001110101110100100001110100010000111111111001001010010110000010100010110011111101000010 9d7487443f9d7487443fe4a5828b3f9d7487443f9d7487443fe4a5828b3f42
EUC-JP 掖??掖??茹k?掖??掖??茹k?B 110110011101010100111111001111111101100111010101001111110011111111101000101001111010001111101011001111111101100111010101001111110011111111011001110101010011111100111111111010001010011110100011111010110011111101000010 d9d53f3fd9d53f3fe8a7a3eb3fd9d53f3fd9d53f3fe8a7a3eb3f42
UTF-8 掖⑤젦掖⑤젙茹k젩掖⑤젦掖⑤젙茹k젩B 11100110100011101001011011100010100100011010010011101100101000001010011011100110100011101001011011100010100100011010010011101100101000001001100111101000100011001011100111101111101111011000101111101100101000001010100111100110100011101001011011100010100100011010010011101100101000001010011011100110100011101001011011100010100100011010010011101100101000001001100111101000100011001011100111101111101111011000101111101100101000001010100101000010 e68e96e291a4eca0a6e68e96e291a4eca099e88cb9efbd8beca0a9e68e96e291a4eca0a6e68e96e291a4eca099e88cb9efbd8beca0a942
UHC 掖⑤젦掖⑤젙茹k젩掖⑤젦掖⑤젙茹k젩B 11100100111110101010100011101011101000001001111011100100111110101010100011101011101000001001010111100110101010101010001111101011101000001010000111100100111110101010100011101011101000001001111011100100111110101010100011101011101000001001010111100110101010101010001111101011101000001010000101000010 e4faa8eba09ee4faa8eba095e6aaa3eba0a1e4faa8eba09ee4faa8eba095e6aaa3eba0a142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
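
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the page's actual implementation. The codec names are assumptions about which Python codecs correspond to the table's labels (cp932 for SJIS-WIN, cp949 for UHC), and the helper names bit_strings and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), similar to the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For the input "B", every charset yields 01000010 (hexadecimal 42), which matches the last byte of each row in the table. Exact results for other characters may differ from the page wherever its charset definitions or replacement rules differ from Python's codecs.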