To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??猷?ぜ???語⑤8??Ⅴ柔ッ??猿?? 1000100101000111001111110011111110010111010100010011111110000010101110100011111100111111001111111000110011101010100001110100010010000010010101110011111100111111100001110101100010001111010111111000001101100010001111110011111110001001100011100011111100111111 89473f3f97513f82ba3f3f3f8cea874482573f3f87588f5f83623f3f898e3f3f
EUC-JP 烏??猷?ぜ???語?8堉??柔ッ??猿?? 1011000110101000001111110011111111001101101100100011111110100100101111000011111100111111001111111011100011101100001111111010001110111000100011111011011111111101001111110011111110111101110000001010010111000011001111110011111110110001111011100011111100111111 b1a83f3fcdb23fa4bc3f3f3fb8ec3fa3b88fb7fd3f3fbdc0a5c33f3fb1ee3f3f
UTF-8 烏띻퀣猷녻ぜ琉꾟봼語⑤8堉좑Ⅴ柔ッ섉꼷猿볦춷 111001111000001110001111111010111001110110111011111011011000000010100011111001111000110010110111111010111000010110111011111000111000000110011100111011111010011110001100111010101011111010011111111010111011010010111100111010001010101010011110111000101001000110100100111011111011110010011000111001011010000010001001111011001010001010010001111000101000010110100100111001101001111110010100111000111000001110000011111011001000010010001001111010101011110010110111111001111000110010111111111010111011001110100110111011001011011010110111 e7838feb9dbbed80a3e78cb7eb85bbe3819cefa78ceabe9febb4bce8aa9ee291a4efbc98e5a089eca291e285a4e69f94e38383ec8489eabcb7e78cbfebb3a6ecb6b7
UHC 烏띻퀣猷녻ぜ琉꾟봼語⑤8堉좑Ⅴ柔ッ섉꼷猿볦춷 1110100010100001100011011110101010110011100101111110101110100011100001101110100010101010101111001110101110100100100001001110001010010100100000111110010111011110101010001110101110100011101110001110101110111100101000001110111110100101101101001110101011110101101010111100001110011000111001101000010010001111111010101011101110010011111011001010110110010011 e8a18deab397eba386e8aabceba484e29483e5dea8eba3b8ebbca0efa5b4eaf5abc398e6848feabb93ecad93

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)