To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?東鏃?鏃?{?宏?東鏃?鏃?{?槐^ 0011111110010011100011001110100001010110001111111110100001010110001111111000000101101111001111111000110101000111001111111001001110001100111010000101011000111111111010000101011000111111100000010110111100111111100111101100010101011110 3f938ce8563fe8563f816f3f8d473f938ce8563fe8563f816f3f9ec55e
EUC-JP ?東鏃?鏃?{?宏?東鏃?鏃?{?槐^ 0011111111000101111011001110111110110111001111111110111110110111001111111010000111010000001111111011100110101000001111111100010111101100111011111011011100111111111011111011011100111111101000011101000000111111110111001100011101011110 3fc5ecefb73fefb73fa1d03fb9a83fc5ecefb73fefb73fa1d03fdcc75e
UTF-8 뤯東鏃아鏃퐥{쌈宏뤯東鏃아鏃퐥{쌈槐^ 11101011101001001010111111100110100111011011000111101001100011111000001111101100100101011000010011101001100011111000001111101101100100001010010111101111101111011001101111101100100011001000100011100101101011101000111111101011101001001010111111100110100111011011000111101001100011111000001111101100100101011000010011101001100011111000001111101101100100001010010111101111101111011001101111101100100011001000100011100110101001111001000001011110 eba4afe69db1e98f83ec9584e98f83ed90a5efbd9bec8c88e5ae8feba4afe69db1e98f83ec9584e98f83ed90a5efbd9bec8c88e6a7905e
UHC 뤯東鏃아鏃퐥{쌈宏뤯東鏃아鏃퐥{쌈槐^ 10001111110111011101010011010100111100001110110010111110110001101111000011101100101111011000111010100011111110111011110111010011110011101101101110001111110111011101010011010100111100001110110010111110110001101111000011101100101111011000111010100011111110111011110111010011110011101101100101011110 8fddd4d4f0ecbec6f0ecbd8ea3fbbdd3cedb8fddd4d4f0ecbec6f0ecbd8ea3fbbdd3ced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)