To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????{N}????????{N{^ 0011111100111111001111110011111100111111001111110011111100111111011110110100111001111101001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN 蛛イ閠恵蛛イ闥粘{N}蛛イ閠恵蛛イ闥粘{N{^ 1110010110000001101100101110100010000000100011000110001011100101100000011011001011101000100100101001010001010011011110110100111001111101111001011000000110110010111010001000000010001100011000101110010110000001101100101110100010010010100101000101001101111011010011100111101101011110 e581b2e8808c62e581b2e89294537b4e7de581b2e8808c62e581b2e89294537b4e7b5e
EUC-JP 蛛イ閠恵蛛イ闥粘{N}蛛イ閠恵蛛イ闥粘{N{^ 111010011110000110001110101100101110111111100000101101111100001111101001111000011000111010110010111011111111001011000111101101000111101101001110011111011110100111100001100011101011001011101111111000001011011111000011111010011110000110001110101100101110111111110010110001111011010001111011010011100111101101011110 e9e18eb2efe0b7c3e9e18eb2eff2c7b47b4e7de9e18eb2efe0b7c3e9e18eb2eff2c7b47b4e7b5e
UTF-8 蛛イ閠恵蛛イ闥粘{N}蛛イ閠恵蛛イ闥粘{N{^ 11101000100110111001101111101111101111011011001011101001100101101010000011100110100000011011010111101000100110111001101111101111101111011011001011101001100101111010010111100111101100101001100001111011010011100111110111101000100110111001101111101111101111011011001011101001100101101010000011100110100000011011010111101000100110111001101111101111101111011011001011101001100101111010010111100111101100101001100001111011010011100111101101011110 e89b9befbdb2e996a0e681b5e89b9befbdb2e997a5e7b2987b4e7de89b9befbdb2e996a0e681b5e89b9befbdb2e997a5e7b2987b4e7b5e
UHC 蛛???蛛??粘{N}蛛???蛛??粘{N{^ 1111000111001000001111110011111100111111111100011100100000111111001111111110111111000100011110110100111001111101111100011100100000111111001111110011111111110001110010000011111100111111111011111100010001111011010011100111101101011110 f1c83f3f3ff1c83f3fefc47b4e7df1c83f3f3ff1c83f3fefc47b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)