To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?衣絲???除?????畯孟∧梯?畯脈? 11100011011100010011111110001000110111111110001101001110001111110011111100111111100011111001110000111111001111110011111100111111001111111111101101101111100101101101000010000001110010001001001011110010001111111111101101101111100101101010110000111111 e3713f88dfe34e3f3f3f8f9c3f3f3f3f3ffb6f96d081c892f23ffb6f96ac3f
EUC-JP 縡?衣絲???除?????畯孟∧梯?畯脈? 111001011101001000111111101100001110000111100101101011110011111100111111001111111011110111111100001111110011111100111111001111110011111110001111110011011011101111001100110100101010001011001010110001001111010000111111100011111100110110111011110011001010111000111111 e5d23fb0e1e5af3f3f3fbdfc3f3f3f3f3f8fcdbbccd2a2cac4f43f8fcdbbccae3f
UTF-8 縡렕衣絲횅亐렕除곌렢닿렕렟畯孟∧梯렟畯脈렲 111001111011100010100001111010111010000010010101111010001010000110100011111001111011010110110010111011011001101010000101111001001011101010010000111010111010000010010101111010011001100110100100111010101011001110001100111010111010000010100010111010111000101110111111111010111010000010010101111010111010000010011111111001111001010110101111111001011010110110011111111000101000100010100111111001101010001010101111111010111010000010011111111001111001010110101111111010001000010010001000111010111010000010110010 e7b8a1eba095e8a1a3e7b5b2ed9a85e4ba90eba095e999a4eab38ceba0a2eb8bbfeba095eba09fe795afe5ad9fe288a7e6a2afeba09fe795afe88488eba0b2
UHC 縡렕衣絲횅亐렕除곌렢닿렕렟畯孟∧梯렟畯脈렲 111011101010110110001110101010101110101111111101110111101110101011001000101101111110101010100111100011101010101011110000101101101011000011101010100011101011001110110100111010101000111010101010100011101011000011110001111000011101100011101011101000011111110011110000101011001000111010110000111100011110000111011000111001101000111010111111 eead8eaaebfddeeac8b7eaa78eaaf0b6b0ea8eb3b4ea8eaa8eb0f1e1d8eba1fcf0ac8eb0f1e1d8e68ebf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)