What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????E?????? 0011111100111111001111110011111100111111001111110011111101000101001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f453f3f3f3f3f3f
SJIS-WIN 項?齬餘荊?削E?齬餘荊?蒡 1000110110000000001111111110101010010111111010010101000010001100011101000011111110001101111011010100010100111111111010101001011111101001010100001000110001110100001111111110010011101110 8d803fea97e9508c743f8ded453fea97e9508c743fe4ee
EUC-JP 項?齬餘荊?削E?齬餘荊?蒡 1011100111100000001111111111001111110111111100011011000110110111110101010011111110111010111011110100010100111111111100111111011111110001101100011011011111010101001111111110100011110000 b9e03ff3f7f1b1b7d53fbaef453ff3f7f1b1b7d53fe8f0
UTF-8 項렠齬餘荊거削E렠齬餘荊거蒡 11101001101000001000010111101011101000001010000011101001101111011010110011101001101001001001100011101000100011011000101011101010101100011011000011100101100010011000101001000101111010111010000010100000111010011011110110101100111010011010010010011000111010001000110110001010111010101011000110110000111010001001001010100001 e9a085eba0a0e9bdace9a498e88d8aeab1b0e5898a45eba0a0e9bdace9a498e88d8aeab1b0e892a1
UHC 項렠齬餘荊거削E렠齬餘荊거蒡 111110101010001110001110101100011110010111100001111001101010111011111011101010101011000011000101110111101111101101000101100011101011000111100101111000011110011010101110111110111010101010110000110001011101101110111100 faa38eb1e5e1e6aefbaab0c5defb458eb1e5e1e6aefbaab0c5dbbc

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
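
A table like the one above can be reproduced offline with a short script. The following is only a minimal sketch in Python, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from this tool in edge cases. Characters a charset cannot represent are replaced with "?" (0x3f), which is the likely source of the 3f bytes in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python codec to SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's codec for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("項E").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.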