To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鄒夂擅縺樒セ夂擅縺夂セ夂擅縺樒セ夂擅縺喊 11100111101111101001101011100111100111011010000111100011100000011001111011100111101111101001101011100111100111011010000111100011100000011001101011100111101111101001101011100111100111011010000111100011100000011001111011100111101111101001101011100111100111011010000111100011100000011001101001011110 e7be9ae79da1e3819ee7be9ae79da1e3819ae7be9ae79da1e3819ee7be9ae79da1e3819a5e
EUC-JP 鄒夂擅縺樒セ夂擅縺夂セ夂擅縺樒セ夂擅縺喊 11101110110000001101010011101001110110101010001111100101111000011101110011101001100011101011111011010100111010011101101010100011111001011110000111010100111010011000111010111110110101001110100111011010101000111110010111100001110111001110100110001110101111101101010011101001110110101010001111100101111000011101001110111111 eec0d4e9daa3e5e1dce98ebed4e9daa3e5e1d4e98ebed4e9daa3e5e1dce98ebed4e9daa3e5e1d3bf
UTF-8 鄒夂擅縺樒セ夂擅縺夂セ夂擅縺樒セ夂擅縺喊 111010011000010010010010111001011010010010000010111001101001001110000101111001111011100010111010111001101010100010010010111011111011110110111110111001011010010010000010111001101001001110000101111001111011100010111010111001011010010010000010111011111011110110111110111001011010010010000010111001101001001110000101111001111011100010111010111001101010100010010010111011111011110110111110111001011010010010000010111001101001001110000101111001111011100010111010111001011001011010001010 e98492e5a482e69385e7b8bae6a892efbdbee5a482e69385e7b8bae5a482efbdbee5a482e69385e7b8bae6a892efbdbee5a482e69385e7b8bae5968a
UHC 鄒?擅????擅????擅????擅?喊 1111010111011011001111111111010010111010001111110011111100111111001111111111010010111010001111110011111100111111001111111111010010111010001111110011111100111111001111111111010010111010001111111111100111100010 f5db3ff4ba3f3f3f3ff4ba3f3f3f3ff4ba3f3f3f3ff4ba3ff9e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)