What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荳ケ螻槫羡荳ケ螻槫穀荳ケ螻槫羡荳ケ螻槫穀^ 11100100101110001011100111100101101100011001111011100101111110111001000111100100101110001011100111100101101100011001111011100101100011011001001011100100101110001011100111100101101100011001111011100101111110111001000111100100101110001011100111100101101100011001111011100101100011011001001001011110 e4b8b9e5b19ee5fb91e4b8b9e5b19ee58d92e4b8b9e5b19ee5fb91e4b8b9e5b19ee58d925e
EUC-JP 荳ケ螻槫羡荳ケ螻槫穀荳ケ螻槫羡荳ケ螻槫穀^ 11101000101110101000111010111001111010101011001111011100111001111000111111010101101011101110100010111010100011101011100111101010101100111101110011100111101110011111001011101000101110101000111010111001111010101011001111011100111001111000111111010101101011101110100010111010100011101011100111101010101100111101110011100111101110011111001001011110 e8ba8eb9eab3dce78fd5aee8ba8eb9eab3dce7b9f2e8ba8eb9eab3dce78fd5aee8ba8eb9eab3dce7b9f25e
UTF-8 荳ケ螻槫羡荳ケ螻槫穀荳ケ螻槫羡荳ケ螻槫穀^ 11101000100011011011001111101111101111011011100111101000100111101011101111100110101001111010101111100111101111101010000111101000100011011011001111101111101111011011100111101000100111101011101111100110101001111010101111100111101010011000000011101000100011011011001111101111101111011011100111101000100111101011101111100110101001111010101111100111101111101010000111101000100011011011001111101111101111011011100111101000100111101011101111100110101001111010101111100111101010011000000001011110 e88db3efbdb9e89ebbe6a7abe7bea1e88db3efbdb9e89ebbe6a7abe7a980e88db3efbdb9e89ebbe6a7abe7bea1e88db3efbdb9e89ebbe6a7abe7a9805e
UHC 荳????荳???穀荳????荳???穀^ 110101001110010100111111001111110011111100111111110101001110010100111111001111110011111111001101110110101101010011100101001111110011111100111111001111111101010011100101001111110011111100111111110011011101101001011110 d4e53f3f3f3fd4e53f3f3fcddad4e53f3f3f3fd4e53f3f3fcdda5e
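
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the function names `encode_row` and `encode_table` and the `CODECS` mapping are hypothetical, and the mapping assumes that Python's `cp932` and `cp949` codecs correspond to the SJIS-WIN and UHC rows, which may not hold for every character. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The tool's own converter may differ from these codecs in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("^").items():
        print(label, bits, hex_string)
```

For an ASCII character such as `^`, every charset listed here yields the same single byte, which is why each row of the table ends in `5e`.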

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).