What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 闌ア遏ョ謐ィ閠瑚。党闌ア遏ョ謐ィ閠瑚。怒^ 111010001000110010110001111001111001111110101110111001101000110110101000111010001000000010001100111010001010000110010011011111011110100010001100101100011110011110011111101011101110011010001101101010001110100010000000100011001110100010100001100100110111101101011110 e88cb1e79faee68da8e8808ce8a1937de88cb1e79faee68da8e8808ce8a1937b5e
EUC-JP 闌ア遏ョ謐ィ閠瑚。党闌ア遏ョ謐ィ閠瑚。怒^ 1110111111101100100011101011000111101110101000011000111010101110111010111110110110001110101010001110111111100000101110001110101010001110101000011100010111011110111011111110110010001110101100011110111010100001100011101010111011101011111011011000111010101000111011111110000010111000111010101000111010100001110001011101110001011110 efec8eb1eea18eaeebed8ea8efe0b8ea8ea1c5deefec8eb1eea18eaeebed8ea8efe0b8ea8ea1c5dc5e
UTF-8 闌ア遏ョ謐ィ閠瑚。党闌ア遏ョ謐ィ閠瑚。怒^ 11101001100101111000110011101111101111011011000111101001100000011000111111101111101111011010111011101000101011001001000011101111101111011010100011101001100101101010000011100111100100011001101011101111101111011010000111100101100001011001101011101001100101111000110011101111101111011011000111101001100000011000111111101111101111011010111011101000101011001001000011101111101111011010100011101001100101101010000011100111100100011001101011101111101111011010000111100110100000001001001001011110 e9978cefbdb1e9818fefbdaee8ac90efbda8e996a0e7919aefbda1e5859ae9978cefbdb1e9818fefbdaee8ac90efbda8e996a0e7919aefbda1e680925e
UHC ????謐??瑚??????謐??瑚?怒^ 0011111100111111001111110011111111011010110011010011111100111111111110111101000100111111001111110011111100111111001111110011111111011010110011010011111100111111111110111101000100111111110100101100000101011110 3f3f3f3fdacd3f3ffbd13f3f3f3f3f3fdacd3f3ffbd13fd2c15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
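
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's implementation, which is not shown here. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about the closest Python equivalents, and the helper `to_bit_strings` is a hypothetical name. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the ISO-8859-1 and UHC rows above show.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.
# The codec mapping below is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed closest match to SJIS-Win / CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed closest match to UHC
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ^"  # hypothetical input: one hiragana character plus '^'
    for label, codec in CODECS.items():
        binary, hexa = to_bit_strings(sample, codec)
        print(label, sample, binary, hexa)
```

For example, `to_bit_strings("^", "utf-8")` returns `("01011110", "5e")`, matching the trailing `5e` in every row of the table.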