To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 闌ア遉コ謐ィ閠瑚セ杣闌ア遉コ謐ィ閠瑚セ杣^ 111010001000110010110001111001111010010010111010111001101000110110101000111010001000000010001100111010001011111010011110010110111110100010001100101100011110011110100100101110101110011010001101101010001110100010000000100011001110100010111110100111100101101101011110 e88cb1e7a4bae68da8e8808ce8be9e5be88cb1e7a4bae68da8e8808ce8be9e5b5e
EUC-JP 闌ア遉コ謐ィ閠瑚セ杣闌ア遉コ謐ィ閠瑚セ杣^ 1110111111101100100011101011000111101110101001101000111010111010111010111110110110001110101010001110111111100000101110001110101010001110101111101101101110111100111011111110110010001110101100011110111010100110100011101011101011101011111011011000111010101000111011111110000010111000111010101000111010111110110110111011110001011110 efec8eb1eea68ebaebed8ea8efe0b8ea8ebedbbcefec8eb1eea68ebaebed8ea8efe0b8ea8ebedbbc5e
UTF-8 闌ア遉コ謐ィ閠瑚セ杣闌ア遉コ謐ィ閠瑚セ杣^ 11101001100101111000110011101111101111011011000111101001100000011000100111101111101111011011101011101000101011001001000011101111101111011010100011101001100101101010000011100111100100011001101011101111101111011011111011100110100111011010001111101001100101111000110011101111101111011011000111101001100000011000100111101111101111011011101011101000101011001001000011101111101111011010100011101001100101101010000011100111100100011001101011101111101111011011111011100110100111011010001101011110 e9978cefbdb1e98189efbdbae8ac90efbda8e996a0e7919aefbdbee69da3e9978cefbdb1e98189efbdbae8ac90efbda8e996a0e7919aefbdbee69da35e
UHC ????謐??瑚??????謐??瑚??^ 00111111001111110011111100111111110110101100110100111111001111111111101111010001001111110011111100111111001111110011111100111111110110101100110100111111001111111111101111010001001111110011111101011110 3f3f3f3fdacd3f3ffbd13f3f3f3f3f3fdacd3f3ffbd13f3f5e
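
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows above. Python's codec tables may differ from this page's in a few edge-case characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset, in the same column order as the table.
    for name, (bits, hex_string) in convert("^").items():
        print(name, "^", bits, hex_string)
```

For the single character `^`, every charset in the list yields the byte `5e`, matching the final byte of each row in the table.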

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).