What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????k[}????????k[{^ 0011111100111111001111110011111100111111001111110011111100111111011010110101101101111101001111110011111100111111001111110011111100111111001111110011111101101011010110110111101101011110 3f3f3f3f3f3f3f3f6b5b7d3f3f3f3f3f3f3f3f6b5b7b5e
SJIS-WIN 堤???沚基??k[}堤???沚基??k[{^ 1001001011100111001111110011111100111111100111111000110110001010111011100011111100111111011010110101101101111101100100101110011100111111001111110011111110011111100011011000101011101110001111110011111101101011010110110111101101011110 92e73f3f3f9f8d8aee3f3f6b5b7d92e73f3f3f9f8d8aee3f3f6b5b7b5e
EUC-JP 堤???沚基??k[}堤???沚基??k[{^ 1100010011101001001111110011111100111111110111011110110110110100111100000011111100111111011010110101101101111101110001001110100100111111001111110011111111011101111011011011010011110000001111110011111101101011010110110111101101011110 c4e93f3f3fddedb4f03f3f6b5b7dc4e93f3f3fddedb4f03f3f6b5b7b5e
UTF-8 堤비렰렑沚基렰렖k[}堤비렰렑沚基렰렖k[{^ 11100101101000001010010011101011101110011000010011101011101000001011000011101011101000001001000111100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001011001101011010110110111110111100101101000001010010011101011101110011000010011101011101000001011000011101011101000001001000111100110101100101001101011100101100111111011101011101011101000001011000011101011101000001001011001101011010110110111101101011110 e5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba0966b5b7de5a0a4ebb984eba0b0eba091e6b29ae59fbaeba0b0eba0966b5b7b5e
UHC 堤비렰렑沚基렰렖k[}堤비렰렑沚基렰렖k[{^ 111100001010011110111010111100011000111010111101100011101010011011110010101011111101000011110001100011101011110110001110101010110110101101011011011111011111000010100111101110101111000110001110101111011000111010100110111100101010111111010000111100011000111010111101100011101010101101101011010110110111101101011110 f0a7baf18ebd8ea6f2afd0f18ebd8eab6b5b7df0a7baf18ebd8ea6f2afd0f18ebd8eab6b5b7b5e

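SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.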
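
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's own, and pairing them with the labels above is an assumption. The names `CHARSETS` and `encode_all` are hypothetical, chosen for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes in the table.

```python
# Page label -> Python codec name. This pairing is an assumption;
# the page itself may use a different conversion library.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_all("k[}").items():
        print(label, bits, hexa)
```

For ASCII-only input such as "k[}", every charset listed here yields the same bytes (6b5b7d); the rows differ only for characters outside ASCII.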