To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 已??堪再?猛?畔??已??堪再?猛?畔??^ 100110111101111100111111001111111000101010101100100011011100010000111111100101101101001000111111100101001100100000111111001111111001101111011111001111110011111110001010101011001000110111000100001111111001011011010010001111111001010011001000001111110011111101011110 9bdf3f3f8aac8dc43f96d23f94c83f3f9bdf3f3f8aac8dc43f96d23f94c83f3f5e
EUC-JP 已??堪再薏猛?畔??已??堪再薏猛?畔??^ 11010110111000010011111100111111101101001010111010111010110001101000111111011001110111101100110011010100001111111100100011001010001111110011111111010110111000010011111100111111101101001010111010111010110001101000111111011001110111101100110011010100001111111100100011001010001111110011111101011110 d6e13f3fb4aebac68fd9deccd43fc8ca3f3fd6e13f3fb4aebac68fd9deccd43fc8ca3f3f5e
UTF-8 已고렗堪再薏猛렭畔렲螺已고렗堪再薏猛렭畔렲羅^ 11100101101101111011001011101010101100111010000011101011101000001001011111100101101000001010101011100101100001101000110111101000100101101000111111100111100011001001101111101011101000001010110111100111100101011001010011101011101000001011001011101111101001001001000111100101101101111011001011101010101100111010000011101011101000001001011111100101101000001010101011100101100001101000110111101000100101101000111111100111100011001001101111101011101000001010110111100111100101011001010011101011101000001011001011101111101001001000111101011110 e5b7b2eab3a0eba097e5a0aae5868de8968fe78c9beba0ade79594eba0b2efa491e5b7b2eab3a0eba097e5a0aae5868de8968fe78c9beba0ade79594eba0b2efa48f5e
UHC 已고렗堪再薏猛렭畔렲螺已고렗堪再薏猛렭畔렲羅^ 111011001010101110110000111011011000111010101100110010101110110111101110101000101110101111111011110110001110110110001110101110101101101011101101100011101011111111010001110111101110110010101011101100001110110110001110101011001100101011101101111011101010001011101011111110111101100011101101100011101011101011011010111011011000111010111111110100011101110001011110 ecabb0ed8eaccaedeea2ebfbd8ed8ebadaed8ebfd1deecabb0ed8eaccaedeea2ebfbd8ed8ebadaed8ebfd1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)