Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 熱??茹??檍??}熱??茹??檍??{^ 100101000100110100111111001111111110010010100101001111110011111110011110111110000011111100111111011111011001010001001101001111110011111111100100101001010011111100111111100111101111100000111111001111110111101101011110 944d3f3fe4a53f3f9ef83f3f7d944d3f3fe4a53f3f9ef83f3f7b5e
EUC-JP 熱??茹??檍??}熱??茹??檍??{^ 110001111010111000111111001111111110100010100111001111110011111111011100111110100011111100111111011111011100011110101110001111110011111111101000101001110011111100111111110111001111101000111111001111110111101101011110 c7ae3f3fe8a73f3fdcfa3f3f7dc7ae3f3fe8a73f3fdcfa3f3f7b5e
UTF-8 熱뗫젣茹띾젷檍용젇}熱뗫젣茹띾젷檍용젇{^ 111001111000011010110001111010111001011110101011111011001010000010100011111010001000110010111001111010111001110110111110111011001010000010110111111001101010101010001101111011001001101010101001111011001010000010000111011111011110011110000110101100011110101110010111101010111110110010100000101000111110100010001100101110011110101110011101101111101110110010100000101101111110011010101010100011011110110010011010101010011110110010100000100001110111101101011110 e786b1eb97abeca0a3e88cb9eb9dbeeca0b7e6aa8dec9aa9eca0877de786b1eb97abeca0a3e88cb9eb9dbeeca0b7e6aa8dec9aa9eca0877b5e
UHC 熱뗫젣茹띾젷檍용젇}熱뗫젣茹띾젷檍용젇{^ 111001101111000010001011111010111010000010011100111001101010101010001101111010111010000010101011111001011110010110111111111010111010000010001010011111011110011011110000100010111110101110100000100111001110011010101010100011011110101110100000101010111110010111100101101111111110101110100000100010100111101101011110 e6f08beba09ce6aa8deba0abe5e5bfeba08a7de6f08beba09ce6aa8deba0abe5e5bfeba08a7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
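
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's actual implementation: the function name encode_table is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the 3f bytes in the table arise.

```python
# Sketch: encode a string in several charsets and show the bytes as
# binary and hexadecimal strings. Codec names are assumed equivalents
# of the charsets in the table, not taken from the original tool.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("熱{^"):
        print(name, bits, hex_string)
```

For the single character 熱, this gives e786b1 in UTF-8, 944d in CP932, and c7ae in EUC-JP, matching the leading bytes of the corresponding rows above; ISO-8859-1 cannot represent it and yields 3f.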