To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 熱??茹??業??}熱??茹??業??{^ 100101000100110100111111001111111110010010100101001111110011111110001011110001100011111100111111011111011001010001001101001111110011111111100100101001010011111100111111100010111100011000111111001111110111101101011110 944d3f3fe4a53f3f8bc63f3f7d944d3f3fe4a53f3f8bc63f3f7b5e
EUC-JP 熱??茹??業??}熱??茹??業??{^ 110001111010111000111111001111111110100010100111001111110011111110110110110010000011111100111111011111011100011110101110001111110011111111101000101001110011111100111111101101101100100000111111001111110111101101011110 c7ae3f3fe8a73f3fb6c83f3f7dc7ae3f3fe8a73f3fb6c83f3f7b5e
UTF-8 熱뗫젣茹띾젶業깅젻}熱뗫젣茹띾젶業깅젻{^ 111001111000011010110001111010111001011110101011111011001010000010100011111010001000110010111001111010111001110110111110111011001010000010110110111001101010010110101101111010101011100110000101111011001010000010111011011111011110011110000110101100011110101110010111101010111110110010100000101000111110100010001100101110011110101110011101101111101110110010100000101101101110011010100101101011011110101010111001100001011110110010100000101110110111101101011110 e786b1eb97abeca0a3e88cb9eb9dbeeca0b6e6a5adeab985eca0bb7de786b1eb97abeca0a3e88cb9eb9dbeeca0b6e6a5adeab985eca0bb7b5e
UHC 熱뗫젣茹띾젶業깅젻}熱뗫젣茹띾젶業깅젻{^ 111001101111000010001011111010111010000010011100111001101010101010001101111010111010000010101010111001011111011010110001111010111010000010101110011111011110011011110000100010111110101110100000100111001110011010101010100011011110101110100000101010101110010111110110101100011110101110100000101011100111101101011110 e6f08beba09ce6aa8deba0aae5f6b1eba0ae7de6f08beba09ce6aa8deba0aae5f6b1eba0ae7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)