To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 熱??猥??阿??}熱??猥??阿??{^ 100101000100110100111111001111111110000011001110001111110011111110001000101000100011111100111111011111011001010001001101001111110011111111100000110011100011111100111111100010001010001000111111001111110111101101011110 944d3f3fe0ce3f3f88a23f3f7d944d3f3fe0ce3f3f88a23f3f7b5e
EUC-JP 熱??猥??阿??}熱??猥??阿??{^ 110001111010111000111111001111111110000011010000001111110011111110110000101001000011111100111111011111011100011110101110001111110011111111100000110100000011111100111111101100001010010000111111001111110111101101011110 c7ae3f3fe0d03f3fb0a43f3f7dc7ae3f3fe0d03f3fb0a43f3f7b5e
UTF-8 熱뗫졁猥띾젛阿숇젇}熱뗫졁猥띾젛阿숇젇{^ 111001111000011010110001111010111001011110101011111011001010000110000001111001111000110010100101111010111001110110111110111011001010000010011011111010011001100010111111111011001000100010000111111011001010000010000111011111011110011110000110101100011110101110010111101010111110110010100001100000011110011110001100101001011110101110011101101111101110110010100000100110111110100110011000101111111110110010001000100001111110110010100000100001110111101101011110 e786b1eb97abeca181e78ca5eb9dbeeca09be998bfec8887eca0877de786b1eb97abeca181e78ca5eb9dbeeca09be998bfec8887eca0877b5e
UHC 熱뗫졁猥띾젛阿숇젇}熱뗫졁猥띾젛阿숇젇{^ 111001101111000010001011111010111010000010110010111010001110010110001101111010111010000010010111111001001011100110011001111010111010000010001010011111011110011011110000100010111110101110100000101100101110100011100101100011011110101110100000100101111110010010111001100110011110101110100000100010100111101101011110 e6f08beba0b2e8e58deba097e4b999eba08a7de6f08beba0b2e8e58deba097e4b999eba08a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)