What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 騾托スヲ郢ァ螫「豎嘲騾托スヲ郢ァ螫「豎嘴^ 111010011000000010010001111011111011110110100110111001111011100110100111111001011010110010100010111001101011000110011010011111011110100110000000100100011110111110111101101001101110011110111001101001111110010110101100101000101110011010110001100110100111101101011110 e98091efbda6e7b9a7e5aca2e6b19a7de98091efbda6e7b9a7e5aca2e6b19a7b5e
EUC-JP 騾托スヲ郢ァ螫「豎嘲騾托スヲ郢ァ螫「豎嘴^ 1111000111100000110000101111000110001110101111011000111010100110111011101011101110001110101001111110101010101110100011101010001011101100101100111101001111011110111100011110000011000010111100011000111010111101100011101010011011101110101110111000111010100111111010101010111010001110101000101110110010110011110100111101110001011110 f1e0c2f18ebd8ea6eebb8ea7eaae8ea2ecb3d3def1e0c2f18ebd8ea6eebb8ea7eaae8ea2ecb3d3dc5e
UTF-8 騾托スヲ郢ァ螫「豎嘲騾托スヲ郢ァ螫「豎嘴^ 11101001101010001011111011100110100010011001100011101111101111011011110111101111101111011010011011101001100000111010001011101111101111011010011111101000100111101010101111101111101111011010001011101000101100011000111011100101100110001011001011101001101010001011111011100110100010011001100011101111101111011011110111101111101111011010011011101001100000111010001011101111101111011010011111101000100111101010101111101111101111011010001011101000101100011000111011100101100110001011010001011110 e9a8bee68998efbdbdefbda6e983a2efbda7e89eabefbda2e8b18ee598b2e9a8bee68998efbdbdefbda6e983a2efbda7e89eabefbda2e8b18ee598b45e
UHC ?托???????嘲?托???????嘴^ 00111111111101101111010100111111001111110011111100111111001111110011111100111111111100001011111100111111111101101111010100111111001111110011111100111111001111110011111100111111111101101010010001011110 3ff6f53f3f3f3f3f3f3ff0bf3ff6f53f3f3f3f3f3f3ff6a45e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
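
The same kind of conversion can be approximated offline. The following is a minimal sketch, not this page's actual implementation: it assumes Python's standard codecs `cp932` and `cp949` are acceptable stand-ins for SJIS-WIN and UHC, and that characters a charset cannot represent should become `?` (0x3f), as in the ISO-8859-1 row above. The names `CHARSETS`, `to_bit_strings`, and `convert` are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {
        label: to_bit_strings(text, codec)
        for label, codec in CHARSETS.items()
    }


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `convert("^")` yields `01011110` and `5e` for every charset, since `^` is plain ASCII; this matches the final byte of each row in the table above.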