To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 魄擾ス、魃誹雌陟擾セ殉魄擾ス、魃誹雌陟擾セ旬^ 111010011010111010001111111011111011110110100100111010011010111110010100111011101000111010010011111010001010000010001111111011111011111010001111011111011110100110101110100011111110111110111101101001001110100110101111100101001110111010001110100100111110100010100000100011111110111110111110100011110111101101011110 e9ae8fefbda4e9af94ee8e93e8a08fefbe8f7de9ae8fefbda4e9af94ee8e93e8a08fefbe8f7b5e
EUC-JP 魄擾ス、魃誹雌陟擾セ殉魄擾ス、魃誹雌陟擾セ旬^ 111100101011000010111110111100011000111010111101100011101010010011110010101100011100100011110000101110111111001111110000101000101011111011110001100011101011111010111101110111101111001010110000101111101111000110001110101111011000111010100100111100101011000111001000111100001011101111110011111100001010001010111110111100011000111010111110101111011101110001011110 f2b0bef18ebd8ea4f2b1c8f0bbf3f0a2bef18ebebddef2b0bef18ebd8ea4f2b1c8f0bbf3f0a2bef18ebebddc5e
UTF-8 魄擾ス、魃誹雌陟擾セ殉魄擾ス、魃誹雌陟擾セ旬^ 11101001101011011000010011100110100100111011111011101111101111011011110111101111101111011010010011101001101011011000001111101000101010101011100111101001100110111000110011101001100110011001111111100110100100111011111011101111101111011011111011100110101011101000100111101001101011011000010011100110100100111011111011101111101111011011110111101111101111011010010011101001101011011000001111101000101010101011100111101001100110111000110011101001100110011001111111100110100100111011111011101111101111011011111011100110100101111010110001011110 e9ad84e693beefbdbdefbda4e9ad83e8aab9e99b8ce9999fe693beefbdbee6ae89e9ad84e693beefbdbdefbda4e9ad83e8aab9e99b8ce9999fe693beefbdbee697ac5e
UHC 魄擾??魃誹雌陟擾?殉魄擾??魃誹雌陟擾?旬^ 110110111101111011101000111101100011111100111111110110111010011011011110101001101110110111000001111101001011001111101000111101100011111111100010111001101101101111011110111010001111011000111111001111111101101110100110110111101010011011101101110000011111010010110011111010001111011000111111111000101110001001011110 dbdee8f63f3fdba6dea6edc1f4b3e8f63fe2e6dbdee8f63f3fdba6dea6edc1f4b3e8f63fe2e25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)