To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN シス鹿ク釐ッ}シス鹿ク釐ッ{^ 11110000101101001011110011110001100011101011110110001110101011011111000010101111101110001110011111011000101011110111110111110000101101001011110011110001100011101011110110001110101011011111000010101111101110001110011111011000101011110111101101011110 f0b4bcf18ebd8eadf0afb8e7d8af7df0b4bcf18ebd8eadf0afb8e7d8af7b5e
EUC-JP ?シ?ス鹿?ク釐ッ}?シ?ス鹿?ク釐ッ{^ 001111111000111010111100001111111000111010111101101111001010111100111111100011101011100011101110110110101000111010101111011111010011111110001110101111000011111110001110101111011011110010101111001111111000111010111000111011101101101010001110101011110111101101011110 3f8ebc3f8ebdbcaf3f8eb8eeda8eaf7d3f8ebc3f8ebdbcaf3f8eb8eeda8eaf7b5e
UTF-8 シス鹿ク釐ッ}シス鹿ク釐ッ{^ 111011101000000110110011111011111011110110111100111011101000010010001001111011111011110110111101111010011011100110111111111011101000000110101110111011111011110110111000111010011000011110010000111011111011110110101111011111011110111010000001101100111110111110111101101111001110111010000100100010011110111110111101101111011110100110111001101111111110111010000001101011101110111110111101101110001110100110000111100100001110111110111101101011110111101101011110 ee81b3efbdbcee8489efbdbde9b9bfee81aeefbdb8e98790efbdaf7dee81b3efbdbcee8489efbdbde9b9bfee81aeefbdb8e98790efbdaf7b5e
UHC ????鹿??釐?}????鹿??釐?{^ 00111111001111110011111100111111110101101110001100111111001111111101011111101101001111110111110100111111001111110011111100111111110101101110001100111111001111111101011111101101001111110111101101011110 3f3f3f3fd6e33f3fd7ed3f7d3f3f3f3fd6e33f3fd7ed3f7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
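
The conversion shown in the table can be approximated with a short Python sketch. It is not the implementation behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which is how the `3f` bytes in the table appear to arise. Python's conversion tables may differ from the original tool's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The original tool may use different conversion tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary string, hexadecimal string.
    for label, (binary, hexadecimal) in encode_table("鹿{^").items():
        print(label, binary, hexadecimal)
```

For example, `encode_row("鹿", "utf-8")` yields the hexadecimal string `e9b9bf`, the same three bytes that appear for 鹿 in the UTF-8 row above, while `encode_row("鹿", "iso-8859-1")` yields `3f` because ISO-8859-1 has no code for that character.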