To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 šŸ—ÐnfšŸ—Ðn^}YšŸ—ÐnfšŸ—Ðn^}bE 1001101010011111100101111101000001101110011001101001101010011111100101111101000001101110010111100111110101011001100110101001111110010111110100000110111001100110100110101001111110010111110100000110111001011110011111010110001001000101 9a9f97d06e669a9f97d06e5e7d599a9f97d06e669a9f97d06e5e7d6245
SJIS-WIN ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
EUC-JP ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
UTF-8 šŸ—ÐnfšŸ—Ðn^}YšŸ—ÐnfšŸ—Ðn^}bE 110000101001101011000010100111111100001010010111110000111001000001101110011001101100001010011010110000101001111111000010100101111100001110010000011011100101111001111101010110011100001010011010110000101001111111000010100101111100001110010000011011100110011011000010100110101100001010011111110000101001011111000011100100000110111001011110011111010110001001000101 c29ac29fc297c3906e66c29ac29fc297c3906e5e7d59c29ac29fc297c3906e66c29ac29fc297c3906e5e7d6245
UHC ???Ðnf???Ðn^}Y???Ðnf???Ðn^}bE 001111110011111100111111101010001010001001101110011001100011111100111111001111111010100010100010011011100101111001111101010110010011111100111111001111111010100010100010011011100110011000111111001111110011111110101000101000100110111001011110011111010110001001000101 3f3f3fa8a26e663f3f3fa8a26e5e7d593f3f3fa8a26e663f3f3fa8a26e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)