To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ßÞãÛãÆßÞB 110111111101111011100011110110111110001111000110110111111101111001000010 dfdee3dbe3c6dfde42
SJIS-WIN ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
EUC-JP ßÞãÛãÆßÞB 10001111101010011100111010001111101010011011000010001111101010111010101010001111101010101110010110001111101010111010101010001111101010011010000110001111101010011100111010001111101010011011000001000010 8fa9ce8fa9b08fabaa8faae58fabaa8fa9a18fa9ce8fa9b042
UTF-8 ßÞãÛãÆßÞB 1100001110011111110000111001111011000011101000111100001110011011110000111010001111000011100001101100001110011111110000111001111001000010 c39fc39ec3a3c39bc3a3c386c39fc39e42
UHC ßÞ???ÆßÞB 1010100110101100101010001010110100111111001111110011111110101000101000011010100110101100101010001010110101000010 a9aca8ad3f3f3fa8a1a9aca8ad42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
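
The rows of the table above can be approximated offline. The following is a minimal sketch in Python, not this page's own implementation: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the SJIS-WIN and UHC rows. Python's codecs may differ from the page's converter for some characters.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("ßÞãÛãÆßÞB").items():
        print(label, bits, hex_string)
```

For a quick check, `encode_row("B", "utf-8")` gives the final 8 bits and 2 hex digits that every row in the table ends with, since `B` is encoded as the single byte `42` in all five charsets.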