To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 﨟ァ隴偃〒隴録 11111011100111011010011111101000101011011001100011101110100000011010011111101000101011011001100001011110 fb9da7e8ad98ee81a7e8ad985e
EUC-JP ?ァ隴偃〒隴録 00111111100011101010011111110000101011111101000011110000101000101010100111110000101011111100111110111111 3f8ea7f0afd0f0a2a9f0afcfbf
UTF-8 﨟ァ隴偃〒隴録 111011111010100010011111111011111011110110100111111010011001101010110100111001011000000110000011111000111000000010010010111010011001101010110100111010011000110010110010 efa89fefbda7e99ab4e58183e38092e99ab4e98cb2
UHC ???偃??? 0011111100111111001111111110010111100111001111110011111100111111 3f3f3fe5e73f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)