To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ëû¨Dëû¨D^ 111010111111101110101000010001001110101111111011101010000100010001011110 ebfba844ebfba8445e
SJIS-WIN ??¨D??¨D^ 0011111100111111100000010100111001000100001111110011111110000001010011100100010001011110 3f3f814e443f3f814e445e
EUC-JP ëû¨Dëû¨D^ 10001111101010111011001110001111101010111110010110100001101011110100010010001111101010111011001110001111101010111110010110100001101011110100010001011110 8fabb38fabe5a1af448fabb38fabe5a1af445e
UTF-8 ëû¨Dëû¨D^ 110000111010101111000011101110111100001010101000010001001100001110101011110000111011101111000010101010000100010001011110 c3abc3bbc2a844c3abc3bbc2a8445e
UHC ??¨D??¨D^ 0011111100111111101000011010011101000100001111110011111110100001101001110100010001011110 3f3fa1a7443f3fa1a7445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)