To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????D????D^ 0011111100111111001111110011111101000100001111110011111100111111001111110100010001011110 3f3f3f3f443f3f3f3f445e
SJIS-WIN 汚??肯D汚??肯D^ 100010011001100000111111001111111000110101101101010001001000100110011000001111110011111110001101011011010100010001011110 89983f3f8d6d4489983f3f8d6d445e
EUC-JP 汚??肯D汚??肯D^ 101100011111100000111111001111111011100111001110010001001011000111111000001111110011111110111001110011100100010001011110 b1f83f3fb9ce44b1f83f3fb9ce445e
UTF-8 汚믣퉼肯D汚믣퉼肯D^ 111001101011000110011010111010111010111110100011111011011000100110111100111010001000001010101111010001001110011010110001100110101110101110101111101000111110110110001001101111001110100010000010101011110100010001011110 e6b19aebafa3ed89bce882af44e6b19aebafa3ed89bce882af445e
UHC 汚믣퉼肯D汚믣퉼肯D^ 11100111111111011001001011100101101110011001010011010000111010010100010011100111111111011001001011100101101110011001010011010000111010010100010001011110 e7fd92e5b994d0e944e7fd92e5b994d0e9445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)