To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e
SJIS-WIN | 惑薹繽惑薹繽^ | 10011000011001101110010101010110111000111000111110011000011001101110010101010110111000111000111101011110 | 9866e556e38f9866e556e38f5e
EUC-JP | 惑薹繽惑薹繽^ | 11001111110001111110100110110111111001011110111111001111110001111110100110110111111001011110111101011110 | cfc7e9b7e5efcfc7e9b7e5ef5e
UTF-8 | 惑薹繽惑薹繽^ | 11100110100000111001000111101000100101101011100111100111101110011011110111100110100000111001000111101000100101101011100111100111101110011011110101011110 | e68391e896b9e7b9bde68391e896b9e7b9bd5e
UHC | 惑??惑??^ | 111110111110001100111111001111111111101111100011001111110011111101011110 | fbe33f3ffbe33f3f5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
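
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the labels above, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row. Python's tables may not match this page's for every character, so some rows can differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("惑薹繽惑薹繽^").items():
        print(label, bits, hexstr)
```

Each byte is formatted as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.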