What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 短綜銑短綜銑^ 10010010010110101001000110001110100100010100110010010010010110101001000110001110100100010100110001011110 925a918e914c925a918e914c5e
EUC-JP 短綜銑短綜銑^ 11000011101110111100000111101110110000011010110111000011101110111100000111101110110000011010110101011110 c3bbc1eec1adc3bbc1eec1ad5e
UTF-8 短綜銑短綜銑^ 11100111100111111010110111100111101101101001110011101001100010101001000111100111100111111010110111100111101101101001110011101001100010101001000101011110 e79fade7b69ce98a91e79fade7b69ce98a915e
UHC 短綜銑短綜銑^ 11010011101011011111000011111100111000001101010111010011101011011111000011111100111000001101010101011110 d3adf0fce0d5d3adf0fce0d55e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
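
The conversion shown in the table can be approximated with a short Python sketch. It is not the converter's own code. The codec names are assumptions: `cp932` stands in for SJIS-WIN (the page itself equates SJIS-Win with CP932), and `cp949` is assumed to correspond to UHC. Replacing unencodable characters with `?` is also an assumption, chosen because the ISO-8859-1 row consists of `3f` bytes. `CHARSETS` and `encode_table` are hypothetical names.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated without spaces.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), binary, raw.hex()))
    return rows


for label, shown, binary, hexstr in encode_table("短綜銑短綜銑^"):
    print(label, shown, binary, hexstr)
```

Each printed line follows the column order of the table: charset, character, binary bit string, hexadecimal bit string.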