To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 å^kW}å^kW{^ 11000011101001010101111001101011010101110111110111000011101001010101111001101011010101110111101101011110 c3a55e6b577dc3a55e6b577b5e
SJIS-WIN ?¥^kW}?¥^kW{^ 001111111000000110001111010111100110101101010111011111010011111110000001100011110101111001101011010101110111101101011110 3f818f5e6b577d3f818f5e6b577b5e
EUC-JP Ã?^kW}Ã?^kW{^ 1000111110101010101010100011111101011110011010110101011101111101100011111010101010101010001111110101111001101011010101110111101101011110 8faaaa3f5e6b577d8faaaa3f5e6b577b5e
UTF-8 å^kW}å^kW{^ 1100001110000011110000101010010101011110011010110101011101111101110000111000001111000010101001010101111001101011010101110111101101011110 c383c2a55e6b577dc383c2a55e6b577b5e
UHC ??^kW}??^kW{^ 00111111001111110101111001101011010101110111110100111111001111110101111001101011010101110111101101011110 3f3f5e6b577d3f3f5e6b577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)