To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 šHšXun}šHšXun{^ 100110100100100010011010010110000111010101101110011111011001101001001000100110100101100001110101011011100111101101011110 9a489a58756e7d9a489a58756e7b5e
SJIS-WIN ?H?Xun}?H?Xun{^ 001111110100100000111111010110000111010101101110011111010011111101001000001111110101100001110101011011100111101101011110 3f483f58756e7d3f483f58756e7b5e
EUC-JP ?H?Xun}?H?Xun{^ 001111110100100000111111010110000111010101101110011111010011111101001000001111110101100001110101011011100111101101011110 3f483f58756e7d3f483f58756e7b5e
UTF-8 šHšXun}šHšXun{^ 11000010100110100100100011000010100110100101100001110101011011100111110111000010100110100100100011000010100110100101100001110101011011100111101101011110 c29a48c29a58756e7dc29a48c29a58756e7b5e
UHC ?H?Xun}?H?Xun{^ 001111110100100000111111010110000111010101101110011111010011111101001000001111110101100001110101011011100111101101011110 3f483f58756e7d3f483f58756e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
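
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the function name encode_bits is made up for illustration, the codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are Python's own and are assumed to correspond to the labels above, and errors="replace" is assumed to mirror the "?" substitution seen for characters a charset cannot represent.

```python
def encode_bits(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # Assumption: unencodable characters are replaced with "?" (0x3f),
    # as the "?" entries in the table suggest.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Table label -> Python codec name (the mapping is an assumption).
    charsets = {
        "ISO-8859-1": "iso-8859-1",
        "SJIS-WIN": "cp932",
        "EUC-JP": "euc_jp",
        "UTF-8": "utf-8",
        "UHC": "cp949",
    }
    sample = "\u009aH"  # U+009A followed by "H", as at the start of the input above
    for label, codec in charsets.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.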