To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 اn}اn{^ 110110001010011101101110011111011101100010100111011011100111101101011110 d8a76e7dd8a76e7b5e
SJIS-WIN ?§n}?§n{^ 0011111110000001100110000110111001111101001111111000000110011000011011100111101101011110 3f81986e7d3f81986e7b5e
EUC-JP اn}اn{^ 100011111010100110101100101000011111100001101110011111011000111110101001101011001010000111111000011011100111101101011110 8fa9aca1f86e7d8fa9aca1f86e7b5e
UTF-8 اn}اn{^ 11000011100110001100001010100111011011100111110111000011100110001100001010100111011011100111101101011110 c398c2a76e7dc398c2a76e7b5e
UHC اn}اn{^ 10101000101010101010000111010111011011100111110110101000101010101010000111010111011011100111101101011110 a8aaa1d76e7da8aaa1d76e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)