To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 嶸ァ巐餓ィ逸國 111110101011010010100111111110101011011010001001111011001010100010001000111011011001101010100000 fab4a7fab689eca888ed9aa0
EUC-JP 嶸ァ巐餓ィ逸國 10001111101110111111010010001110101001111000111110111011111110011011001011101110100011101010100010110000111011111101010010100010 8fbbf48ea78fbbf9b2ee8ea8b0efd4a2
UTF-8 嶸ァ巐餓ィ逸國 111001011011011010111000111011111011110110100111111001011011011110010000111010011010010010010011111011111011110110101000111010011000000010111000111001011001110010001011 e5b6b8efbda7e5b790e9a493efbda8e980b8e59c8b
UHC 嶸??餓?逸國 1110011110101110001111110011111111100100101110110011111111101100111011111100111111010000 e7ae3f3fe4bb3fecefcfd0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)