What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e
SJIS-WIN | 巐麗嘯巐麗嘯^ | 11111010101101101001011111101101100110101000001011111010101101101001011111101101100110101000001001011110 | fab697ed9a82fab697ed9a825e
EUC-JP | 巐麗嘯巐麗嘯^ | 100011111011101111111001110011101110111111010011111000101000111110111011111110011100111011101111110100111110001001011110 | 8fbbf9ceefd3e28fbbf9ceefd3e25e
UTF-8 | 巐麗嘯巐麗嘯^ | 11100101101101111001000011101001101110101001011111100101100110001010111111100101101101111001000011101001101110101001011111100101100110001010111101011110 | e5b790e9ba97e598afe5b790e9ba97e598af5e
UHC | ?麗嘯?麗嘯^ | 0011111111010101111100101110000110110000001111111101010111110010111000011011000001011110 | 3fd5f2e1b03fd5f2e1b05e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
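
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper names (to_bits, encode_row) and the mapping from the page's charset labels to Python codec names are assumptions. Python's codecs may also treat vendor-extension characters differently from the converter used here, so some rows (EUC-JP in particular) may not come out byte-for-byte identical. Characters that a charset cannot represent are replaced with "?", which is how the ISO-8859-1 row ends up as a run of 3f bytes.

```python
def to_bits(data: bytes) -> str:
    """Render bytes as one continuous string of 0s and 1s, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple:
    """Return (text as it survives in the charset, binary string, hex string).

    Unencodable characters are replaced with '?', so the first element
    shows what is lost in that charset.
    """
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec, errors="replace")
    return shown, to_bits(data), data.hex()


# Assumed mapping from the labels in the table to Python codec names;
# these are approximations, not necessarily the same conversion tables.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    # The sample input from the table, written with escapes so the
    # example does not depend on the source file's own encoding.
    text = "\u5dd0\u9e97\u562f\u5dd0\u9e97\u562f^"
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row(text, codec)
        print(label, shown, bits, hexed)
```

The binary column is just the hexadecimal column written out bit by bit, so each byte always contributes exactly eight digits, including leading zeros.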