What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e |
| SJIS-WIN | 蠖「鮴怡蠖「鮴悳^ | 111001011011110110100010111010011011111010011100011111011110010110111101101000101110100110111110100111000111101101011110 | e5bda2e9be9c7de5bda2e9be9c7b5e |
| EUC-JP | 蠖「鮴怡蠖「鮴悳^ | 1110101010111111100011101010001011110010110000001101011111011110111010101011111110001110101000101111001011000000110101111101110001011110 | eabf8ea2f2c0d7deeabf8ea2f2c0d7dc5e |
| UTF-8 | 蠖「鮴怡蠖「鮴悳^ | 11101000101000001001011011101111101111011010001011101001101011101011010011100110100000001010000111101000101000001001011011101111101111011010001011101001101011101011010011100110100000101011001101011110 | e8a096efbda2e9aeb4e680a1e8a096efbda2e9aeb4e682b35e |
| UHC | ???怡???悳^ | 0011111100111111001111111110110010101110001111110011111100111111110100111110110101011110 | 3f3f3fecae3f3f3fd3ed5e |
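
A conversion like the one tabulated above can be approximated in a few lines of Python. This is only a sketch, not the converter's actual implementation: the helper name `bit_strings` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions, and Python's codecs may differ from the converter's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    # "^" is plain ASCII, so every charset listed encodes it as the byte 5e.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings("^", codec)
        print(label, binary, hexadecimal)
```

Passing a longer string, such as the one in the table, yields one concatenated bit string per charset in the same way.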

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).