To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 霑ェ迢思霑ェ迢思B 111010001011111110101010111001111000101110001110011101101110100010111111101010101110011110001011100011100111011001000010 e8bfaae78b8e76e8bfaae78b8e7642
EUC-JP 霑ェ迢思霑ェ迢思B 1111000011000001100011101010101011101101111010111011101111010111111100001100000110001110101010101110110111101011101110111101011101000010 f0c18eaaedebbbd7f0c18eaaedebbbd742
UTF-8 霑ェ迢思霑ェ迢思B 11101001100111001001000111101111101111011010101011101000101111111010001011100110100000001001110111101001100111001001000111101111101111011010101011101000101111111010001011100110100000001001110101000010 e99c91efbdaae8bfa2e6809de99c91efbdaae8bfa2e6809d42
UHC 霑??思霑??思B 11101111110001010011111100111111110111101101011011101111110001010011111100111111110111101101011001000010 efc53f3fded6efc53f3fded642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)