To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 霑「遲宋霑「遲宋B 111010001011111110100010111001111010110110010001011101101110100010111111101000101110011110101101100100010111011001000010 e8bfa2e7ad9176e8bfa2e7ad917642
EUC-JP 霑「遲宋霑「遲宋B 1111000011000001100011101010001011101110101011111100000111010111111100001100000110001110101000101110111010101111110000011101011101000010 f0c18ea2eeafc1d7f0c18ea2eeafc1d742
UTF-8 霑「遲宋霑「遲宋B 11101001100111001001000111101111101111011010001011101001100000011011001011100101101011101000101111101001100111001001000111101111101111011010001011101001100000011011001011100101101011101000101101000010 e99c91efbda2e981b2e5ae8be99c91efbda2e981b2e5ae8b42
UHC 霑?遲宋霑?遲宋B 111011111100010100111111111100101100000011100001111001001110111111000101001111111111001011000000111000011110010001000010 efc53ff2c0e1e4efc53ff2c0e1e442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)