To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 霑ッ蜀怜密霑ェ譎ヲ 111010001011111110101111111001011000011010010111111001011001011010100111111010001011111110101010111001101001100110100110 e8bfafe58697e596a7e8bfaae699a6
EUC-JP 霑ッ蜀怜密霑ェ譎ヲ 111100001100000110001110101011111110100111100110110011101110011111001100101010011111000011000001100011101010101011101011111110011000111010100110 f0c18eafe9e6cee7cca9f0c18eaaebf98ea6
UTF-8 霑ッ蜀怜密霑ェ譎ヲ 111010011001110010010001111011111011110110101111111010001001110010000000111001101000000010011100111001011010111110000110111010011001110010010001111011111011110110101010111010001010110110001110111011111011110110100110 e99c91efbdafe89c80e6809ce5af86e99c91efbdaae8ad8eefbda6
UHC 霑?蜀怜密霑?譎? 111011111100010100111111111101011011100111010110101110111101101011001011111011111100010100111111111111011101001000111111 efc53ff5b9d6bbdacbefc53ffdd23f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)