Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 症ヌ汐鴆竺疾 10001111110001111100011110001110101011001110100111101111100011101011000111110010101100101000111010111110 8fc7c78eace9ef8eb1f2b28ebe
EUC-JP 症ヌ汐鴆竺?疾 10111110110010011000111011000111101111001010111011110010111100011011110010110011001111111011110011000000 bec98ec7bcaef2f1bcb33fbcc0
UTF-8 症ヌ汐鴆竺疾 111001111001011110000111111011111011111010000111111001101011000110010000111010011011010010000110111001111010101110111010111011101000011110101001111001111001011010111110 e79787efbe87e6b190e9b486e7abbaee87a9e796be
UHC 症?汐?竺?疾 1111000111111000001111111110000010110001001111111111010111100111001111111111001011110000 f1f83fe0b13ff5e73ff2f0
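A table like the one above can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the use of errors="replace" to produce the "?" (0x3f) shown for unencodable characters. The names CHARSETS and encode_table are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent are replaced with "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        # Each byte is rendered as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("症"):
        print(*row)
```

Decoding the encoded bytes again, as the second column does, makes any lossy replacement visible in the same way the "?" entries do in the table.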

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).