To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 蜊俶キ。霎ー蛛エ 111001011000110110011000111001101011011110100001111010001011111010110000111001011000000110110100 e58d98e6b7a1e8beb0e581b4
EUC-JP 蜊俶キ。霎ー蛛エ 11101001111011011101000011101000100011101011011110001110101000011111000011000000100011101011000011101001111000011000111010110100 e9edd0e88eb78ea1f0c08eb0e9e18eb4
UTF-8 蜊俶キ。霎ー蛛エ 111010001001110010001010111001001011111110110110111011111011110110110111111011111011110110100001111010011001110010001110111011111011110110110000111010001001101110011011111011111011110110110100 e89c8ae4bfb6efbdb7efbda1e99c8eefbdb0e89b9befbdb4
UHC ??????蛛? 001111110011111100111111001111110011111100111111111100011100100000111111 3f3f3f3f3f3ff1c83f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)