To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???凝?????[???凝?????[^ 0011111100111111001111111000101111000011001111110011111100111111001111110011111101011011001111110011111100111111100010111100001100111111001111110011111100111111001111110101101101011110 3f3f3f8bc33f3f3f3f3f5b3f3f3f8bc33f3f3f3f3f5b5e
EUC-JP 薏??凝?????[薏??凝?????[^ 100011111101100111011110001111110011111110110110110001010011111100111111001111110011111100111111010110111000111111011001110111100011111100111111101101101100010100111111001111110011111100111111001111110101101101011110 8fd9de3f3fb6c53f3f3f3f3f5b8fd9de3f3fb6c53f3f3f3f3f5b5e
UTF-8 薏앸찋凝븐씅硫륁뙷[薏앸찋凝븐씅硫륁뙷[^ 111010001001011010001111111011001001010110111000111011001011000010001011111001011000011110011101111010111011100010010000111011001001010010000101111011111010011110001110111010111010010110000001111010111001100110110111010110111110100010010110100011111110110010010101101110001110110010110000100010111110010110000111100111011110101110111000100100001110110010010100100001011110111110100111100011101110101110100101100000011110101110011001101101110101101101011110 e8968fec95b8ecb08be5879debb890ec9485efa78eeba581eb99b75be8968fec95b8ecb08be5879debb890ec9485efa78eeba581eb99b75b5e
UHC 薏앸찋凝븐씅硫륁뙷[薏앸찋凝븐씅硫륁뙷[^ 111010111111101110011101111010111010100110001111111010111110101010111010111011001001110110011101111010111010100110001111111011001000110010111010010110111110101111111011100111011110101110101001100011111110101111101010101110101110110010011101100111011110101110101001100011111110110010001100101110100101101101011110 ebfb9deba98febeabaec9d9deba98fec8cba5bebfb9deba98febeabaec9d9deba98fec8cba5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)