cross-posted from: https://lemmy.world/post/7258145

The tool, called Nightshade, messes up training data in ways that could cause serious damage to image-generating AI models. Is intended as a way to fight back against AI companies that use artists’ work to train their models without the creator’s permission.

ARTICLE - Technology Review

ARTICLE - Mashable

ARTICLE - Gizmodo

The researchers tested the attack on Stable Diffusion’s latest models and on an AI model they trained themselves from scratch. When they fed Stable Diffusion just 50 poisoned images of dogs and then prompted it to create images of dogs itself, the output started looking weird—creatures with too many limbs and cartoonish faces. With 300 poisoned samples, an attacker can manipulate Stable Diffusion to generate images of dogs to look like cats.

    • MartianSands@sh.itjust.works
      link
      fedilink
      arrow-up
      34
      arrow-down
      1
      ·
      6 days ago

      That’s the problem with all of these attempts. They treat these “poisons” as if they work on AI in general, when in fact they’re very specifically created to target specific models.

      Not only will they only work on some AIs, it’s not terribly difficult to modify the AI enough that it needs a different poison

      • UnspecificGravity@piefed.social
        link
        fedilink
        English
        arrow-up
        27
        ·
        6 days ago

        Just like with poisonous creatures in nature its not about just killing everything that tries to eat you its about making it easier to eat something else. Having to CONSTANTLY develop new strategies in order to train their models on artwork increases the cost to maintain this practice. Eventually it raises it high enough that the cost isn’t worth the result.

        • underisk@lemmy.ml
          link
          fedilink
          arrow-up
          14
          ·
          6 days ago

          Eventually it raises it high enough that the cost isn’t worth the result.

          This is the basis for all digital security/encryption.

        • Voroxpete@sh.itjust.works
          link
          fedilink
          arrow-up
          3
          ·
          6 days ago

          Also, I’d really like to know how much additional processing time is required to de-nightshade an image? And how much is required to detect nightshade, if that’s even a different amount? Do you just have to de-nightshade every image to be safe?

          Suppose the workload of de-nightshading is equal to the workload of training on that image. You’ve just doubled training costs. What if it’s four times? Ten times?

          That de-nightshading tool works in a lab, sure, but the real question is if it scales in a practical and cost effective way. Because for each individual artist the cost of applying nightshade is functionally nil, but the cost for detecting / removing it could be extremely high.

          • UnspecificGravity@piefed.social
            link
            fedilink
            English
            arrow-up
            6
            ·
            6 days ago

            Plus they have to keep developing solutions to Nighshade 2 and Nightshade 2.1 and the Deathcap fork etc. etc. An enthusiastically developed open source project with a bunch of forks and versions is not an easy thing for a big lumbering corporation to keep up with. Especially a corporation that is actively trying to replace staff with AI coders.

            • Voroxpete@sh.itjust.works
              link
              fedilink
              arrow-up
              1
              ·
              5 days ago

              There’s also the assymetric failure modes. If nightshade fails, well, we just end up with the current status quo. LLMs get trained on people’s art. But if the tactics to prevent it fail, a very expensive LLM gets poisoned in some specific way. So it’s much more important for the LLM trainers to always succeed than it is for the people developing nightshade variants.

          • ZDL@lazysoci.al
            link
            fedilink
            arrow-up
            1
            ·
            5 days ago

            Well degenerative AI in general doesn’t scale in a practical and cost effective way, so … I think the conclusion for de-nightshading is obvious?

      • finitebanjo@lemmy.world
        link
        fedilink
        arrow-up
        8
        ·
        edit-2
        6 days ago

        Idk, I’m not convinced. A lot of AI develpment gets treated like a black box algorithm. Overconstraining can lead to model collapse due to more convergent behavior than normal.

  • ceenote@lemmy.world
    link
    fedilink
    arrow-up
    26
    arrow-down
    1
    ·
    6 days ago

    I give it one month before Donald Trump signs a bill outlawing the poisoning of AI models with bipartisan support.

      • ceenote@lemmy.world
        link
        fedilink
        arrow-up
        14
        arrow-down
        1
        ·
        edit-2
        6 days ago

        Plenty of politicians on both sides are in big tech’s pockets. When it’s a Democrat it’s a bug, when it’s a Republican it’s a feature.

  • dejected_warp_core@lemmy.world
    link
    fedilink
    arrow-up
    16
    ·
    5 days ago

    What’s old is new again.

    https://en.wikipedia.org/wiki/Fictitious_entry

    Fictitious or fake entries are deliberately incorrect entries in reference works such as dictionaries, encyclopedias, maps, and directories, added by the editors as copyright traps to reveal subsequent plagiarism or copyright infringement. There are more specific terms for particular kinds of fictitious entry, such as Mountweazel, trap street, paper town, phantom settlement, and nihilartikel.[1]

    Disregarding the obscene amount of automation at play, the underlying problem and its remedy remain somewhat the same. Put bad data into a very large dataset, such that it evades cursory scans, and the unaware plagiarists are eventually caught red-handed.

  • ZDL@lazysoci.al
    link
    fedilink
    arrow-up
    9
    ·
    5 days ago

    Isn’t the easiest way to poison the degenerative AI pool to just feed it degenerative AI output?

    • Hildegarde@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      12
      ·
      5 days ago

      This tool is for artists to protect their own works from theft. This tool watermarks the art in a minor way that is difficult for humans to notice, but messes up current AI models that use it as training data.

      Yes AI incest does degrade the models, but that strategy is ineffective at protecting the works of artists.

  • Blaster M@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    6 days ago

    It’s trivial to defeat this, and at the levels you need to really make it work, your image looks terrible. Don’t publicly share something if you don’t want it to get in a dataset somehow.

    • dejected_warp_core@lemmy.world
      link
      fedilink
      arrow-up
      5
      ·
      5 days ago

      Eh, it’s not hard to hide things away in a website that only a bot (or a very determined human) will find. You don’t have to poison all your images, just enough of them.

    • SereneSadie@quokk.au
      link
      fedilink
      English
      arrow-up
      4
      ·
      5 days ago

      Right, because artists are all about hoarding their work to themselves and not letting anyone get copies ever.

      They’ll have to DRM it by restricting access to being there in person only, no recording devices whatsoever.

      Clearly thats the only logical solution left.