Skip to content
  • Hjem
  • Seneste
  • Etiketter
  • Populære
  • Verden
  • Bruger
  • Grupper
Temaer
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Kollaps
FARVEL BIG TECH
  1. Forside
  2. Ikke-kategoriseret
  3. I am convinced we are on the verge of the first "AI agent worm".

I am convinced we are on the verge of the first "AI agent worm".

Planlagt Fastgjort Låst Flyttet Ikke-kategoriseret
117 Indlæg 53 Posters 0 Visninger
  • Ældste til nyeste
  • Nyeste til ældste
  • Most Votes
Svar
  • Svar som emne
Login for at svare
Denne tråd er blevet slettet. Kun brugere med emne behandlings privilegier kan se den.
  • cwebber@social.coopC cwebber@social.coop

    @vv Yeah. I mean, local models *might* be able to pull this off but right now Claude is the most likely candidate, it's the most capable. But even then, the most capable open model that is capable of doing such damage on its own is somewhere around a gigabyte, not a small download.

    (But, people download huge things all the time, so not completely infeasible either.)

    noisytoot@berkeley.edu.plN This user is from outside of this forum
    noisytoot@berkeley.edu.plN This user is from outside of this forum
    noisytoot@berkeley.edu.pl
    wrote sidst redigeret af
    #44
    @cwebber @vv A local model would be extremely noticeable (far too much CPU/memory/disk space usage), at least if a computer you regularly interactively use got infected (rather than some server/IoT device that's been running unattended for years and you forgot about). It would also be easy to mitigate by using slow hardware like a ThinkPad X200 (which would take hours to respond to a single prompt, giving you plenty of time to notice the malware and deal with it)
    1 Reply Last reply
    0
    • cwebber@social.coopC cwebber@social.coop

      I am convinced we are on the verge of the first "AI agent worm". This looks like the closest hint of it, though it isn't it quite itself: an attack on a PR agent that got it to set up to install openclaw with full access on 4k machines https://grith.ai/blog/clinejection-when-your-ai-tool-installs-another

      But, the agents installed weren't given instructions to *do* anything yet.

      Soon they will be. And when they are, the havoc will be massive. Unlike traditional worms, where you're looking for the typically byte-for-byte identical worm embedded in the system, an agent worm can do different, nondeterministic things on every install, and carry out a global action.

      I suspect we're months away from seeing the first agent worm, *if* that. There may already be some happening right now in FOSS projects, undetected.

      doomsdayscw@kolektiva.socialD This user is from outside of this forum
      doomsdayscw@kolektiva.socialD This user is from outside of this forum
      doomsdayscw@kolektiva.social
      wrote sidst redigeret af
      #45

      @cwebber "Ha ha!"

      1 Reply Last reply
      0
      • jwcph@helvede.netJ jwcph@helvede.net shared this topic
      • cwebber@social.coopC cwebber@social.coop

        I know some people are thinking "well pulling off this kind of thing, it would have to be controlled with intent of a human actor"

        It doesn't have to be.

        1. A human could *kick off* such a process, and then it runs away from them.
        2. It wouldn't even require a specific prompt to kick off a worm. There's enough scifi out there for this to be something any one of the barely-monitored openclaw agents could determine it should do.

        Whether it's kicked off by a human explicitly or a stray agent, it doesn't require "intentionality". Biological viruses don't have interiority / intentionality, and yet are major threats that reproduce and adapt.

        aeva@mastodon.gamedev.placeA This user is from outside of this forum
        aeva@mastodon.gamedev.placeA This user is from outside of this forum
        aeva@mastodon.gamedev.place
        wrote sidst redigeret af
        #46

        @cwebber so I'm following this right, it sounds like the project or its maintainers don't even necessarily need to even be using LLM tools, the attack pattern simply targets contributors who are using LLM development tools? and so all that is really needed is for the payload to be subtle and the maintainer to be sufficiently overwhelmed (say, by an endless fire hose of LLM-generated liquid shit slop pull requests)?

        cwebber@social.coopC violetmadder@kolektiva.socialV 2 Replies Last reply
        0
        • aeva@mastodon.gamedev.placeA aeva@mastodon.gamedev.place

          @cwebber so I'm following this right, it sounds like the project or its maintainers don't even necessarily need to even be using LLM tools, the attack pattern simply targets contributors who are using LLM development tools? and so all that is really needed is for the payload to be subtle and the maintainer to be sufficiently overwhelmed (say, by an endless fire hose of LLM-generated liquid shit slop pull requests)?

          cwebber@social.coopC This user is from outside of this forum
          cwebber@social.coopC This user is from outside of this forum
          cwebber@social.coop
          wrote sidst redigeret af
          #47

          @aeva Yes and it's worse than that: the maintainer doesn't even need to be running these tools on their computer. The attack I linked had Claude's independently-running REVIEW BOT on GitHub commit it via injection attack

          cwebber@social.coopC 1 Reply Last reply
          0
          • cwebber@social.coopC cwebber@social.coop

            I am convinced we are on the verge of the first "AI agent worm". This looks like the closest hint of it, though it isn't it quite itself: an attack on a PR agent that got it to set up to install openclaw with full access on 4k machines https://grith.ai/blog/clinejection-when-your-ai-tool-installs-another

            But, the agents installed weren't given instructions to *do* anything yet.

            Soon they will be. And when they are, the havoc will be massive. Unlike traditional worms, where you're looking for the typically byte-for-byte identical worm embedded in the system, an agent worm can do different, nondeterministic things on every install, and carry out a global action.

            I suspect we're months away from seeing the first agent worm, *if* that. There may already be some happening right now in FOSS projects, undetected.

            csepp@merveilles.townC This user is from outside of this forum
            csepp@merveilles.townC This user is from outside of this forum
            csepp@merveilles.town
            wrote sidst redigeret af
            #48

            @cwebber This is making me more worried about Vorta's Claude workflows.
            Backup software that handles highly sensitive data would be a prime target for such a supply chain attack.

            cwebber@social.coopC 1 Reply Last reply
            0
            • cwebber@social.coopC cwebber@social.coop

              @aeva Yes and it's worse than that: the maintainer doesn't even need to be running these tools on their computer. The attack I linked had Claude's independently-running REVIEW BOT on GitHub commit it via injection attack

              cwebber@social.coopC This user is from outside of this forum
              cwebber@social.coopC This user is from outside of this forum
              cwebber@social.coop
              wrote sidst redigeret af
              #49

              @aeva But once that was done, the agent was set up to install on users' devices

              So the initial attack vector can literally be "Any AI agent in your stack whatsoever getting tricked" as a pathway for infecting computers everywhere

              aeva@mastodon.gamedev.placeA 1 Reply Last reply
              0
              • csepp@merveilles.townC csepp@merveilles.town

                @cwebber This is making me more worried about Vorta's Claude workflows.
                Backup software that handles highly sensitive data would be a prime target for such a supply chain attack.

                cwebber@social.coopC This user is from outside of this forum
                cwebber@social.coopC This user is from outside of this forum
                cwebber@social.coop
                wrote sidst redigeret af
                #50

                @csepp Don't forget about KeePassXC. I dunno if they kept going after this "initial test" or not https://www.reddit.com/r/KeePass/comments/1lnvw6q/keepassxc_codebases_jump_into_generative_ai/

                cwebber@social.coopC 1 Reply Last reply
                0
                • cwebber@social.coopC cwebber@social.coop

                  @csepp Don't forget about KeePassXC. I dunno if they kept going after this "initial test" or not https://www.reddit.com/r/KeePass/comments/1lnvw6q/keepassxc_codebases_jump_into_generative_ai/

                  cwebber@social.coopC This user is from outside of this forum
                  cwebber@social.coopC This user is from outside of this forum
                  cwebber@social.coop
                  wrote sidst redigeret af
                  #51

                  @csepp And don't forget about LITERALLY MOZILLA FIREFOX

                  csepp@merveilles.townC 1 Reply Last reply
                  0
                  • cwebber@social.coopC cwebber@social.coop

                    @aeva But once that was done, the agent was set up to install on users' devices

                    So the initial attack vector can literally be "Any AI agent in your stack whatsoever getting tricked" as a pathway for infecting computers everywhere

                    aeva@mastodon.gamedev.placeA This user is from outside of this forum
                    aeva@mastodon.gamedev.placeA This user is from outside of this forum
                    aeva@mastodon.gamedev.place
                    wrote sidst redigeret af
                    #52

                    @cwebber apropos of nothing, is pottery still a big deal for humans? i was thinking this morning that pottery might be a nice career change for me.

                    kormachameleon@tech.lgbtK lispi314@udongein.xyzL ryanprior@mastodon.socialR 3 Replies Last reply
                    0
                    • cwebber@social.coopC cwebber@social.coop

                      @mcc exactly put

                      @dandylyons

                      bituur_esztreym@pouet.chapril.orgB This user is from outside of this forum
                      bituur_esztreym@pouet.chapril.orgB This user is from outside of this forum
                      bituur_esztreym@pouet.chapril.org
                      wrote sidst redigeret af
                      #53

                      @cwebber @mcc @dandylyons
                      not forgetting the second post - the one that appropriately begins by "meanwhile" - wasn't conflating anything, it was contrasting the gravity of the situation with the surreallistically ingenuous state of mind of some people.

                      1 Reply Last reply
                      0
                      • cwebber@social.coopC cwebber@social.coop

                        @csepp And don't forget about LITERALLY MOZILLA FIREFOX

                        csepp@merveilles.townC This user is from outside of this forum
                        csepp@merveilles.townC This user is from outside of this forum
                        csepp@merveilles.town
                        wrote sidst redigeret af
                        #54

                        @cwebber Oh shit, I rely on all three of these.
                        Welppppp. I guess I'll have to start looking into alternative password managers.

                        canageek@wandering.shopC 1 Reply Last reply
                        0
                        • cwebber@social.coopC cwebber@social.coop

                          I am convinced we are on the verge of the first "AI agent worm". This looks like the closest hint of it, though it isn't it quite itself: an attack on a PR agent that got it to set up to install openclaw with full access on 4k machines https://grith.ai/blog/clinejection-when-your-ai-tool-installs-another

                          But, the agents installed weren't given instructions to *do* anything yet.

                          Soon they will be. And when they are, the havoc will be massive. Unlike traditional worms, where you're looking for the typically byte-for-byte identical worm embedded in the system, an agent worm can do different, nondeterministic things on every install, and carry out a global action.

                          I suspect we're months away from seeing the first agent worm, *if* that. There may already be some happening right now in FOSS projects, undetected.

                          tinodidriksen@mastodon.socialT This user is from outside of this forum
                          tinodidriksen@mastodon.socialT This user is from outside of this forum
                          tinodidriksen@mastodon.social
                          wrote sidst redigeret af
                          #55

                          Ah, the infinite papirclips scenario.

                          1 Reply Last reply
                          0
                          • csepp@merveilles.townC csepp@merveilles.town

                            @cwebber Oh shit, I rely on all three of these.
                            Welppppp. I guess I'll have to start looking into alternative password managers.

                            canageek@wandering.shopC This user is from outside of this forum
                            canageek@wandering.shopC This user is from outside of this forum
                            canageek@wandering.shop
                            wrote sidst redigeret af
                            #56

                            @csepp @cwebber Waterfox is a version of Firefox with all of the AI ripped out, but otherwise up to date with all the security changes and stuff, I think it may also have some additional privacy controls added

                            cwebber@social.coopC 1 Reply Last reply
                            0
                            • canageek@wandering.shopC canageek@wandering.shop

                              @csepp @cwebber Waterfox is a version of Firefox with all of the AI ripped out, but otherwise up to date with all the security changes and stuff, I think it may also have some additional privacy controls added

                              cwebber@social.coopC This user is from outside of this forum
                              cwebber@social.coopC This user is from outside of this forum
                              cwebber@social.coop
                              wrote sidst redigeret af
                              #57

                              @Canageek @csepp Yes but Firefox itself is now being coded with AI generated commits

                              canageek@wandering.shopC 1 Reply Last reply
                              0
                              • mttaggart@infosec.exchangeM mttaggart@infosec.exchange

                                @mcc @cwebber You could, but I would not recommend doing so. Instead perhaps a purposed YARA lookup with a single rule to look for the filename/string? Not sure why you'd be so restrictive on detections, but you can.

                                dvshkn@social.treehouse.systemsD This user is from outside of this forum
                                dvshkn@social.treehouse.systemsD This user is from outside of this forum
                                dvshkn@social.treehouse.systems
                                wrote sidst redigeret af
                                #58

                                @mttaggart @mcc @cwebber Do we know what is being used for inference? At this point in time it's unlikely that they can use a self-hosted model, so there will be network calls.

                                mttaggart@infosec.exchangeM mcc@mastodon.socialM 2 Replies Last reply
                                0
                                • cwebber@social.coopC cwebber@social.coop

                                  @Canageek @csepp Yes but Firefox itself is now being coded with AI generated commits

                                  canageek@wandering.shopC This user is from outside of this forum
                                  canageek@wandering.shopC This user is from outside of this forum
                                  canageek@wandering.shop
                                  wrote sidst redigeret af
                                  #59

                                  @cwebber @csepp GOD DAMMIT

                                  cwebber@social.coopC 1 Reply Last reply
                                  0
                                  • canageek@wandering.shopC canageek@wandering.shop

                                    @cwebber @csepp GOD DAMMIT

                                    cwebber@social.coopC This user is from outside of this forum
                                    cwebber@social.coopC This user is from outside of this forum
                                    cwebber@social.coop
                                    wrote sidst redigeret af
                                    #60

                                    @Canageek @csepp There was a recent thing, I can't find it now, where Mozilla added a commit to their agents thing to say "don't explicitly say when AI agents helped author a commit anymore", probably because they were getting community pushback

                                    as you may have guessed, it got some community pushback

                                    canageek@wandering.shopC png@yap.pony.bizP 2 Replies Last reply
                                    0
                                    • aeva@mastodon.gamedev.placeA aeva@mastodon.gamedev.place

                                      @cwebber apropos of nothing, is pottery still a big deal for humans? i was thinking this morning that pottery might be a nice career change for me.

                                      kormachameleon@tech.lgbtK This user is from outside of this forum
                                      kormachameleon@tech.lgbtK This user is from outside of this forum
                                      kormachameleon@tech.lgbt
                                      wrote sidst redigeret af
                                      #61

                                      @aeva @cwebber I'm a stokie so my default answer is yes. But the answer might be different for normal people

                                      aeva@mastodon.gamedev.placeA 1 Reply Last reply
                                      0
                                      • cwebber@social.coopC cwebber@social.coop

                                        @Canageek @csepp There was a recent thing, I can't find it now, where Mozilla added a commit to their agents thing to say "don't explicitly say when AI agents helped author a commit anymore", probably because they were getting community pushback

                                        as you may have guessed, it got some community pushback

                                        canageek@wandering.shopC This user is from outside of this forum
                                        canageek@wandering.shopC This user is from outside of this forum
                                        canageek@wandering.shop
                                        wrote sidst redigeret af
                                        #62

                                        @cwebber @csepp Vivaldi will have the same problem to, shit

                                        cwebber@social.coopC 1 Reply Last reply
                                        0
                                        • dvshkn@social.treehouse.systemsD dvshkn@social.treehouse.systems

                                          @mttaggart @mcc @cwebber Do we know what is being used for inference? At this point in time it's unlikely that they can use a self-hosted model, so there will be network calls.

                                          mttaggart@infosec.exchangeM This user is from outside of this forum
                                          mttaggart@infosec.exchangeM This user is from outside of this forum
                                          mttaggart@infosec.exchange
                                          wrote sidst redigeret af
                                          #63

                                          @dvshkn @mcc @cwebber So the trick here is if you install OpenClaw in secret on a user's machine who isn't checking carefully, you might hide easily in network traffic. Use of tools like Claude Code would make the same API calls, which is likely for users who would be targeted with these attacks.

                                          The real insane part is if multiple instance of OpenClaw were running on the same machine, so not even the process name looked suspicious. But of course process names are a poor indicator and can be changed.

                                          tiotasram@kolektiva.socialT 1 Reply Last reply
                                          0
                                          Svar
                                          • Svar som emne
                                          Login for at svare
                                          • Ældste til nyeste
                                          • Nyeste til ældste
                                          • Most Votes


                                          • Log ind

                                          • Har du ikke en konto? Tilmeld

                                          • Login or register to search.
                                          Powered by NodeBB Contributors
                                          Graciously hosted by data.coop
                                          • First post
                                            Last post
                                          0
                                          • Hjem
                                          • Seneste
                                          • Etiketter
                                          • Populære
                                          • Verden
                                          • Bruger
                                          • Grupper