I am convinced we are on the verge of the first "AI agent worm".

vv@solarpunk.moe

@dandylyons @cwebber for sure, but it still takes some level of ability to perform these tasks effectively, which local models, especially anything that can run on a typical machine, struggle with

dandylyons@iosdev.space

@vv @cwebber This is a good point. For now, local models are not proficient at tool calling. I don’t expect that to last for very long though.

reiddragon@fedi.catto.garden

@cwebber In today's episode of "We build the Torment Nexus from the hit novel 'Don't build the Torment Nexus'"...

dandylyons@iosdev.space

@mcc @cwebber The original post was all about an LLM taking non-deterministic shell level actions at runtime. And you conflated that with deterministic code written by an LLM.

What I wrote is very relevant.

arnebab@rollenspiel.social

@cwebber According to #Shadowrun the crash virus is still three years away.

https://shadowrun.fandom.com/wiki/Crash_Virus_of_2029

"Fun" fact: In Shadowrun the Crash Virus learned to kill humans who connected their brains to the net. It was the start of lethal internet input.

aronia@tech.lgbt

@cwebber

The postinstall script installs a legitimate, non-malicious package (OpenClaw). There is no malware to detect.

i beg to differ

mcc@mastodon.social

@dandylyons @cwebber it is about an attack based on covertly deploying LLM development tools, with the possible intent of later using them to leverage a second stage attack. If the LLM development tools were already installed, installing openclaw would not have been necessary and the attack could have worked a different way. We are discussing a situation where *the developer of a piece of software I use merely having LLM tools on their computer represents a risk to me*

cwebber@social.coop

@mcc exactly put

@dandylyons

mcc@mastodon.social

@dandylyons @cwebber in other words, if Christine's analysis holds, llm development tools create so much downstream risk to your users that *a malicious party would try to covertly install llm development tools for later exploitation*. That is the subject of discussion. Whether it is safe to install these things *at all*.

bonzoesc@m.bonzoesc.net

@aronia @cwebber it's only malware if it's bad for a computer from the silicon part of the periodic table, if it's bad for your carbon computer it's just a sparkling cognitohazard

sandorspruit@mastodon.nl

@cwebber @amirbkhan Oh man. I remember how I, as a student, struggled to help fight a malignant computer virus and “clean” a large office building - while uninformed workers let their kids play on office PC’s to make things worse. This is orders of a magnitude more complicated. Not good.

cmthiede@social.vivaldi.net

@neurobashing @cwebber just what we need, countless Agent Smiths running around.

noisytoot@berkeley.edu.pl

@cwebber @vv A local model would be extremely noticeable (far too much CPU/memory/disk space usage), at least if a computer you regularly interactively use got infected (rather than some server/IoT device that's been running unattended for years and you forgot about). It would also be easy to mitigate by using slow hardware like a ThinkPad X200 (which would take hours to respond to a single prompt, giving you plenty of time to notice the malware and deal with it)

doomsdayscw@kolektiva.social

@cwebber "Ha ha!"

aeva@mastodon.gamedev.place

@cwebber so I'm following this right, it sounds like the project or its maintainers don't even necessarily need to even be using LLM tools, the attack pattern simply targets contributors who are using LLM development tools? and so all that is really needed is for the payload to be subtle and the maintainer to be sufficiently overwhelmed (say, by an endless fire hose of LLM-generated liquid shit slop pull requests)?

cwebber@social.coop

@aeva Yes and it's worse than that: the maintainer doesn't even need to be running these tools on their computer. The attack I linked had Claude's independently-running REVIEW BOT on GitHub commit it via injection attack

csepp@merveilles.town

@cwebber This is making me more worried about Vorta's Claude workflows.
Backup software that handles highly sensitive data would be a prime target for such a supply chain attack.

cwebber@social.coop

@aeva But once that was done, the agent was set up to install on users' devices

So the initial attack vector can literally be "Any AI agent in your stack whatsoever getting tricked" as a pathway for infecting computers everywhere

cwebber@social.coop

@csepp Don't forget about KeePassXC. I dunno if they kept going after this "initial test" or not https://www.reddit.com/r/KeePass/comments/1lnvw6q/keepassxc_codebases_jump_into_generative_ai/

cwebber@social.coop

@csepp And don't forget about LITERALLY MOZILLA FIREFOX