I am convinced we are on the verge of the first "AI agent worm".

noisytoot@berkeley.edu.pl

@cwebber @vv A local model would be extremely noticeable (far too much CPU/memory/disk space usage), at least if a computer you regularly interactively use got infected (rather than some server/IoT device that's been running unattended for years and you forgot about). It would also be easy to mitigate by using slow hardware like a ThinkPad X200 (which would take hours to respond to a single prompt, giving you plenty of time to notice the malware and deal with it)

doomsdayscw@kolektiva.social

@cwebber "Ha ha!"

aeva@mastodon.gamedev.place

@cwebber so I'm following this right, it sounds like the project or its maintainers don't even necessarily need to even be using LLM tools, the attack pattern simply targets contributors who are using LLM development tools? and so all that is really needed is for the payload to be subtle and the maintainer to be sufficiently overwhelmed (say, by an endless fire hose of LLM-generated liquid shit slop pull requests)?

cwebber@social.coop

@aeva Yes and it's worse than that: the maintainer doesn't even need to be running these tools on their computer. The attack I linked had Claude's independently-running REVIEW BOT on GitHub commit it via injection attack

csepp@merveilles.town

@cwebber This is making me more worried about Vorta's Claude workflows.
Backup software that handles highly sensitive data would be a prime target for such a supply chain attack.

cwebber@social.coop

@aeva But once that was done, the agent was set up to install on users' devices

So the initial attack vector can literally be "Any AI agent in your stack whatsoever getting tricked" as a pathway for infecting computers everywhere

cwebber@social.coop

@csepp Don't forget about KeePassXC. I dunno if they kept going after this "initial test" or not https://www.reddit.com/r/KeePass/comments/1lnvw6q/keepassxc_codebases_jump_into_generative_ai/

cwebber@social.coop

@csepp And don't forget about LITERALLY MOZILLA FIREFOX

aeva@mastodon.gamedev.place

@cwebber apropos of nothing, is pottery still a big deal for humans? i was thinking this morning that pottery might be a nice career change for me.

bituur_esztreym@pouet.chapril.org

@cwebber @mcc @dandylyons
not forgetting the second post - the one that appropriately begins by "meanwhile" - wasn't conflating anything, it was contrasting the gravity of the situation with the surreallistically ingenuous state of mind of some people.

csepp@merveilles.town

@cwebber Oh shit, I rely on all three of these.
Welppppp. I guess I'll have to start looking into alternative password managers.

tinodidriksen@mastodon.social

Ah, the infinite papirclips scenario.

canageek@wandering.shop

@csepp @cwebber Waterfox is a version of Firefox with all of the AI ripped out, but otherwise up to date with all the security changes and stuff, I think it may also have some additional privacy controls added

cwebber@social.coop

@Canageek @csepp Yes but Firefox itself is now being coded with AI generated commits

dvshkn@social.treehouse.systems

@mttaggart @mcc @cwebber Do we know what is being used for inference? At this point in time it's unlikely that they can use a self-hosted model, so there will be network calls.

canageek@wandering.shop

@cwebber @csepp GOD DAMMIT

cwebber@social.coop

@Canageek @csepp There was a recent thing, I can't find it now, where Mozilla added a commit to their agents thing to say "don't explicitly say when AI agents helped author a commit anymore", probably because they were getting community pushback

as you may have guessed, it got some community pushback

kormachameleon@tech.lgbt

@aeva @cwebber I'm a stokie so my default answer is yes. But the answer might be different for normal people

canageek@wandering.shop

@cwebber @csepp Vivaldi will have the same problem to, shit

mttaggart@infosec.exchange

@dvshkn @mcc @cwebber So the trick here is if you install OpenClaw in secret on a user's machine who isn't checking carefully, you might hide easily in network traffic. Use of tools like Claude Code would make the same API calls, which is likely for users who would be targeted with these attacks.

The real insane part is if multiple instance of OpenClaw were running on the same machine, so not even the process name looked suspicious. But of course process names are a poor indicator and can be changed.