all the criticism has been said, all the takes been had.

jonny@neuromatch.social

@elebertus
I've read so much LLM code at this point. There are still patterns that are present but elude my understanding, but one thing that's clear is that there are foundational flaw categories that are not improved upon by model version and that appear in wildly different projects using wildly different models and harnesses. Testing is a big nexus of those flaws. I am not close to what would be a satisfying explanation of the dynamics, but every project suffers fucked testing problems.

eliocamp@mastodon.social

@jonny One thing I love about this and other posts linking to slop on GitHub is that more often than not I flat out can't follow the link because GitHub is not working.

bstacey@icosahedron.website

@jonny I struggle to express how bleak this is.

bstacey@icosahedron.website

@jonny It's like everyone decided to take a bath in mercury and leaded gasoline.

bipolaron@scholar.social

@bstacey @jonny with a plugged in datacenter

poleguy@mastodon.social

@jonny I just lost my beer league hockey championship as the last shooter in a 14-round shootout. I'm sitting in my driveway reading your thread. I'll need to read it again in the morning.

I don't remember why I followed you originally. But I love this thread.

This whole rsync thing is the most interesting thing that has come out of the ai bubble.

I had a negative feeling about rsync after reading a blog years ago criticizing its sloppy design.

Yet I rely on it daily. I have so many questions.

jonny@neuromatch.social

@poleguy
RIP on the shootout, hopefully the other team bought the beer and you got to pinch the other goalie's cheek a bit. You'll get 'em next season.

poleguy@mastodon.social

@jonny indeed, that's the right feeling!

We have sponsorship from a brewery, so the locker room beer (and custom jerseys) are "free."

But we sat at the bar with the other team. It is just a game after all.

Both sides had a good time. And we had fans cheering for both sides. And kids crashing the locker room to celebrate despite the loss... We shared our NA options. Can't ask for more.

I'd love to engage more on this thread technically... I have thoughts. Maybe Monday.

jens@social.finkhaeuser.de

@jonny related:

https://finkhaeuser.de/2026-04-10-outsourcing-thought-is-going-great/

jens@social.finkhaeuser.de

@jonny @ainmosni Gambling (addiction) works on the so-called Variable Reinforcement Schedule.

The TL;DR of it is, results are random enough that even though it seems there may be a pattern, there isn't. You're pulled in because "one more time will show my pattern detection was right".

And since human brains are excellent pattern detection machines, every time this succeeds, it yields huge dopamine rewards.

I'm pissed off with the pattern, which is why I stop. But I can't deny its power.

themipper@mastodon.social

@jonny this whole thing is so bad that the only viable way seems to be to fork it from before the LLM sloppening. It is a shame to see more and more foundational projects fall into the LLM trap.

And as always you hit the nail on the head with your deep dive and explanations. I love reading them.

I will use your observation that, for an LLM, what is written is the same as what is happening.

gunchleoc@mastodon.scot

@jonny Of course it's a smoke test - as in "smoke and mirrors"

WTAF.

fluffy@plush.city

@jonny also why the hell would they write tests for a C program/library in Python? It makes no sense.

spitfire@mastodon.de

@jonny holy crap this story gets worse by the day. Thank you very much for summing up this aspect of the situation for a non-sw-engineering person like me. 🫡

fluffy@plush.city

@jonny ... and why the everloving FUCK do these tests run as root

dahukanna@mastodon.social

@jonny Referencing
1. @shauna post based on @DGI about power dynamics & dysfunction between imaginary labour (iML) & interpretive labour (iNL) - https://www.rethinkingpower.info/how-interpretive-labor-straddles-the-gap-between-rules-and-reality/
2. Power, chapter 4 of Mary Parker Follett’s Dynamic Administration - https://mastodon.social/@dahukanna/110643444784446704

Presuming Productivity (P) = iML/iNL
Dysfunctional power-over tool imposition, e.g. LLM, factory production, etc.
- Imagined abstract: 1 LLM PR / 0 review units = ∞P
- Interpreted reality: 1 LLM PR / >10 review units = <0.1P
- https://mastodon.social/@dahukanna/113230734549577353

technocrow@blahaj.zone

@fluffy@plush.city @jonny@neuromatch.social running tests as root is fucking wild

sesamzoo@mastodon.social

@jens, great article, thank you. Did you pull the lever "just one more time" and if so, did it get even worse?

@jonny, thank you for this thread and lots of your other threads on the topic.

Both help me feel that I'm not the ghost driver, although these days there is a lot of contraflow in my lane. Mostly at work, where the AI fanboys/believers/addicts are at least way louder than the people trying to understand and keep their code in maintainable shape.

jens@social.finkhaeuser.de

@sesamzoo @jonny No, I did not. I try not to use LLMs at all, so I really did maybe one or two queries more than described, just to get a feel for it.

jens@social.finkhaeuser.de

@sesamzoo @jonny Also, I feel dirty every time, so I don't want to waste water crying in the shower afterwards.