Researchers just mathematically proved that AI can't recursively self-improve its way to superintelligence.

rootwyrm@weird.autos

@dpiponi @Quantensalat @devsimsek that part, that is ultimately a rehash of well-known theory. THAT part IIRC goes back to like the 1940's or 1950's.

And it absolutely rules out all forms of 'self-training.' It is not just mathematically impossible but a total logical fallacy. How can a system with no reference make correct determinations? Simple: it can't.

rootwyrm@weird.autos

@anne_twain @devsimsek this requires two components LLMs do not, cannot, and will not ever have. Intent and originality.
Researchers have done self-modifying targeted things. It takes no time at all for things to become impossible for humans to understand. This does not mean they are better. Usually they weren't. Even when hyper-focused with specific controls.

lioh@social.anoxinon.de

@huxley @devsimsek doesn't scepticism and intuation mitigate each other?

aka_quant_noir@hcommons.social

@alahmnat @devsimsek
I think we're in the billionaire intelligence decline phase. They're going nuts.

paul@notnull.space

@devsimsek excellent. Thanks for the overview!

hermlon@yuustan.space

@devsimsek isn't the idea of self-improving AI that the AI modifies its code, so the underlying algorithm / architecture?

lorxus@yiff.life

@devsimsek @qualia I think you claim too much here. As I understand it, this result deals only with the intrinsic failures of RL-flavored approaches and not things like self-play, let alone problems that might arise from merely very good AI that still outdoes humans economically.

And I largely agree! I'm glad that someone's finally formalized the intuition that synthetic data is sawdust to bulk out real-world data with and more carefully investigated catastrophic forgetting and the general weaknesses of gradient descent.

That said... to what extent did you have Claude write this post? Because the format is... distinctive.

dpiponi@mathstodon.xyz

@Quantensalat @devsimsek For something more formal on this subject see

https://arxiv.org/abs/2601.03220

The abstract starts "Can we learn more from data than existed in the generating process itself?"

rednikki@toot.boston

@devsimsek “slowly forgets what reality looks like.” Sort of like billionaires.

troed@masto.sangberg.se

@devsimsek The existence of humans disprove the paper.

aburka@hachyderm.io

@devsimsek did an LLM write this toot or do LLMs just write like you

anyia@lgbtqia.space

@devsimsek "Don't worry bro, we can totally fix this by adding a committee of expert LLMs to reason about what training data to select, another committee of LLMs to plan the optimal training order, and then a larger one to evaluate the training output. We just need you to sign this cheque for our next three hyperscale GPU data centres..."

resuna@ohai.social

@rootwyrm @dpiponi @Quantensalat @devsimsek

"How can a system with no reference make correct determinations? Simple: it can't."

Especially since it has no model of "correctness" other than "similar to the symbol streams the neural net weights were initialized from".

resuna@ohai.social

@troed @devsimsek

Large language models are fundamentally different from mammals on every level. They do not build models or reason about them. A rat is more "intelligent".

resuna@ohai.social

@rootwyrm @devsimsek

Mark V. Shaney.

wronglang@bayes.club

@Quantensalat @devsimsek the main issue is that unless you maintain an external signal (so human input in the form of token sequences that are actually carefully curated for coherence) the models become more and more incoherent. Sounds like you're on board with that. The next step is that we're quickly devaluing money spent on human creativity and the world is awash in LLM garbage. So the human signal *is* disappearing.

wronglang@bayes.club

@Quantensalat @musicman @devsimsek depends on what you mean by far fetched, certainly nothing as easy as "their more compute at it' which is what made this jump in investment so dramatic.

emma@orbital.horse

@devsimsek so it doesn't get stuck in a local optimum, it hill-climbs a non-existent one?

mike805@noc.social

@musicman @Quantensalat @devsimsek Anyone who ever copied an audio tape (or worse a VHS tape) knows that the copy is always worse than the original. And in the video case, soon unwatchable.

Ever heard a repeating echo on a video meeting that just turns to a buzz? Same phenomenon.

So what you need is an AI that can perform experiments in the real world to learn how to do better whatever it is you want it to do.

Inbreeding animals doesn't work too well either.

rootwyrm@weird.autos

@anne_twain @devsimsek there is no process. There is no intelligence. There never was and there never will be.
It's a bad stochastic parrot written by children who should have been flunked out of 7th grade math and 3rd grade English as illiterate. Used and pushed by people who aren't capable of reviewing a fast food order, or even placing one.

And guess what? All irrelevant because it takes an incomprehensible level of stupidity to even use a tool that fails dangerously constantly.