👀 … https://sfconservancy.org/blog/2026/apr/15/eternal-november-generative-ai-llm/ …my colleague Denver Gingerich writes: newcomers' extensive reliance on LLM-backed generative AI is comparable to the Eternal September onslaught to USENET in 1993.

mu@mastodon.nz

@wwahammy @bkuhn @kees @glitzersachen @josh @silverwizard @ossguy @xgranade the people wanting to kill Sam Altman are doing so because they are afraid of the AI Doomer stories, this discussion about including slop in software is very different.

mu@mastodon.nz

@bkuhn @firefly_lightning @josh @kees @ossguy ok Neville Chamberlain

jedbrown@hachyderm.io

@cwebber
I agree about the hazard. LLM outputs should be considered derivative of all their inputs unless established otherwise. LLMs manipulate expression, not ideas, and the propensity of verbatim reproduction (up to and including entire books) is evidence of that process. Note that the purpose of the "substantial similarity" test is as circumstantial evidence of process.

I think the counterpoints are "mutually-assured destruction" and/or "yolo denial-of-service attack on copyright will win because power likes it". "AI" companies are still delaying cases from 2022 (like Doe v GitHub) because they want a jury who believes it is inevitable. Plaintiffs seek to win their cases, not to establish broad precedent. OpenAI has already lost (in German court) on copyright infringement of their outputs, arguing unsuccessfully that the infringement is the sole responsibility of their customers for prompting. The political reality of public sentiment is changing and collapse of the financial bubble will greatly alter the power held by "AI" companies.

Meanwhile, I think the words of the DCO ought to mean something, even for those who are certain they are a smol bean.

https://hachyderm.io/@jedbrown/114931171543347621

@bkuhn @ossguy @richardfontana

gulfie@mastodonapp.uk

@ossguy @davidgerard @wwahammy @silverwizard @firefly_lightning @cwebber big nope.

bkuhn@fedi.copyleft.org

@jedbrown

A case from 2022 still not at trial in 2026 doesn't indicate unreasonable or manipulative delay by Defendants. Such cases really do take that long.

Also, Doe vs. Microsoft's Github is a terribly constructed case and actually pushes us toward compulsory licensing of #FOSS works for #LLM-backed gen-#AI training— since the Plaintiff's lawyers in that case are clearly chasing their own avarice, not software freedom.

Background:
https://sfconservancy.org/news/2022/nov/04/class-action-lawsuit-filing-copilot/

@cwebber @ossguy @richardfontana

bkuhn@fedi.copyleft.org

@cwebber

Re: “polluting”, my reply is: https://fedi.copyleft.org/@bkuhn/116426437134023846 (elsewhere in thread).

Re: “copyleft-only #LLM”: I didn't propose that. I proposed copylefting the human-modified output of LLMs.

Re: “two scenarios”: IMO you propose a false dichotomy.

I hope you come to one of #SFC's public sessions on this, as I'd be glad to talk more about it, & this discussion doesn't lend itself to online debate because it's so complex.

cc: @ossguy @richardfontana
@jedbrown

#AI #OpenSource #FOSS

bkuhn@fedi.copyleft.org

@mu

A WWII reference is never helpful in a discussion unless the topic *is actually* WWII.

I'd be glad to have a serious discussion with you, but if you follow Godwin's law again, I probably will block you.

I know emotions are frayed and the FOSS community is frightened and worried, so I forgive you. But there is no reason to claim the situation with LLM-backed AI is tantamount to Hitler's violent invasion of Europe.

Cc: @firefly_lightning @josh @kees @ossguy @cwebber

bkuhn@fedi.copyleft.org

@evan … but I know you're only half joking.

Frankly, part of the problem here is that people are either taking this situation *too* seriously or not seriously enough. I'm guessing you're right in the happy medium, but your comment made me think of that point.

Cc: @richardfontana @cwebber @ossguy

bkuhn@fedi.copyleft.org

@evan
I have a speculative suspicion that the “leak” of Claude's front-end code was a false flag operation *hoping* someone would so-called “clean-room-with-Claude” their own UI.
I have this theory b/c the UI code is not what Claude needs to IPO (it's all the server side stuff that matters), and it behooves them & their investors if they themselves take a “fair's fair” position on the leak of their own code.
I'm meanwhile working on the chardet situation.

Cc: @richardfontana @cwebber @ossguy

bkuhn@fedi.copyleft.org

@evan

I don't mind that you tried (and I even clicked on the link so I guess I burnt down a rainforest?), but this reads like LLM-backed gen-AI slop to me. Full of truthiness but seems to lack depth of understanding of the AFC test.

I hope you can make it to one of SFC's chats on this topic.

Cc: @richardfontana @cwebber @ossguy

bkuhn@fedi.copyleft.org

@mu

I saw this comment after I saw you elsewhere in the thread comparing the LLM-backed genAI situation to WWII, so I am having a lot of trouble taking this seriously.

Plus your comment is snarky, sarcastic, mean, and slightly ad hominem. There is no reason for all that in civil debate.

Cc: @wwahammy @silverwizard @cwebber @richardfontana

bkuhn@fedi.copyleft.org

@RichardJActon
The copyleft-ish hack I propose is *we* (FOSS community) assume that any output of an LLM-backed genAI system *is* copylefted (since we are pretty sure all such systems — at least those designed for software development assist — have been trained on copylefted codebases).
Then, we copyleft any work that comes out of the system.
The only threat is proprietary software in the training set, & the industry can't abide enforcing *that*!
@cwebber @ossguy @richardfontana
@evan
@kees

bkuhn@fedi.copyleft.org

@cwebber

We already know the situation isn't equitable & probably won't become such in our lifetimes. Microsoft already all-but-admitted they will never train Copilot on their code. No proprietary software company is going to offer training data back to other vendors.

The goal here obviously was to LLM-wash away copyleft. *That* we must resist, and use their own tools against them: which is the very spirit that made copyleft in the first place!

Cc: @evan @richardfontana @ossguy @kees
@karen

bkuhn@fedi.copyleft.org

@evan wrote:

> “I consider myself an expert on this process since I learned about it 45 minutes ago ”

This is the second time you've made me laugh in this thread. Thanks for being comic relief (and I know that's not *all* you're doing, but that part is particularly helpful). Thank you!

Cc:
@richardfontana @cwebber @ossguy
@karen

bkuhn@fedi.copyleft.org

@sfoskett

*Thaler* is limited to the DC Circuit & very narrow. It's a registration question, & even *its* dicta hint there is no way we can know the answer on (1).

I think (2) is a strong argument.

As for (3), there is huge value to be extracted by applying copyleft-ish principles (and copyleft licenses themselves) to LLM-backed genAI output.

In the worst case: a big complex mix of public domain + copylefted-human-authored stuff can't easily be separated.

@richardfontana @evan @cwebber @ossguy

bkuhn@fedi.copyleft.org

@wwahammy

Indeed, SFC's position is #GiveUpGithub, but N.B. the https://giveupgithub.com/ site itself admits most people will use it & suggests a “using Github under protest” README.md.

I use proprietary software every day. I've been convinced for ≥ 10yrs: one can't succeed in an industrialized nation at *anything* w/out sometimes doing so.

The difficulty is figuring out when to compromise. I remain open-minded.
Few of us will be FOSS monks.

@ossguy @cwebber @LordCaramac @richardfontana

bkuhn@fedi.copyleft.org

@richardfontana wrote:
> “oh I mean of course you could use LLMs to help with the analysis ”

I'm catching up backwards on this thread, but do you see now the monster you created by telling @evan that?

cc: @cwebber @ossguy @karen

bkuhn@fedi.copyleft.org

LLM-backed genAI never makes as good jokes as you do, @evan

But are you finally coming clean with us here today that, in fact, #EvanPoll's are all created by a genAI system?

Cc: @richardfontana @cwebber @ossguy @karen

bkuhn@fedi.copyleft.org

@sfoskett
I responded in detail in another post to your conclusions later, but the assumption is wrong too. It's just pure FUD to say: “works generated by AI are not copyrightable per the US Supreme Court”.
https://sfconservancy.org/blog/2026/mar/04/scotus-deny-cert-dc-circuit-thaler-appeal-llm-ai/
TL;DR: *DC Circuit* held that a specific copyright registration *for a digital painting* that lists a computer program as the sole author is not eligible *at this time* for copyright *registration*. SCOTUS decided to not hear the case.

@evan @cwebber @richardfontana

bkuhn@fedi.copyleft.org

@richardfontana

I'm with @cwebber, there is no way to automate compliance. But, again, we should use that to our advantage in a copyleft-ish way.

Cc: @ossguy