👀 … https://sfconservancy.org/blog/2026/apr/15/eternal-november-generative-ai-llm/ …my colleague Denver Gingerich writes: newcomers' extensive reliance on LLM-backed generative AI is comparable to the Eternal September onslaught to USENET in 1993.
-
@cwebber @bkuhn @ossguy @richardfontana
Based on my following of current legal cases, I think it's entirely possible that in a year or two we'll suddenly be rolling large OSS codebases back to 2023. And won't that be fun!
Can you please cite the actual precedent?
If it's ongoing, yet-undecided cases you mean, which of the 100s of cases do you mean, what rulings have occurred that lead you to this speculation, and why?
I know you didn't mean to, but your post just feeds the FUD monsters.
-
@cwebber @bkuhn @ossguy @richardfontana Worse IMHO is that we're putting FOSS as a movement at risk if we deskill everyone to the point where you either pay money to have code generated for you, or there is no code.
@jens wrote:
> “we're putting FOSS as a movement at risk if we deskill everyone to the point where you either pay money to have code generated for you, or there is no code.”I agree completely. We *need* to encourage extreme discipline if LLM-backed genAI systems are used for software to ensure: (a) experienced developers' skills don't atrophy, and (b) ensure that new developers understand these tools aren't for neophytes b/c they newbies far astray.
-
I don't see a plausible path where the timebomb exists: (a) likely none of these proprietary LLM-backed genAI systems are *trained* on proprietary software, & (b) even if they *are*, the proprietary industry as a whole seems very much to *want* to maintain this absolute fiction that these systems are magically always public domain, & if not fair use defense always works.
We meanwhile use copyleft-ish strategies to beat them at their own game.
@bkuhn @cwebber @evan @richardfontana @zacchiro Even if attribution issues disappear. Surely it's a time bomb in terms of projects who are intentionally not using copyleft licenses. Or incompatible licenses?
-
@bkuhn @richardfontana @evan @cwebber @ossguy Wow I really appreciate you weighing in here! I was thinking Naruto v. Slater for point one not just Thaler but I certainly defer to your expertise especially on point 3.
@bkuhn @richardfontana @evan @cwebber @ossguy Interesting! “Copyright law — and the legal precedents around it — differ widely for different types of creative works. Analysis of the copyrightability of works of software varies in notable ways. Therefore, do not to assume that analysis for images apply broadly to software.”
-
A WWII reference is never helpful in a discussion unless the topic *is actually* WWII.
I'd be glad to have a serious discussion with you, but if you follow Godwin's law again, I probably will block you.
I know emotions are frayed and the FOSS community is frightened and worried, so I forgive you. But there is no reason to claim the situation with LLM-backed AI is tantamount to Hitler's violent invasion of Europe.
-
@trwnh @bkuhn @ossguy @richardfontana Plenty of Microsoft code has been released under "shared source" licenses and also leaks
@cwebber wrote:
> “ Plenty of Microsoft code has been released [publicly] under "shared source" licenses [and may well be in training sets]”
An interesting point! We should call on Microsoft to agree: if you end up with copyright violations of their source-available FOSS licenses, that they will offer a unilateral covenant-not-to-sue. They already indemnify Copilot users, but it's good advocacy tactics to point out they didn't indemnify Claude users for same.
Cc: @trwnh @richardfontana @ossguy -
@bkuhn @cwebber @evan @richardfontana @zacchiro Even if attribution issues disappear. Surely it's a time bomb in terms of projects who are intentionally not using copyleft licenses. Or incompatible licenses?
@bkuhn @cwebber @evan @richardfontana @zacchiro OpenZFS is accepting LLM assisted patches today. If a CDDL project like it received non trivial near verbatim GPL code from another project via a LLM I would think that would be an issue. Or vice versa, CDDL code in a GPL project.
-
I saw this comment after I saw you elsewhere in the thread comparing the LLM-backed genAI situation to WWII, so I am have a lot of trouble taking this seriously.
Plus your comment is snarky, sarcastic, mean, and slightly ad hominem. There is no reason for all that in civil debate.
@bkuhn @wwahammy @silverwizard @cwebber @richardfontana you seem to be caught on prissy rules of politeness rather than being able to see the meat of my argument.
If you want to give LLM advocates every accommodation, but are unwilling to do that for other people, then you are making clear your preferences on the subject in a way that shows you were never trying to understand both sides, you were never willing to move your position, and you are just pushing an agenda. All your talk of debates and understanding both sides rings hollow if you can only accept discussion in one particular format.
Hypocrite.
If I'm wrong, prove me wrong.
You're not going to.
-
@bkuhn @ossguy @LordCaramac @richardfontana
- There are plenty of FOSS projects we care about which are not under copyleft. What terms should they consider received code under? Should SDL now consider all LLM based output under the GPL? The AGPL? Which? Do you expect such a project to switch its license to copyleft now?
- Microsoft's proprietary code may not be, but plenty of proprietary code is available under extremely non-FOSS and restrictive licenses which are within datasets we are getting contributions from *today*
- The mutually assured destruction "safe option" isn't that things are under copyleft for proprietary companies though, that's still a losing scenario for them. So that doesn't help the case for copyleft, only accepting that LLM output under the public domain is (which we don't know)If there was ever a time in 40+ years of #FOSS history to tell our #copyleft-hating FOSS friends that they erred in their license choices, now is the time.
If they don't switch, they're giving hand-outs to the proprietary software companies. Now, in an entirely new & #disturbing way.
I really think these cases where proprietary software ends up in #LLM training sets & actually creates risk are exceedingly rare, if not entirely hypothetical.
-
@cwebber @LordCaramac @bkuhn @richardfontana Sadly it will be years before we have an answer re copyright and we can't wait for that. Outlining usage in the meantime is the best we can do, in case we need to do something with that later.
We know proprietary software companies are using these tools extensively, so this is in effect a mutually assured destruction situation. While we wait, we should make sure that we are pushing freedom on all other axes, since they won't do that part.
@ossguy @cwebber @LordCaramac @bkuhn @richardfontana This sounds to me like the proprietary software companies are using code LLMs to mass copyright launder copyleft, GPL etc. code so as to basically incorporate it all without honouring its code (re)distribution terms. This is a grimmer situation than I had anticipated, and I'm not widely known as an optimist.
-
@cwebber @bkuhn @ossguy @richardfontana
Indeed, big tech know full well the FLOSS / indie creators don't have the legal funds to defend. Their IP either.
I've spent my whole career building organizations that could, in fact, defend against Big Tech when they trample on the FOSS community's rights.
While it's difficult work to fund, SFC has done it on a shoestring budget for two decades now, and we've yielded results that include both copyright and contract forms of litigation.
Big Tech *is* afraid of us. The mouse can roar.
See also: https://sfc.ngo/vizio/
-
If there was ever a time in 40+ years of #FOSS history to tell our #copyleft-hating FOSS friends that they erred in their license choices, now is the time.
If they don't switch, they're giving hand-outs to the proprietary software companies. Now, in an entirely new & #disturbing way.
I really think these cases where proprietary software ends up in #LLM training sets & actually creates risk are exceedingly rare, if not entirely hypothetical.
@bkuhn @cwebber @bwana @zacchiro @richardfontana Permissive licenses except extreme ones like maybe 0BSD don't allow LLM training or regurgitation either. They almost universally require copyright notice & attribution which slop can't do.
-
@bkuhn @cwebber @zacchiro @richardfontana We don't see any successful litigation yet stopping abusive "AI" companies from scooping up any free software, regardless of the strength of the license. It's a matter of power imbalance and who controls the legal system, not who's right or wrong. I don't thing stronger license terms will save us. Only organizing to bring them down can do that.
-
@bkuhn @cwebber @bwana @zacchiro @richardfontana Permissive licenses except extreme ones like maybe 0BSD don't allow LLM training or regurgitation either. They almost universally require copyright notice & attribution which slop can't do.
Have you seen attribution lawsuits anywhere in the 50 years of the BSD license?
Of course, you have a point. But we're facing possible compulsory licensing if we don't improve our strategy. We may do the wrong thing, & if a BSD-licensed project wants to burn it down & sue a copylefted project for non-attribution infringement, let's deal with that when it comes.
& I'm speaking as one of the few who has actually been threatened with such a lawsuit.
-
@bkuhn @cwebber @zacchiro @richardfontana We don't see any successful litigation yet stopping abusive "AI" companies from scooping up any free software, regardless of the strength of the license. It's a matter of power imbalance and who controls the legal system, not who's right or wrong. I don't thing stronger license terms will save us. Only organizing to bring them down can do that.
@dalias
Any complex public policy litigation takes ≥ 10 years to make its way through the Courts. *Thaler* was an anomaly precisely because Thaler narrowed the issue on purpose to a pointless degree (just to make a point, apparently).
The reason copyleft exists is because rather than wait for the Courts & legislature to make good law, we used the existing system against itself.I propose we design a series of moves that can do the same for LLM-backed genAI.
-
@dalias
Any complex public policy litigation takes ≥ 10 years to make its way through the Courts. *Thaler* was an anomaly precisely because Thaler narrowed the issue on purpose to a pointless degree (just to make a point, apparently).
The reason copyleft exists is because rather than wait for the Courts & legislature to make good law, we used the existing system against itself.I propose we design a series of moves that can do the same for LLM-backed genAI.
@bkuhn @cwebber @zacchiro @richardfontana At the time we used the existing system against itself, it was largely a working system. Corporations actually feared "IP" law. They put
and
and
all over the place meticulously. Companies had lost theirs by not doing everything by the letter of the law in the past.Nowadays, we hardly have a rule of law to begin with, and corporations just get to settle any infringement they're caught doing for 0.01% of the profit they made off the infringement. I don't see a way to use this system against itself. It's going to take more drastic things to tear it down.
-
@bkuhn @cwebber @zacchiro @richardfontana At the time we used the existing system against itself, it was largely a working system. Corporations actually feared "IP" law. They put
and
and
all over the place meticulously. Companies had lost theirs by not doing everything by the letter of the law in the past.Nowadays, we hardly have a rule of law to begin with, and corporations just get to settle any infringement they're caught doing for 0.01% of the profit they made off the infringement. I don't see a way to use this system against itself. It's going to take more drastic things to tear it down.
I disagree: the system never “worked”. It was always stacked against users' rights & freedom to share.
I have said elsewhere in this mega-thread: Big Tech vs. Big Content battle in the Courts has so many moving parts that we'll never predict the outcome quite right. FOSS is adrift on that sea, & we have to do the best we can now with the information we have.
As I keep saying, the boycotts have not solved the problem.
What's next?
-
@RichardJActon
The copyleft-ish hack I propose is *we* (FOSS community) assume that any output of an LLM-backed genAI system *is* copylefted (since we are pretty sure all such systems — at least those designed for software development assist — have been trained on copylefted codebases).
Then, we copyleft any work that comes out of the system.
The only threat is proprietary software in the training set, & the industry can't abide enforcing *that*!
@cwebber @ossguy @richardfontana
@evan
@kees -
@novalis
I agree with your supporting arguments but not the conclusion.It goes back to the mutually assured destruction idea: no one in the for profit proprietary software industry is going to bring a lawsuit because they are so invested in LLM-backed AI succeeding.
That's where our commons differs widely from other creative works of expression.
I am worried about compulsory licensing for *training* —could be a disaster — but unrelated to output.
@bkuhn @zacchiro @cwebber @ossguy @richardfontana Actually, I just thought of one proprietary software company that would be much happier not to have LLMs around: Salesforce. Nobody's going to buy their overpriced shit when the alternative is to vibecode something that works exactly with your business process and that you can change, yourself, any time you want at the cost of a couple hundred bucks of Claude and a few hours of work.
-
If there was ever a time in 40+ years of #FOSS history to tell our #copyleft-hating FOSS friends that they erred in their license choices, now is the time.
If they don't switch, they're giving hand-outs to the proprietary software companies. Now, in an entirely new & #disturbing way.
I really think these cases where proprietary software ends up in #LLM training sets & actually creates risk are exceedingly rare, if not entirely hypothetical.
@bkuhn This might be of interest, https://en.wikipedia.org/wiki/Open_source_license_litigation.