👀 … https://sfconservancy.org/blog/2026/apr/15/eternal-november-generative-ai-llm/ …my colleague Denver Gingerich writes: newcomers' extensive reliance on LLM-backed generative AI is comparable to the Eternal September onslaught to USENET in 1993.

cwebber@social.coop

@bkuhn @ossguy @LordCaramac @richardfontana

- There are plenty of FOSS projects we care about which are not under copyleft. What terms should they consider received code under? Should SDL now consider all LLM based output under the GPL? The AGPL? Which? Do you expect such a project to switch its license to copyleft now?
- Microsoft's proprietary code may not be, but plenty of proprietary code is available under extremely non-FOSS and restrictive licenses which are within datasets we are getting contributions from *today*
- The mutually assured destruction "safe option" isn't that things are under copyleft for proprietary companies though, that's still a losing scenario for them. So that doesn't help the case for copyleft, only accepting that LLM output under the public domain is (which we don't know)

cwebber@social.coop

@bkuhn @ossguy @LordCaramac @richardfontana It's somewhat of an aside, but my point regarding regarding Microsoft's codebase is not that Windows' code is in the inputs (this is true), my point was about a more interesting test for licence laundering is to launder a *leaked* proprietary codebase. If it's possible to copyright launder GPL'ed code, the equitable thing is that we should be able to copyright launder proprietary code. But again, that's somewhat of a tangent from the main points.

evan@cosocial.ca

@bkuhn @cwebber @ai_cases both great resources, tysm!

cwebber@social.coop

@trwnh @bkuhn @ossguy @richardfontana Plenty of Microsoft code has been released under "shared source" licenses and also leaks

jens@social.finkhaeuser.de

@cwebber @bkuhn @ossguy @richardfontana Worse IMHO is that we're putting FOSS as a movement at risk if we deskill everyone to the point where you either pay money to have code generated for you, or there is no code.

cwebber@social.coop

@evan @bkuhn @ai_cases I will admit that getting into a big ol licensing debate does feel very original-fediverse

cwebber@social.coop

@jens @bkuhn @ossguy @richardfontana This is indeed a serious risk, though tangential to this subthread. But it's a concern I also have.

trwnh@mastodon.social

@cwebber @bkuhn @ossguy @richardfontana sure, but my point is this would happen less often

jens@social.finkhaeuser.de

@cwebber @bkuhn @ossguy @richardfontana Fully tangential, agreed.

richardfontana@mastodon.social

@cwebber copyleft-only LLM is nonsensical , agreed @bkuhn @ossguy

cwebber@social.coop

@richardfontana @bkuhn @ossguy Glad to hear we agree there!

fuzzychef@m6n.io

@cwebber @bkuhn @ossguy @richardfontana

Based on my following of current legal cases, I think it's entirely possible that in a year or two we'll suddenly be rolling large OSS codebases back to 2023. And won't that be fun!

evan@cosocial.ca

@cwebber

Are you concerned that the LLMs generate nontrivial verbatim excerpts of copyrighted works?

Or that there is a hidden "intellectual property" in the deep patterns that they use?

Say, when an LLM was trained on a file I made with an interesting loop structure, and it emits code with a similar loop structure, even if the variable names, problem domain, details, or programming language differ.

What if a court says I can demand royalties for my "IP"?

@bkuhn @ossguy @richardfontana

richardfontana@mastodon.social

@cwebber I mean, as a practical idea worth contemplating. Could imagine it as an experiment by someone with sufficient resources. There were some highly ill-conceived efforts to create anti-copyleft models a few years ago @bkuhn @ossguy

evan@cosocial.ca

@cwebber @bkuhn @ossguy @richardfontana

Like, not copyrightable, not patents, but some secret third thing, kind of what people mean when we say that someone "copied our idea".

cwebber@social.coop

@evan @richardfontana I am saying we don't know the answer to that question, and it seems that @bkuhn and @ossguy agree that we don't know the answer to it, based on previous posts, and the lack of knowledge about what the copyright implications of LLM based contributions means that we are creating a schrodingers-licensing-timebomb for our FOSS codebases

cwebber@social.coop

@evan @bkuhn @ossguy @richardfontana I am talking about copyright

evan@cosocial.ca

@cwebber excellent, thanks!

@bkuhn @ossguy @richardfontana

cwebber@social.coop

@evan @bkuhn @ossguy @richardfontana Say for a moment that we *did* make a model which intentionally pulled in leaked source code from various proprietary codebases.

What would your opinion be on the legal-hazard state of accepting that code output? Would you consider it relatively safe from a copyright perspective?

richardfontana@mastodon.social

@cwebber I think adequate compliance might be possible with good enough detection/matching tools but I don't necessarily expect such tools to be developed (let alone available to foss projects) (my assumption is that the few such tools in use today are pretty bad) @bkuhn @ossguy