👀 … https://sfconservancy.org/blog/2026/apr/15/eternal-november-generative-ai-llm/ …my colleague Denver Gingerich writes: newcomers' extensive reliance on LLM-backed generative AI is comparable to the Eternal September onslaught to USENET in 1993.

cwebber@social.coop

@evan @bkuhn @ai_cases I will admit that getting into a big ol licensing debate does feel very original-fediverse

cwebber@social.coop

@jens @bkuhn @ossguy @richardfontana This is indeed a serious risk, though tangential to this subthread. But it's a concern I also have.

trwnh@mastodon.social

@cwebber @bkuhn @ossguy @richardfontana sure, but my point is this would happen less often

jens@social.finkhaeuser.de

@cwebber @bkuhn @ossguy @richardfontana Fully tangential, agreed.

richardfontana@mastodon.social

@cwebber copyleft-only LLM is nonsensical , agreed @bkuhn @ossguy

cwebber@social.coop

@richardfontana @bkuhn @ossguy Glad to hear we agree there!

fuzzychef@m6n.io

@cwebber @bkuhn @ossguy @richardfontana

Based on my following of current legal cases, I think it's entirely possible that in a year or two we'll suddenly be rolling large OSS codebases back to 2023. And won't that be fun!

evan@cosocial.ca

@cwebber

Are you concerned that the LLMs generate nontrivial verbatim excerpts of copyrighted works?

Or that there is a hidden "intellectual property" in the deep patterns that they use?

Say, when an LLM was trained on a file I made with an interesting loop structure, and it emits code with a similar loop structure, even if the variable names, problem domain, details, or programming language differ.

What if a court says I can demand royalties for my "IP"?

@bkuhn @ossguy @richardfontana

richardfontana@mastodon.social

@cwebber I mean, as a practical idea worth contemplating. Could imagine it as an experiment by someone with sufficient resources. There were some highly ill-conceived efforts to create anti-copyleft models a few years ago @bkuhn @ossguy

evan@cosocial.ca

@cwebber @bkuhn @ossguy @richardfontana

Like, not copyrightable, not patents, but some secret third thing, kind of what people mean when we say that someone "copied our idea".

cwebber@social.coop

@evan @richardfontana I am saying we don't know the answer to that question, and it seems that @bkuhn and @ossguy agree that we don't know the answer to it, based on previous posts, and the lack of knowledge about what the copyright implications of LLM based contributions means that we are creating a schrodingers-licensing-timebomb for our FOSS codebases

cwebber@social.coop

@evan @bkuhn @ossguy @richardfontana I am talking about copyright

evan@cosocial.ca

@cwebber excellent, thanks!

@bkuhn @ossguy @richardfontana

cwebber@social.coop

@evan @bkuhn @ossguy @richardfontana Say for a moment that we *did* make a model which intentionally pulled in leaked source code from various proprietary codebases.

What would your opinion be on the legal-hazard state of accepting that code output? Would you consider it relatively safe from a copyright perspective?

richardfontana@mastodon.social

@cwebber I think adequate compliance might be possible with good enough detection/matching tools but I don't necessarily expect such tools to be developed (let alone available to foss projects) (my assumption is that the few such tools in use today are pretty bad) @bkuhn @ossguy

cwebber@social.coop

@richardfontana @bkuhn @ossguy That's a problem so hard it throws the "NP complete" debate out the window in favor of something brand new. Given that these codebases have no trouble "translating" from one language's source code into another, how on *earth* could you possibly hope to build a compliance tool around that?

Laughable, to anyone who tries.

richardfontana@mastodon.social

@cwebber to be clear compliance cannot somehow be built in to the LLM for reasons you stated, but ancillary tools for LLM users to reconstruct provenance exist and conceivably could be made more useful @bkuhn @ossguy

zacchiro@mastodon.xyz

@cwebber @bkuhn @ossguy @richardfontana

My current answer to your "is it safe" question is to answer a slightly different question. Namely: "is it any less safe than accepting code from a random employee that claims to be submitting under a inbound=outbound regime, whereas in fact they cannot?". The latter we have been doing for decades, with limited damages to the commons.

(I *also* think the legal odds are more in our favor with AI-assisted contributions than in the previous case.)

cwebber@social.coop

@zacchiro @bkuhn @ossguy @richardfontana While true, there is a big difference in that the previous scenario was someone out of compliance with what the community actually accepted as hygienic and acceptable contributions, and those contributions were relatively rare.

Saying that we don't need to worry about the risks from these tools right now from a licensing situation is different: it's advising on a path being acceptable where we *don't know* whether or not it's generally safe practice to recommend! And which most in this thread seem to agree we don't know. Even your post seems to say "it seems like it'll probably be okay and end up in our favor".

I guess I feel increasingly like I am maybe the only "oldschool FOSS licensing wonk" who cares about this, and maybe that means I should just give up.

But *damn* I can't believe it feels like when people are both saying "we don't know what the implications will be" we're also saying "so go ahead and say those patches are a-ok!"

cwebber@social.coop

@richardfontana As said here, given the "translation between languages" aspect, I can't really see that as likely to be true https://social.coop/@cwebber/116426770262334234

Which maybe that means that all this stuff really is public domain, a position I am *fully willing to accept*! But I don't think it's known (especially internationally), and I don't think @bkuhn or @ossguy are eager to adopt that perspective