Open source license that doesn’t allow your code to be used for AI data training? (feddit.online)

submitted 4 months ago by cat_fishing@feddit.online to c/opensource@programming.dev

41 comments fedilink hide all child comments

Does a license like this exist?

you are viewing a single comment's thread
view the rest of the comments

[-] asudox@lemmy.asudox.dev 22 points 4 months ago

I found this, which adds additional text to the existing licenses to prohibit training an AI on the licensed code: https://github.com/non-ai-licenses/non-ai-licenses

Though, per OSI's definition, your code probably would no longer be open source, since training an LLM is technically considered a field of endeavour:

OSD number 6:

The license must not restrict anyone from making use of the program in a specific field of endeavor. For example, it may not restrict the program from being used in a business, or from being used for genetic research.

[-] cat_fishing@feddit.online 9 points 4 months ago

This is exactly what I was looking for. Thanks!

[-] veniasilente@lemmy.dbzer0.com 4 points 4 months ago

We need a new field of licensing, something like Ethical Source License. With AI being a thing on-the-field, and even before tbh, Open Source has alas become a paradigm of the past.

[-] hperrin@lemmy.ca 4 points 4 months ago

Maybe you could say that AI training is not a use of the program, it is a use of the source code.

[-] ulterno@programming.dev 4 points 4 months ago

OSI is not US court (or at least I hope not).
Playing on words isn't going to get the license accepted.

On the other hand, why does it have to be accepted?
You are doing something different. Just do the different thing.

[-] Orygin@sh.itjust.works 2 points 4 months ago

If we can play semantics, the program (the compiled binary) can be used for anything with no field restrictions.
But the code is not the program itself, it's the recipe, and usage could be restricted in some specific ways.
In my opinion, since free licenses already have restrictions regarding distribution, saying AI models trained on this data are derivative works and must be licensed compatible (ie training data set, methods and models themselves being free).
I feel it's a better middle ground where the freedom of users are not violated nor restricted, and the code/knowledge stays free

this post was submitted on 24 Feb 2026

61 points (93.0% liked)

Opensource

6346 readers

77 users here now

A community for discussion about open source software! Ask questions, share knowledge, share news, or post interesting stuff related to it!

Credits

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

⠀

founded 2 years ago

MODERATORS

pylapp@programming.dev