this post was submitted on 27 Nov 2024
26 points (88.2% liked)

Free Software

1071 readers
4 users here now

What is free software?

Free software is software that respects the 4 software freedoms. The 4 freedoms are

Please note: Free software does not relate to monetary price. Free software can be sold or gratis (no cost)

Rules:

  1. Please keep on topic
  2. Follow the Lemmy.zip rules
  3. No memes
  4. No "circle jerking" or inflammatory posts
  5. No discussion of illegal content

Please report anything you believe to violate the rules and be sure to include rhetoric on why you think it should be removed.

If you would like to contest mod actions please DM me with your rational as to why you feel that the relivant mod action should be reversed. Remember to use rhetoric and to site any relevant sources. You will only get one chance to argue your point and continued harassment will result in a ban.

Overall this community is pretty laid back and none if the things list above normally are an issue.

founded 1 year ago
MODERATORS
 

How can I add a simple requirement "do not train Al on the source code of the program" to AGPLv3 or GPLv3 and thereby create a new license?

Don't know is it a good place for such a question but I try :).

Why did I come up with such an stupid idea? There have been reported cases where artificial intelligence such as Github Copilot has been trained on many open source and free software projects, and in some cases it can output code snippets from GPL-licensed projects without specifying it. https://www.pixelstech.net/article/1682104779-GitHub-Copilot-may-generate-code-containing-GPL-code

I am not a lawyer, and I do not know where it is better to insert such a requirement. And how to formulate it in the best and correct form.

I understand it maybe complicated to check, to comply with this requirement and it may cause other difficulties, but I still think it can be a useful addition.

How to fit it with the fundamental freedoms of the GPL or it is unfitable?

I understand that this would make the license non-free, since it puts constraints on what the code can be used for. It's sad that it doesn't combine in some way. Maybe change requirements to do not train "closed source AI"(without code and training data of AI model publicly available).

And how can I name it? Is it better to name it without "GPL" If this new license cannot be considered free? NoAIFL or your variants :)?

Is it good to just add a new item?

For example like this:

Additional Clause:
You may not use the source code of this program, or any part thereof, to train any artificial intelligence model, machine learning model, or similar system without explicit written permission from the copyright holder.

or

Section [X]:
Restrictions on AI Training You may not use the source code of this program, or any part thereof, to train any artificial intelligence model, machine learning model, or similar system without explicit written permission from the copyright holder.

What you think about it? Maybe you already know licenses like this?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 2 points 3 weeks ago* (last edited 3 weeks ago) (1 children)

Thank you very much for your reply. I support your opinion in a way that I am already inclined that a complete prohibition on the training of "AI" models on the source code of software is not a very good solution and is difficult to limit according to current laws. I hope somtimes someone smart will come up with some approaches to such problems.

[โ€“] [email protected] 2 points 3 weeks ago* (last edited 3 weeks ago)

Indeed. I hope so. And we desperately need some clear regulations. Even the big AI companies struggle with the lack of clear rules. I can see how we need to go through quite some legal battles to settle some questions arising with the new technology. And that's currently taking place. But it extends past that. Currently, companies are retreating from the European market. Due to a completely unmanageable situation. I've seen local language models (starting with Llama 3.2) being banned / not licensed within the EU. And that's going to lead to all kinds of complications. Just because the EU can't get some proper regulations out, and do it in time. That'll leave technological progress behind in the EU, mess with companies. In effect also take away my freedom to run language model on my own hardware...

I hope they get that straight. And there is some demand... So maybe it's happening sooner than later. But these are very difficult questions to answer. About AI safety, copyright, effect and impact on society and freedom... And I think a lot of these questions are difficult to tackle with licensing anyways. We definitely need laws governing if AI training is fair use. Or if generating a voice that sounds 70% like David Attenborough is alright to do.