Make illegally trained LLMs public domain as punishment : technology

[–] [email protected] 0 points 17 hours ago

Nice one

[–] [email protected] 3 points 1 day ago (1 children)

Are you threatening me with a good time?

First of all, whether these LLMs are "illegally trained" is still a matter before the courts. When an LLM is trained it doesn't literally copy the training data, so it's unclear whether copyright is even relevant.

Secondly, I don't think that making these models "public domain" would have the negative effects that people angry about AI think it would. When a company is running a closed model internally, like ChatGPT for example, the model is never available for download in the first place. It doesn't matter if it's public domain or not because you can't get a copy of it. When a company releases an open-weight model for public use, on the other hand, they usually encumber them with some sort of license that makes them harder for competitors to monetize or build on. Making those public-domain would greatly increase their utility. It might make future releases less likely, but in the meantime it'll greatly enhance AI development.

[–] [email protected] 2 points 1 day ago (4 children)

The LLM does reproduce copyrighted data though.

[–] [email protected] 3 points 1 day ago

How?

[–] [email protected] 2 points 1 day ago

*it can produce data identical to data that has been copyrighted before

load more comments (2 replies)

[–] [email protected] 2 points 1 day ago* (last edited 1 day ago)

Only if they were trained on public material.

[–] [email protected] 1 points 1 day ago

Doesn't seem like this helps out all the writers / artists that the LLM stole from.

[–] [email protected] 0 points 1 day ago

Yes!

[+] [email protected] -15 points 1 day ago (3 children)

Your data is worthless. Only Linux type zealots (conspiracy theorists) harp on that. Ever copied a meme and shared it elsewhere?

[–] [email protected] 4 points 1 day ago

Negative reputation troll.

[–] [email protected] 0 points 22 hours ago

Stay in your hugbox bro.

[–] [email protected] -1 points 1 day ago

Not only that, but copyright applies to copying, not reading, which is what it’s doing.

[+] [email protected] -6 points 1 day ago* (last edited 1 day ago) (8 children)

I mean, if we really are following the spirit of copyright, since no-one at open AI or other companies developed matrix and vector multiplication (operations existing in the public domain because Platonism is a thing).

Edit: oh my, I guess the consensus is that stealing the work of mathematicians is ok (or more, classifying our constructions as discoveries).

[–] [email protected] 0 points 1 day ago (4 children)

What is this perspective?

load more comments (4 replies)

load more comments (7 replies)

Technology

Our Rules

Approved Bots