this post was submitted on 26 Jul 2023
863 points (96.4% liked)

Technology

59581 readers
4728 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Thousands of authors demand payment from AI companies for use of copyrighted works::Thousands of published authors are requesting payment from tech companies for the use of their copyrighted works in training artificial intelligence tools, marking the latest intellectual property critique to target AI development.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 4 points 1 year ago (1 children)

Go ask a human for the lyrics of a song and then tell me that's transformative work.

Oh wait, no one would say that. This is why the discussion with non-technical people goes into the weeds.

[–] [email protected] 2 points 1 year ago (1 children)

Because it would be totally clear to anyone that reciting the lyrics of a song is not a transformative work, but instead covered by copyright.

The only reason why you can legally do it, is because you are not big enough to be worth suing.

Try singing a copyrighted song in TV.

For example, until it became clear that Warner/Chappell didn't actually own the rights to "Happy Birthday To You", they'd sue anyone who sung that song in any kind of broadcast or other big public thing.

Quote from Wikipedia:

The company continued to insist that one cannot sing the "Happy Birthday to You" lyrics for profit without paying royalties; in 2008, Warner collected about US$5,000 per day (US$2 million per year) in royalties for the song. Warner/Chappell claimed copyright for every use in film, television, radio, and anywhere open to the public, and for any group where a substantial number of those in attendance were not family or friends of the performer.

So if a human isn't allowed to reproduce copyrighted works in a commercial fashion, what would make you think that a computer reproducing copyrighted works would be ok?

And regarding derivative works:

Check out Vanilla Ice vs Queen. Vanilla Ice just used 7 notes from the Queen song "Under Pressure" in his song "Ice Ice Baby".

That was enough that he had to pay royalties for that.

So if a human has to pay for "borrowing" seven notes from a copyrighted work, why would a computer not have to?

[–] [email protected] 1 points 1 year ago

The key there is anyone profiting from the copyrighted work. I've been to big public events where the have sung Happy Birthday, things that may very have been recorded but none of us were sued because there was no damages, no profits lost.

The other big question is what are these lawsuits basing their complaint on. If I understand the Sarah Silverman claim is that she could go into ChatGPT and ask it for pages from her book and it generated them. Never once have i used ChatGPT and had it generate pages from her book so the question is the difference between my and her experience? The difference is she asked for that material. This may seem trivial but on the basis of how the technology works it's important.

You can go through their LLM and no where will you find her book. No where will you find pages of her book. No where will you find encoded or encrypted versions of her book. Rather, you'll find a data model with values showing the probability of a text output for given prompts. The model sometime generates valid responses and sometimes it gives wrong answers. Why? Because its a language model and not a library of text.

So the question now becomes, what is it the content creators are upset about? The fact that they asked it to generate content that turned out to match their own or that their content was used to teach the LLM. Because in no case is there a computer somewhere that has their text verbatim existing somewhere waiting to be displayed. If its about the output then I'd want to know how this is different than singing happy birthday. If I'm prompting the AI and then there are no damages, i don't use it for anything of fiduciary gains I'm not seeing an issue.