31
submitted 1 day ago by [email protected] to c/[email protected]
top 7 comments
sorted by: hot top new old
[-] [email protected] 2 points 15 hours ago* (last edited 15 hours ago)

Its also interesting that this is the most conservative, pro “its not just memorizing” estimation possible : they multiplied the probabilities of consequent tokens. Basically it means if it starts shitting out a quote it will not be able to stop quoting until their anti copy the whole book finetuning kicks in after 50 words or so.

It can probably output far more under a realistic test (always picking the top token, temperature =0)

[-] [email protected] 0 points 1 day ago
[-] [email protected] 2 points 15 hours ago

Just the pronouns and articles, some of the verbs and adjectives

[-] [email protected] 21 points 1 day ago

Why is plagiarism an accomplishment? I can reproduce the entire Harry Potter book with a scanner and some OCR software.

[-] [email protected] 6 points 1 day ago

It's a sign that the AI companies are complicit in industrial level copyright violation. There was a recent US court decision that a book publisher brought again Anthropic. In it the judge rules that the use of the book contents was "spectacularly transformative" (and therefore "fair use") because the resulting machine did not copy the work.

People are trying to prove the judge wrong. It does copy the work.

[-] [email protected] 25 points 1 day ago

That's the point of the article, I think. They're saying AI models are copyright violators.

[-] [email protected] 9 points 1 day ago

It isn't. Its a sign of poor-quality work.

this post was submitted on 29 Jun 2025
31 points (91.9% liked)

Fuck AI

3272 readers
846 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 1 year ago
MODERATORS