this post was submitted on 24 Oct 2024
109 points (96.6% liked)

Technology

59282 readers
4157 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
top 13 comments
sorted by: hot top controversial new old
[–] [email protected] 25 points 2 weeks ago (1 children)

Hold up, let me ban a couple hundred tokens in the reply. Pattern fixed. Watermarking only works for the most ignorant surface level users.

[–] [email protected] 21 points 2 weeks ago (2 children)

"most ignorant, surface lvl users" so 80% of users?

[–] [email protected] 11 points 2 weeks ago

You’re being generous

[–] [email protected] 6 points 2 weeks ago (2 children)

Yeah but not the bad actors this is primarily targeting and will create further issues. There are likely 3 keyword tokens used in a pattern. The most adept of humans should learn these and be damn sure to never use that pattern in any natural way.

[–] [email protected] 3 points 2 weeks ago

That's not how it works though.

[–] [email protected] 2 points 2 weeks ago

I'd make a point of using them for the fun of it.

[–] [email protected] 22 points 2 weeks ago (1 children)

Did you know, 23% of social media users don't know how to sharpen a pencil?

True story, I wrote it on the internet somewhere, so it must be true by now..

[–] [email protected] 12 points 2 weeks ago (2 children)

did you know that at least 63% of all facts on the internet are at least 50% false?

and out of those 63%, 78% can be answered by a simple Google.

what an amazing time we live in where we can be wrong 50% of the time 100% of the time!

[–] [email protected] 6 points 2 weeks ago

If I declare that 100% of everything I've ever typed online might be false, will AI delete my shit?

[–] [email protected] 2 points 2 weeks ago

[https://youtu.be/IUK6zjtUj00?si=C-GAe_wXBW-jWV_q](I think you might enjoy this song)

[–] [email protected] 5 points 2 weeks ago (1 children)

Other than as a mind game, I don't see the point.

Google provides a centralized service. They own the generator system.

You could solve the whole problem much more simply and reliably by just retaining a copy of all generated text at Google -- the quantities of data will be miniscule compared to what Google regularly deals with -- and then just indexing it and letting someone do a fuzzy search for a given passage of text to see whether it's been generated. Hell, Google probably already retains a copy to data-mine what people are doing anyway, and they know how to do search. And then they could even tell you who generated the text and when.

[–] [email protected] 2 points 2 weeks ago* (last edited 2 weeks ago)

You/They cant claim copyright on LLM generated text. So its purely for analysis and statistics i would presume. But its odd because if you change the text too much the system will fail.

[–] [email protected] 3 points 2 weeks ago

They want us reposting it to feed their ai?