this post was submitted on 16 Aug 2023
1256 points (94.0% liked)

Technology

59434 readers
2976 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
1256
Google search is over (mastodon.social)
submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]
 

Via @[email protected]

Right now if you search for "country in Africa that starts with the letter K":

  • DuckDuckGo will link to an alphabetical list of countries in Africa which includes Kenya.

  • Google, as the first hit, links to a ChatGPT transcript where it claims that there are none, and summarizes to say the same.

This is because ChatGPT at some point ingested this popular joke:

"There are no countries in Africa that start with K." "What about Kenya?" "Kenya suck deez nuts?"

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 113 points 1 year ago (2 children)

Oh, this is great... And because the ChatGPT transcript is highly ranked on Google, it's almost certainly going to be used for training ChatGPT. A feedback loop of shitty information. Praise ChatGPT!

[–] [email protected] 50 points 1 year ago (2 children)

Remember GIGO (Garbage In, Garbage Out). Due to years of SEO and content farming (which google profited from, so you get what you deserve assholes) most of the internet, by volume, is self-congratulatory, for profit, garbage, or, you know, reddit garbage. Hopefully someone points a large LLM at the library of congress or other large, well curated data source, but of course copyright will not allow, thanks mickey mouse. Wouldn't surprise me if the military is already on it, hopefully that leaks...

[–] [email protected] 22 points 1 year ago (1 children)

LLMs will eventually start feeding of search results from other LLMs and they'll just start regurgitating each others nonsense. If that isn't happening already.

Nobody is going to point an LLM at a good data source because that would mean spending money on actually useful stuff not fast cars and booze.

[–] [email protected] 4 points 1 year ago

Nobody is going to point an LLM at a good data source because that would mean spending money on actually useful stuff not fast cars and booze.

It would also mean that you need to sort through that data, and most people don't have the time or money to bother, not when it might reduce their data pool.

[–] [email protected] 2 points 1 year ago (1 children)

SEO and content farming (which google profited from

How exactly?

inb4 "google it" :D

[–] [email protected] 9 points 1 year ago

The purpose of content farming is to sell ads, google gets a cut, lion's share most likely.

[–] [email protected] 11 points 1 year ago (3 children)

It’s like an LLM incest party

[–] [email protected] 7 points 1 year ago

Sweet Home LLM!

[–] [email protected] 5 points 1 year ago

Shouldn't have bought all those stupid reddit comments