this post was submitted on 25 Jul 2024
350 points (97.8% liked)

Technology

59143 readers
2264 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 69 points 3 months ago (3 children)

I'm sure they've convinced the board and the shareholders that this is some kind of big win. But I don't think it's going to be impressive for very long.

There's only so much value an AI can learn from reddit bullshit like "1. break off all contact 2. hit the gym 3. profit" and "the narwhal bacons at midnight" and endless boring pun threads.

[–] [email protected] 31 points 3 months ago

Short term profit is all they care about until this platform crashes down completely

[–] [email protected] 8 points 3 months ago (1 children)

It sounds a lot like this quote from Andrej Karpathy :

Turns out that LLMs learn a lot better and faster from educational content as well. This is partly because the average Common Crawl article (internet pages) is not of very high value and distracts the training, packing in too much irrelevant information. The average webpage on the internet is so random and terrible it's not even clear how prior LLMs learn anything at all.

[–] [email protected] 3 points 3 months ago* (last edited 3 months ago) (1 children)

So it will end in a downward spiral because it starts learning from AI articles, from which articles are being written, from which the AI learns, from which articles are being written ...

[–] [email protected] 1 points 3 months ago (1 children)

As long as there’s supervision during training, which there always will be, this isn’t really a problem. This just shows how bad it can get if you just train on generated stuff.

[–] [email protected] 3 points 3 months ago (1 children)

which there always will be

How? We just learned that they train on social media.

[–] [email protected] 1 points 3 months ago

They don't train on random social media posts. Everything is sorted and approved.

[–] [email protected] 4 points 3 months ago (1 children)

Well it learned to put glue on pizza, eat rocks, and smoke while pregnant.

[–] [email protected] 1 points 3 months ago

You forgot jumping off the Golden Gate bridge