this post was submitted on 04 Apr 2024
293 points (94.3% liked)

It's a bit old, but I just learned about it via the Retro Dodo article here: https://retrododo.com/google-is-killing-retro-dodo/

[–] [email protected] 90 points 7 months ago (2 children)

ChatGPT4: tl;dr The universe is bigger than we thought.

ChatGPT5: fuck spez

[–] [email protected] 8 points 7 months ago (1 children)

Wrong company, it would be Gemini in this case

[–] [email protected] 60 points 7 months ago (7 children)

Is it just me or is 60 million a ridiculously small price for that whole dataset?

[–] [email protected] 69 points 7 months ago (6 children)

To be fair it's a pretty terrible dataset. The AI is just going to say "this" to every question you ask

[–] [email protected] 17 points 7 months ago (4 children)

Perhaps, but it's not worth buying if you can't make a profit or keep it from your competition.

60M is for almost 20 years of data, but once it's ingested, Google will only want new content. Next year, it'll be more like 3M if the dataset isn't poisoned by bots or the AI fad hasn't collapsed. Reddit will struggle with finances again and users will suffer. At least that's my prediction.

[–] [email protected] 7 points 7 months ago (3 children)

Spez has already grifted his money out of the initial stock pump so it literally doesn't matter. Reddit could shut down tomorrow and he'd be happy as a clam.

[–] [email protected] 12 points 7 months ago

I wonder if Google's unlimited legal budget plays a role. Not a lawyer, so probably way off here...

But, for example, reddit's success in part depends on Google ingesting their data: reddit shows up in Google searches all the time, which can only happen if Google uses reddit's content. So reddit telling Google "you can't use our content" doesn't work, and they need to say something like, "you can use our content for search results but you can't consume it as training data."

This is a pretty straightforward statement/request/demand, but one could imagine Google lawyers maliciously complying and throwing their hands up dramatically, claiming "well we use some amount of AI in our search results, so if we can't use your content for AI training then we can't risk using it for search results." Which would, I imagine, really, really hurt reddit (no Google results would be catastrophic I suspect).

So, perhaps the "low" 60M figure is just Google using their leverage.

Or not. As a random person on the Internet, I can say I'm probably not contributing anything meaningful here...

[–] [email protected] 6 points 7 months ago

How quickly you forget that half of it is just "I also choose this guy's wife" and "the narwhal bacon's at midnight"

[–] [email protected] 4 points 7 months ago

I'm personally curious whether Reddit actually has any ability to protect that database. I don't remember Reddit's TOS, but usually those things give them a license to use and copy the data, maybe even to sell it, but not the actual copyright on it. So if someone made a Reddit scraper and copied the comments, wouldn't only the actual commenter be able to sue?

$60M may be reflecting that, in that it's more a convenience fee to shield Google against individual Redditors going after them than something that Reddit itself could actually sue over.

[–] [email protected] 37 points 7 months ago (7 children)

Can't wait to see an AI chatbot in my Google searches that behaves like a typical redditor.

[–] [email protected] 26 points 7 months ago (1 children)

Everything you google is just going to direct you to a Let Me Google That For You link

[–] [email protected] 6 points 7 months ago

I love that site.

[–] [email protected] 6 points 7 months ago

"Hey Google AI, could you help me find a way to do _____?"

"Why the hell would you want that? Are you stupid? There's like 15 better ways to accomplish what I think your peanut brain is trying to accomplish."

"... never mind, Google."

[–] [email protected] 36 points 7 months ago* (last edited 7 months ago) (1 children)

Google: "I'm looking to make an AI that's incredibly opinionated, confidently incorrect, and prone to circlejerk behaviors."

Spez: "I mean OpenAI pretty much did that already, but if you want to pay me to recreate that then I got you."

[–] [email protected] 6 points 7 months ago

One thing that doesn't seem to get brought up is the AI porn angle. Gonewild is pretty big on reddit still. A lot of OnlyFans creators, and general perverts (respect). Reddit wants to commodify this content, so those people are selling the images of their bodies so AI can make porn of random strangers. Kinda fucked when you think about it.

[–] [email protected] 27 points 7 months ago (2 children)

AI be like “stfu regard”

[–] [email protected] 23 points 7 months ago

Steve Huffman looks increasingly douchier and shittier with every passing photo.

What a damn chode. Fuck that guy.

[–] [email protected] 21 points 7 months ago (3 children)

Oh no, my thousands of identical messages!

[–] [email protected] 11 points 7 months ago (2 children)

You sir are a scholar and a gentleman.

I also choose this man's wife.

[–] [email protected] 4 points 7 months ago

Scrolled too far down to find this

[–] [email protected] 18 points 7 months ago (1 children)

And this is how Skynet was born.

That one Microsoft Twitter bot turned into a full blown Nazi in just one day.

I can't even imagine how fucked up and depraved one trained on Reddit data will get.

[–] [email protected] 6 points 7 months ago* (last edited 7 months ago) (1 children)

They have a series of safeguards against that now. They've actually taken it in the extreme other direction now where it can't give you anything without injecting diversity in there somewhere.

Here's an example. This is what it produced when asked for an image of a German soldier in 1943.

[–] [email protected] 17 points 7 months ago (2 children)

Can someone point me toward that bot or whatever that changes all your old Reddit posts before deleting them? I thought I had it saved somewhere but I can't find it now and have no idea what it's called.

[–] [email protected] 14 points 7 months ago (4 children)

They keep copies of posts. We know because people who mass-edited their posts saw them reverted, or still get replies as if the posts were never edited.

[–] [email protected] 17 points 7 months ago* (last edited 7 months ago) (1 children)

I deleted my comment history after the API exodus. I'm sure they could dig it up if they wanted but at least they'll have to click like 3 more buttons if they want to train AI on my nonsense.

[–] [email protected] 17 points 7 months ago (1 children)

I hope the AI says fuck Spez a lot :-)

[–] [email protected] 11 points 7 months ago

As part of the deal, spez will personally train the AI Jailbait Model.

[–] [email protected] 10 points 7 months ago

60m? Ms got a steal, no wonder Reddit can’t monetize

[–] [email protected] 9 points 7 months ago

They can have my drunk shitposts. Joke's on them for buying my garbage.

[–] [email protected] 7 points 7 months ago

Grateful this is no longer my problem

[–] [email protected] 6 points 7 months ago (6 children)

is there a way to mass delete my old content? the service i used in the past doesn't seem to have worked. i recently got a reply on a 6-year-old post from someone saying they got there via google.

[–] [email protected] 5 points 7 months ago

Welp... Time to nuke it all.

[–] [email protected] 5 points 7 months ago

*your posts

[–] [email protected] 4 points 7 months ago

I wonder how they will decide how to dole that out to their contributors… oh, wait.

[–] [email protected] 4 points 7 months ago (1 children)

That's not great news when weighed against my desire to watch reddit crash and burn.

[–] [email protected] 7 points 7 months ago

All we can do is make something better; reddit will do their thing and we will do ours.
