this post was submitted on 15 Nov 2024
350 points (94.9% liked)

Technology

59434 readers
2976 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 16 points 1 day ago

Can I get that written in a contract?

[–] [email protected] 8 points 21 hours ago

There is zero reason to believe this to be true.

[–] [email protected] 21 points 1 day ago
[–] [email protected] 20 points 1 day ago

…at the moment

[–] [email protected] 15 points 1 day ago (1 children)

The API is so cheap someone is going to do it anyways...

[–] [email protected] 3 points 21 hours ago

That's my biggest issue with AI. Now all the APIs want me to pay to use them. Originally the API was an incentive to get engineers to build features for their sites without having to pay them. The engineers get data they need to build their thing and the company providing the API gets free product features and user acquisition. It was mostly a fair trade. Now the companies see a few other companies using that data to train API and make a fortune so their reaction is to charge a fortune for the API. Totally disregarding the previous arrangement. If you are an engineer working on something unrelated to AI you are basically shut down from using any APIs that provide useful data. Everything is locked down now when it once was open. It's so sad. It makes learning more restrictive

[–] [email protected] 23 points 1 day ago

Yeah, I remember when Reddit promised similar things...

[–] [email protected] 14 points 1 day ago

Nah, first you gotta get comfortable for a couple of years.

It's basically pig butchering for social networks.

[–] [email protected] 39 points 2 days ago (1 children)
[–] [email protected] 14 points 1 day ago

"Bluesky has not been offered enough money to scrape user data for AI"

[–] [email protected] 23 points 2 days ago* (last edited 2 days ago) (1 children)

To be fair, "they" could probably train AI on Lemmy data, they just won't ask for permission and won't be charged for it

[–] [email protected] 6 points 1 day ago

AI is 100% being trained using Lemmy.

[–] [email protected] 23 points 2 days ago (1 children)

They also said it was decentralized which is not true.

I don't believe this.

[–] [email protected] 6 points 2 days ago (1 children)

well there's a protocol but everyone is on the main one, i don't think theres even a non personal instance.

[–] [email protected] 6 points 1 day ago

Yeah, I looked into it and the backend is proprietary, so the central owner can restrict features. Like for instance independent instances can only have 10 users.

It's "decentralised" except only in extremely limited scope, the code is centrally controlled and the network remains largely, functionally centralised.

They're capitalising on the decentralised, federated buzz while doing it so poorly they're setting up users to say "oh people tried decentralisation, it doesn't work, look at Bluesky".

If it's not open source, it's not decentralised.

[–] [email protected] 20 points 2 days ago (1 children)

Bluesky is VC backed. They'll want to make money down the road, and they'll definitely train AI soon if not already.

[–] [email protected] 6 points 2 days ago (2 children)

Maybe the VC's are dying soon and they wanted to do something useful with their exorbitant wealth before they die

[–] [email protected] 2 points 20 hours ago
[–] [email protected] -2 points 22 hours ago (1 children)

What’s the problem of training AI with my posts?

[–] [email protected] 4 points 21 hours ago (1 children)

Nothing if you do it yourself but someone else doing it without your permission and making tons of money off of it and not sharing it isn't very cool so this is nice.

[–] [email protected] -2 points 21 hours ago

Yeah, they should ask for permission, I don’t see the point of not sharing it though, they will only make tons of money if the AI is good, don’t they? I think ChatGPT and some other AIs are amazing and if they want my data for helping it, I would allow it.

[–] [email protected] 134 points 2 days ago (3 children)

Oh it will as soon as the investors demand more ReTurN oN iNvEsTmEnT.

[–] [email protected] 38 points 2 days ago (2 children)

Then we leave that platform too. I have zero loyalty. Zero.

[–] [email protected] 6 points 2 days ago

But they'll have stragglers just the same as any major social media site. Even though many here have standards they won't easily abandon, there are scores of people that won't even know if/when AI started being used on the site or would care enough to leave if they did.

Plus every time we leave a platform we need to find or build a new one. The time it takes to get others to migrate and develop into a worthwhile community is hard to predict and it may not even work out. It sucks social media is such shit anymore, but it seems inevitable that it will remain that way given the landscape of the Internet at this point.

I say this as someone who's drifted from Fark to Digg to Reddit to Lemmy over the past 20-25 years 🎈 (zero loyalty as well)

load more comments (1 replies)
load more comments (2 replies)
[–] [email protected] 26 points 2 days ago (1 children)
load more comments (1 replies)
[–] [email protected] 76 points 2 days ago (2 children)
load more comments (2 replies)
[–] [email protected] 46 points 2 days ago

Won't train AI on your posts ~~until we reach critical mass of users~~.

[–] [email protected] 56 points 2 days ago (1 children)

"Well, WE won't train on your data. But this subsidiary company we created on the other hand..."

[–] [email protected] 40 points 2 days ago (1 children)

Or one of our 12675 carefully selected partners

load more comments (1 replies)
[–] dsilverz 39 points 2 days ago (3 children)

Sounds exactly like something that someone intending to train an AI would say.

load more comments (3 replies)
[–] [email protected] 19 points 2 days ago (1 children)

It's open to the public. So, many other orgs are certainly doing it anyway.

[–] [email protected] 10 points 2 days ago (1 children)

The same can be said of lemmy, mastodon or any publically accessible forum

[–] [email protected] 7 points 1 day ago

Absolutely. I worked somewhere where we routinely had alarms go off due to botnets swarming us with weird (and obnoxious) massive download tactics (of publicly available user generated content, that is).

If it can be gotten by anyone, it will be gotten by LLM trainers.

[–] [email protected] 32 points 2 days ago

BlueskAI on the other hand...

[–] [email protected] 28 points 2 days ago* (last edited 2 days ago)

If the AT protocol allows public access to content, they can’t create a proprietary training set. But the content is available for anyone who wants to add it to a public training set.

load more comments
view more: next ›