Can I get that written in a contract?
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
There is zero reason to believe this to be true.
For now
…at the moment
The API is so cheap someone is going to do it anyways...
That's my biggest issue with AI. Now all the APIs want me to pay to use them. Originally the API was an incentive to get engineers to build features for their sites without having to pay them. The engineers get data they need to build their thing and the company providing the API gets free product features and user acquisition. It was mostly a fair trade. Now the companies see a few other companies using that data to train API and make a fortune so their reaction is to charge a fortune for the API. Totally disregarding the previous arrangement. If you are an engineer working on something unrelated to AI you are basically shut down from using any APIs that provide useful data. Everything is locked down now when it once was open. It's so sad. It makes learning more restrictive
Yeah, I remember when Reddit promised similar things...
Nah, first you gotta get comfortable for a couple of years.
It's basically pig butchering for social networks.
......for now.
"Bluesky has not been offered enough money to scrape user data for AI"
To be fair, "they" could probably train AI on Lemmy data, they just won't ask for permission and won't be charged for it
AI is 100% being trained using Lemmy.
They also said it was decentralized which is not true.
I don't believe this.
well there's a protocol but everyone is on the main one, i don't think theres even a non personal instance.
Yeah, I looked into it and the backend is proprietary, so the central owner can restrict features. Like for instance independent instances can only have 10 users.
It's "decentralised" except only in extremely limited scope, the code is centrally controlled and the network remains largely, functionally centralised.
They're capitalising on the decentralised, federated buzz while doing it so poorly they're setting up users to say "oh people tried decentralisation, it doesn't work, look at Bluesky".
If it's not open source, it's not decentralised.
Bluesky is VC backed. They'll want to make money down the road, and they'll definitely train AI soon if not already.
Maybe the VC's are dying soon and they wanted to do something useful with their exorbitant wealth before they die
Funny!
Lol
What’s the problem of training AI with my posts?
Nothing if you do it yourself but someone else doing it without your permission and making tons of money off of it and not sharing it isn't very cool so this is nice.
Yeah, they should ask for permission, I don’t see the point of not sharing it though, they will only make tons of money if the AI is good, don’t they? I think ChatGPT and some other AIs are amazing and if they want my data for helping it, I would allow it.
Oh it will as soon as the investors demand more ReTurN oN iNvEsTmEnT.
Then we leave that platform too. I have zero loyalty. Zero.
But they'll have stragglers just the same as any major social media site. Even though many here have standards they won't easily abandon, there are scores of people that won't even know if/when AI started being used on the site or would care enough to leave if they did.
Plus every time we leave a platform we need to find or build a new one. The time it takes to get others to migrate and develop into a worthwhile community is hard to predict and it may not even work out. It sucks social media is such shit anymore, but it seems inevitable that it will remain that way given the landscape of the Internet at this point.
I say this as someone who's drifted from Fark to Digg to Reddit to Lemmy over the past 20-25 years 🎈 (zero loyalty as well)
Won't train AI on your posts ~~until we reach critical mass of users~~.
"Well, WE won't train on your data. But this subsidiary company we created on the other hand..."
Sounds exactly like something that someone intending to train an AI would say.
It's open to the public. So, many other orgs are certainly doing it anyway.
The same can be said of lemmy, mastodon or any publically accessible forum
Absolutely. I worked somewhere where we routinely had alarms go off due to botnets swarming us with weird (and obnoxious) massive download tactics (of publicly available user generated content, that is).
If it can be gotten by anyone, it will be gotten by LLM trainers.
BlueskAI on the other hand...
If the AT protocol allows public access to content, they can’t create a proprietary training set. But the content is available for anyone who wants to add it to a public training set.