this post was submitted on 03 Feb 2025
663 points (98.5% liked)

Technology

61346 readers
3928 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

Originality.AI looked at 8,885 long Facebook posts made over the past six years.

Key Findings

  • 41.18% of current Facebook long-form posts are Likely AI, as of November 2024.
  • Between 2023 and November 2024, the average percentage of monthly AI posts on Facebook was 24.05%.
  • This reflects a 4.3x increase in monthly AI Facebook content since the launch of ChatGPT. In comparison, the monthly average was 5.34% from 2018 to 2022.
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 1 points 35 seconds ago

The other 60% are old people re-sharing it.

[–] [email protected] 2 points 28 minutes ago* (last edited 28 minutes ago)

In the last month it has become a barrage. The algorithms also seem to be in overdrive. If I like something I get bombarded with more stuff like that within a day. I'd say 90% of my feed is shit that has nothing to do with anyone I know.

If it wasn't a way to stay in touch with family and friends I'd bail.

[–] [email protected] 6 points 1 hour ago (1 children)

I was wondering who Facebook was for, good to know AI has low standards

[–] [email protected] 2 points 1 hour ago

Dead internet theory

[–] [email protected] 20 points 3 hours ago (1 children)

FB has been junk for more than a decade now, AI or no.

I check mine every few weeks because I'm a sports announcer and it's one way people get in contact with me, but it's clear that FB designs its feed to piss me off and try to keep me doomscrolling, and I'm not a fan of having my day derailed.

[–] [email protected] 8 points 3 hours ago (2 children)

I deleted facebook in like 2010 or so, because i hardly ever used it anyway, it wasn't really bad back then, just not for me. 6 or so years later a friend of mine wanted to show me something on fb, but couldn't find it, so he was just scrolling, i was blown away how bad it was, just ads and auto played videos and absolute garbage. And from what i understand, it just got worse and worse. Everyone i know now that uses facebook is for the market place.

[–] [email protected] 2 points 2 hours ago* (last edited 2 hours ago)

My brother gave me his Facebook credentials so I could use marketplace without bothering him all the time. He's been a liberal left-winger all his life but for the past few years he's taken to ranting about how awful Democrats are ("Genocide Joe" etc.) while mocking people who believe that there's a connection between Trump and Putin. Sure enough, his Facebook is filled with posts about how awful Democrats are and how there's no connection between Trump and Putin - like, that's literally all that's on there. I've tried to get him to see that his worldview is entirely created by Facebook but he just won't accept it. He thinks that FB is some sort of objective collator of news.

In my mind, this is really what sets social media apart from past mechanisms of social control. In the days of mass media, the propaganda was necessarily a one-size-fits-all sort of thing. Now, the pipeline of bullshit can be custom-tailored for each individual. So my brother, who would never support Trump and the Republicans, can nevertheless be fed a line of bullshit that he will accept and help Trump by not voting (he actually voted Green).

[–] [email protected] 4 points 3 hours ago

It's such a cesspit.

I'm glad we have the Fediverse.

[–] [email protected] 4 points 2 hours ago

And 58.82% are likely generated by human junk then.

[–] [email protected] 3 points 3 hours ago

If you want to visit your old friends in the dying mall. Go to feeds then friends. Should filter everything else out.

[–] [email protected] 1 points 2 hours ago

Not my Annie! No! Not my Annie!

[–] [email protected] 13 points 6 hours ago (1 children)

This kind of just looks like an add for that companies AI detection software NGL.

[–] [email protected] 10 points 5 hours ago

this whole concept relies on the idea that we can reliably detect AI, which is just not true. None of these "ai detector" apps or services actually works reliably. They have terribly low success rates. the whole point of LLMs is to be indistinguishable from human text, so if they're working as intended then you can't really "detect" them.

So all of these claims, especially the precision to which they write the claims (24.05% etc), are almost meaningless unless the "detector" can be proven to work reliably.

[–] [email protected] 1 points 3 hours ago (1 children)

That’s an extremely low sample size for this

[–] [email protected] 4 points 2 hours ago (1 children)

8,855 long-form Facebook posts from various users using a 3rd party. The dataset spans from 2018 to November 2024, with a minimum of 100 posts per month, each containing at least 100 words.

seems like thats a good baseline rule and that was about the total number that matched it

[–] [email protected] 2 points 2 hours ago* (last edited 2 hours ago) (1 children)

With apparently 3 billion active users

Only summing up 9k posts over a 6 year stretch with over 100 words feels like an outreach problem. Conclusion could be drawn that bots have better reach

[–] [email protected] 1 points 2 hours ago (1 children)

each post has to be 100 words with at least 100 posts a month

how many actual users do that?

[–] [email protected] 2 points 2 hours ago* (last edited 2 hours ago) (1 children)

I have no idea because I don’t use the site

But to say less than 0.0001% just seems hard to believe

[–] [email protected] 2 points 2 hours ago (1 children)

I don't use the site either but 100 words is a lot for a facebook post

[–] [email protected] 1 points 33 minutes ago

My number also assumes one post per person so it’s overestimating the %

[–] [email protected] 1 points 3 hours ago (1 children)

If you could reliably detect "AI" using an "AI" you could also use an "AI" to make posts that the other "AI" couldn't detect.

[–] [email protected] 5 points 3 hours ago (1 children)

Sure, but then the generator AI is no longer optimised to generate whatever you wanted initially, but to generate text that fools the detector network, thus making the original generator worse at its intended job.

[–] [email protected] 1 points 3 hours ago (1 children)

I see no reason why "post right wing propaganda" and "write so you don't sound like "AI" " should be conflicting goals.

The actual argument why I don't find such results credible is that the "creator" is trained to sound like humans, so the "detector" has to be trained to find stuff that does not sound like humans. This means, both basically have to solve the same task: Decide if something sounds like a human.

To be able to find the "AI" content, the "detector" would have to be better at deciding what sounds like a human than the "creator". So for the results to have any kind of accuracy, you're already banking on the "detector" company having more processing power / better training data / more money than, say, OpenAI or google.

But also, if the "detector" was better at the job, it could be used as a better "creator" itself. Then, how would we distinguish the content it created?

[–] [email protected] 1 points 2 hours ago

I'm not necessarily saying they're conflicting goals, merely that they're not the same goal.

The incentive for the generator becomes "generate propaganda that doesn't have the language chatacteristics of typical LLMs", so the incentive is split between those goals. As a simplified example, if the additional incentive were "include the word bamboo in every response", I think we would both agree that it would do a worse job at its original goal, since the constraint means that outputs that would have been optimal previously are now considered poor responses.

Meanwhile, the detector network has a far simpler task - given some input string, give back a value representing the confidence it was output by a system rather than a person.

I think it's also worth considering that LLMs don't "think" in the same way people do - where people construct an abstract thought, then find the best combinations of words to express that thought, an LLM generates words that are likely to follow the preceding ones (including prompts). This does leave some space for detecting these different approaches better than at random, even though it's impossible to do so reliably.

But I guess really the important thing is that people running these bots don't really care if it's possible to find that the content is likely generated, just so long as it's not so obvious that the content gets removed. This means they're not really incentivised to spend money training models to avoid detection.

[–] [email protected] 40 points 10 hours ago (1 children)

Keep in mind this is for AI generated TEXT, not the images everyone is talking about in this thread.

Also they used an automated tool, all of which have very high error rates, because detecting AI text is a fundamentally impossible task

[–] [email protected] 3 points 8 hours ago (1 children)

AI does give itself away over "longer" posts, and if the tool makes about an equal number of false positives to false negatives then it should even itself out in the long run. (I'd have liked more than 9K "tests" for it to average out, but even so.) If they had the edit history for the post, which they didn't, then it's more obvious. AI will either copy-paste the whole thing in in one go, or will generate a word at a time at a fairly constant rate. Humans will stop and think, go back and edit things, all of that.

I was asked to do some job interviews recently; the tech test had such an "animated playback", and the difference between a human doing it legitimately and someone using AI to copy-paste the answer was surprisingly obvious. The tech test questions were nothing to do with the job role at hand and were causing us to select for the wrong candidates completely, but that's more a problem with our HR being blindly in love with AI and "technical solutions to human problems".

"Absolute certainty" is impossible, but balance of probabilities will do if you're just wanting an estimate like they have here.

[–] [email protected] 3 points 8 hours ago (3 children)

I have no idea whether the probabilities are balanced. They claim 5% was AI even before chatgpt was released, which seems pretty off. No one was using LLMs before chatgpt went viral except for researchers.

[–] [email protected] 1 points 28 minutes ago

chat bots have been a thing, for a long time. I mean, a half decently trained Markov can handle social media postings and replies

[–] [email protected] 3 points 5 hours ago

Chatbots doesn't mean that they have a real conversation. Some just spammed links from a list of canned responses, or just upvoted the other chat bots to get more visibility, or the just reposted a comment from another user.

[–] [email protected] 2 points 6 hours ago

Im pretty sure chatbots were a thing before AI. They certainly werent as smart but they did exists.

[–] [email protected] 5 points 7 hours ago (1 children)

Title says 40% of posts but the article says 40% of long-form posts yet doesn't in any way specify what counts as a long-form post. My understanding is that the vast majority of Facebook posts are about the lenght of a tweet so I doubt that the title is even remotely accurate.

[–] [email protected] 1 points 4 hours ago

Yeah, the company that made the article is plugging their own AI-detection service, which I'm sure needs a couple of paragraphs to be at all accurate. For something in the range of just a sentence or two it's usually not going to be possible to detect an LLM.

[–] [email protected] 17 points 10 hours ago (3 children)

> uses ai slop to illustrate it

[–] [email protected] 2 points 2 hours ago

That laptop lol.

[–] [email protected] 11 points 10 hours ago (1 children)

The most annoying part of that is the shitty render. I actually have an account on one of those AI image generating sites, and I enjoy using it. If you're not satisfied with the image, just roll a few more times, maybe tweak the prompt or the starter image, and try again. You can get some very cool-looking renders if you give a damn. Case in point:

[–] [email protected] 5 points 9 hours ago

😍this is awesome!

A friend of mine has made this with your described method:

PS: 😆the laptop on the illustration in the article! Someone did not want pay for high end model and did not want to to take any extra time neither…

load more comments (1 replies)
[–] [email protected] 66 points 14 hours ago (4 children)

It's incredible, for months now I see some suggested groups, with an AI generated picture of a pet/animal, and the text is always "Great photography". I block them, but still see new groups every day with things like this, incredible...

[–] [email protected] 32 points 12 hours ago (3 children)

I have a hard time understanding facebook’s end game plan here - if they just have a bunch of AI readers reading AI posts, how do they monetize that? Why on earth is the stock market so bullish on them?

[–] [email protected] 27 points 12 hours ago (2 children)

As long as they can convince advertisers that the enough of the activity is real or enough of the manipulation of public opinion via bots is in facebook's interest, bots aren't a problem at all in the short-term.

load more comments (2 replies)
[–] [email protected] 15 points 10 hours ago (5 children)

Engagement.

It’s all they measure, what makes people reply to and react to posts.

People in general are stupid and can’t see or don’t care if something is AI generated

load more comments (5 replies)
load more comments (1 replies)
load more comments (3 replies)
[–] [email protected] 30 points 14 hours ago (5 children)

I’ve posted a notice to leave next week. I need to scrape my photos off, get any remaining contacts, and turn off any integrations. I was only there to connect with family. I can email or text.

FB is a dead husk fake feeding some rich assholes. If it’s coin flip AI, what’s the point?

load more comments (5 replies)
[–] [email protected] 5 points 9 hours ago (1 children)

Probably on par with the junk human users are posting

load more comments (1 replies)
load more comments
view more: next ›