226
submitted 1 week ago by [email protected] to c/[email protected]

Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

you are viewing a single comment's thread
view the rest of the comments
[-] [email protected] 2 points 5 days ago* (last edited 5 days ago)

They're just using very simple scrapers that don't have any knowledge about how the site operates. The simplest counter would probably be using Anubis on the web interface.

I wouldn't mind waiting 2-3 seconds when first loading the site and mobile apps would remain unaffected since they use the API.

this post was submitted on 08 Aug 2025
226 points (99.6% liked)

Privacy

2216 readers
256 users here now

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

founded 2 years ago
MODERATORS