this post was submitted on 31 Jul 2024
28 points (100.0% liked)
Chat
7500 readers
22 users here now
Relaxed section for discussion and debate that doesn't fit anywhere else. Whether it's advice, how your week is going, a link that's at the back of your mind, or something like that, it can likely go here.
Subcommunities on Beehaw:
This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
reddit recently updated their robots.txt to disallow all crawlers Google paid a bunch of money to have access to crawl reddit
You'll still see old stuff, but crawlers that care about robots.txt will get no new information.
The best part of that robots.txt is:
Sure Jan.
Reddit believes in ~~an open~~ pay to access internet, but not the ~~mis~~use of ~~public content~~ our content we didn't make.
https://search.brave.com/search?q=site%3Areddit.com+trump&source=web&tf=pd Past day is not that old
A few possibilities,
No matter the reason, well behaving crawlers will no longer crawl reddit, Everything is disallowed in the robots.txt