The Open-Source Software Saving the Internet From AI Bot Scrapers
(www.404media.co)
Interesting. Judging by that option's name, it seems to refer to use of the HTML <meta> tag to refresh a page: https://developer.mozilla.org/en-US/docs/Web/HTML/Reference/Elements/meta/http-equiv
Neither this tag nor using it for refresh is new at all. I don't think I've seen it used to detect bots, though. I wonder what Anubis is doing here.
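For anyone wondering what a refresh-based check could look like, here is a minimal Python sketch. It is a guess at the general idea, not Anubis's actual code: the names `issue_token`, `challenge_page`, and `verify_token`, the `/?challenge=` URL, and the timing limits are all made up for illustration. The server hands out a page whose <meta> tag tells the browser to reload with a signed token, then only accepts the token if it comes back after the delay.

```python
import hashlib
import hmac
import secrets
import time
from html import escape
from typing import Optional

# Hypothetical server-side secret; a real deployment would load it from config.
SECRET = secrets.token_bytes(32)


def _sign(client_id: str, ts: str) -> str:
    """HMAC over the client identifier and issue time."""
    msg = f"{client_id}|{ts}".encode()
    return hmac.new(SECRET, msg, hashlib.sha256).hexdigest()


def issue_token(client_id: str, now: Optional[float] = None) -> str:
    """Return a 'timestamp.signature' token bound to one client."""
    ts = str(int(time.time() if now is None else now))
    return f"{ts}.{_sign(client_id, ts)}"


def challenge_page(token: str, delay_seconds: int = 1) -> str:
    """Build an HTML page that asks the browser to reload with the token."""
    target = escape(f"/?challenge={token}", quote=True)
    return (
        "<!doctype html><html><head>"
        f'<meta http-equiv="refresh" content="{delay_seconds}; url={target}">'
        "</head><body>Checking your browser...</body></html>"
    )


def verify_token(client_id: str, token: str, now: Optional[float] = None,
                 min_age: int = 1, max_age: int = 60) -> bool:
    """Accept the token only if it is authentic and came back after the delay."""
    try:
        ts, sig = token.split(".", 1)
        issued = int(ts)
    except ValueError:
        return False
    if not hmac.compare_digest(sig.encode(), _sign(client_id, ts).encode()):
        return False
    age = (time.time() if now is None else now) - issued
    return min_age <= age <= max_age
```

A client that fetches the page but never follows the refresh never presents a valid token, so it never gets past the check. The `min_age` bound is an extra assumption here, meant to reject clients that reply faster than the requested delay.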
It's simply checking whether the connection comes from an actual browser: a scraper that only pretends to be one won't refresh the page as instructed. That buys some time, but like the rest of Anubis, it will only work until scrapers are modified to work around it.
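To illustrate why this only buys time: plain HTTP clients ignore <meta> refresh because it lives in the page body rather than in the HTTP response, but pulling the target out takes very little code. A rough Python sketch using only the standard library follows; `MetaRefreshFinder` and `find_refresh_target` are hypothetical names, and the parsing of the `content` attribute is simplified.

```python
from html.parser import HTMLParser
from typing import Optional


class MetaRefreshFinder(HTMLParser):
    """Record the URL of the first <meta http-equiv="refresh"> tag seen."""

    def __init__(self) -> None:
        super().__init__()
        self.target: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta" or self.target is not None:
            return
        attr = dict(attrs)
        if (attr.get("http-equiv") or "").lower() != "refresh":
            return
        # content looks like "1; url=/some/path"; keep the part after ';'.
        _, _, rest = (attr.get("content") or "").partition(";")
        rest = rest.strip()
        if rest.lower().startswith("url="):
            self.target = rest[4:].strip("'\" ")


def find_refresh_target(html: str) -> Optional[str]:
    """Return the refresh URL in the page, or None if there is none."""
    finder = MetaRefreshFinder()
    finder.feed(html)
    return finder.target
```

A scraper that calls something like this, waits out the delay, and requests the returned URL would look like a browser to a check that relies on refresh alone.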