this post was submitted on 19 Jan 2026
TechTakes
I knew that AI scraping was bad, but after hosting a service online for a bit I'm just amazed at how bad it is.
I blocked the IP ranges:
47.80.0.0/13, 47.74.0.0/15, and 47.76.0.0/14 (all owned by Alibaba), and now over 90% of my access log is "forbidden by rule", because these bots are so poorly coded that they just ignore 403s. Of the 18522 requests I got today, only 230 were not forbidden.
If anything, they sped up after I blocked them. Since this comment was posted they have sent 4633 more requests, all of which were blocked.
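For anyone who wants to sanity-check numbers like these, here is a minimal Python sketch using only the standard library. It tests whether an address falls inside the three ranges quoted above and works out the forbidden share from the two request counts in the comment. The function names `is_blocked` and `forbidden_share` are made up for illustration and are not from any server config.

```python
import ipaddress

# The three ranges quoted in the comment above.
BLOCKED_NETWORKS = [
    ipaddress.ip_network("47.80.0.0/13"),
    ipaddress.ip_network("47.74.0.0/15"),
    ipaddress.ip_network("47.76.0.0/14"),
]


def is_blocked(addr: str) -> bool:
    """Return True if addr falls inside any of the blocked ranges."""
    ip = ipaddress.ip_address(addr)
    return any(ip in net for net in BLOCKED_NETWORKS)


def forbidden_share(total: int, allowed: int) -> float:
    """Fraction of requests that were rejected, given total and allowed counts."""
    return (total - allowed) / total


# Request counts taken from the comment: 18522 total, 230 not forbidden.
share = forbidden_share(18522, 230)
print(f"{share:.1%} of today's requests were forbidden")
```

Together the three ranges cover 47.74.0.0 through 47.87.255.255, so an address such as 47.88.0.1 sits just outside them.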
It makes me think they're so poorly designed that they treat the reset as a temporary communication issue. I wonder if you could use this to their detriment by configuring the server to silently drop the connection rather than RSTing it. From your server's side it should look fairly similar, but from their side they actually have to spend the time putting together and sending the HTTP request before getting shut down.
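A rough sketch of that idea in Python, with caveats: a true silent drop happens at the firewall, before the TCP handshake completes, and a user-space server can't do that. What it can do is accept the connection from a blocked range, never answer, and hang up after a delay. That is a swapped-in approximation (a small tarpit), not a packet-level drop. `HOLD_SECONDS`, the port, and all the function names are assumptions for illustration.

```python
import asyncio
import ipaddress

# Ranges from the parent comment; everything else here is hypothetical.
BLOCKED = [
    ipaddress.ip_network(n)
    for n in ("47.80.0.0/13", "47.74.0.0/15", "47.76.0.0/14")
]
HOLD_SECONDS = 30.0  # assumed delay before hanging up on a blocked client


def is_blocked(addr: str) -> bool:
    ip = ipaddress.ip_address(addr)
    return any(ip in net for net in BLOCKED)


async def handle(reader, writer, hold=HOLD_SECONDS):
    """Answer normal clients; leave blocked ones waiting, then close."""
    addr = writer.get_extra_info("peername")[0]
    if is_blocked(addr):
        # Send nothing at all: the client waits until we close or it times out.
        await asyncio.sleep(hold)
    else:
        await reader.readline()  # read the request line, ignore the rest
        writer.write(
            b"HTTP/1.1 200 OK\r\n"
            b"Content-Length: 3\r\n"
            b"Connection: close\r\n\r\n"
            b"ok\n"
        )
        await writer.drain()
    writer.close()
    await writer.wait_closed()


async def main(host="127.0.0.1", port=8080):
    server = await asyncio.start_server(handle, host, port)
    async with server:
        await server.serve_forever()

# To try it locally: asyncio.run(main())
```

Each held connection costs the server an open socket for `hold` seconds, so whether this hurts the bots more than it hurts you depends on how many connections they open at once.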