Anubis is awesome! Stopping (AI)crawlbots

zoey@lemmy.librebun.com · edit-2 5 months ago

Anubis is awesome! Stopping (AI)crawlbots

BakedCatboy@lemmy.ml · 5 months ago

Fwiw Anubis is adding a nojs meta refresh challenge that if it doesn’t have issues will soon be the new default challenge

dan@upvote.au · 5 months ago

Won’t the bots just switch to using that instead of the heavier JS challenge?

Sekoia@lemmy.blahaj.zone · 5 months ago

They can, but it’s not trivial. The challenge uses a bunch of modern browser features that these scrapers don’t use, regarding metadata and compression and a few other things. Things that are annoying to implement and not worth the effort. Check the recent discussion on lobste.rs if you’re interested in the exact details.

baod_rate@programming.dev · 5 months ago

Check the recent discussion on lobste.rs if you’re interested in the exact details.

For those coming from the future: https://lobste.rs/s/aa7ske/anubis_now_supports_non_js_challenges

yetAnotherUser@discuss.tchncs.de · 5 months ago

Plus even if they were to implement those features, the challenges would still get increasingly harder the more bot-like a scraper behaves.

You can’t prevent scraping entirely but you can certainly prevent scraping that behaves like a DOS attack.

Anubis is awesome! Stopping (AI)crawlbots

Anubis is awesome! Stopping (AI)crawlbots

Incoherent rant.

Behold, Anubis.

“Weighs the soul of incoming HTTP requests to stop AI crawlers”