Drunk & Root@sh.itjust.works to Selfhosted@lemmy.worldEnglish · 1 day agoHow to combat large amounts of Ai scrapersmessage-squaremessage-square39fedilinkarrow-up193arrow-down14file-text
arrow-up189arrow-down1message-squareHow to combat large amounts of Ai scrapersDrunk & Root@sh.itjust.works to Selfhosted@lemmy.worldEnglish · 1 day agomessage-square39fedilinkfile-text
everytime i check nginx logs its more scrapers then i can count and i could not find any good open source solutions
minus-squarefubarx@lemmy.worldlinkfedilinkEnglisharrow-up8·16 hours agoIf nginx, here’s an open-source blocker/honeypot: https://github.com/raminf/RoboNope-nginx If you have it set up to be proxied or hosted by Cloudflare, they have their own solution: https://blog.cloudflare.com/declaring-your-aindependence-block-ai-bots-scrapers-and-crawlers-with-a-single-click/
minus-squareDrunk & Root@sh.itjust.worksOPlinkfedilinkEnglisharrow-up1·2 hours agoill check robonope out seems promising
minus-squareikidd@lemmy.worldlinkfedilinkEnglisharrow-up2·edit-28 hours agoI wonder why that RoboNope doesn’t just make a fail2ban entry for anything that accesses a disallowed url and drop them entirely. Actually this look like it would do something similiar, then dumps them to fail2ban after the re-access the honeypot page too many times: https://petermolnar.net/article/anti-ai-nepenthes-fail2ban/
If nginx, here’s an open-source blocker/honeypot: https://github.com/raminf/RoboNope-nginx
If you have it set up to be proxied or hosted by Cloudflare, they have their own solution: https://blog.cloudflare.com/declaring-your-aindependence-block-ai-bots-scrapers-and-crawlers-with-a-single-click/
ill check robonope out seems promising
I wonder why that RoboNope doesn’t just make a fail2ban entry for anything that accesses a disallowed url and drop them entirely.
Actually this look like it would do something similiar, then dumps them to fail2ban after the re-access the honeypot page too many times: https://petermolnar.net/article/anti-ai-nepenthes-fail2ban/