A Project to Poison LLM Crawlers

Disillusionist@piefed.world · 18 hours ago

A Project to Poison LLM Crawlers

Taldan@lemmy.world · 7 hours ago

As someone who makes and uses software, I feel it is not okay to steal source code. I wouldn’t feel okay with myself getting something for free when it’s based on the stolen work of tens of thousands of people

AI companies aren’t respecting crawler blocking. They’re actively working to ensure their crawlers bypass any anti-crawler protections

As a side note, these efforts help AI in the long-term. If we can poison LLMs, then you can guarantee a state actor can as well. AI needs to be able to weather training data attacks, otherwise they become an easily manipulated propaganda tool