Sunshine@piefed.ca to Fedibridge@lemmy.dbzer0.comEnglish · edit-28 days agoReddit will block the Internet Archivewww.theverge.comexternal-linkmessage-square7fedilinkarrow-up196arrow-down11file-textcross-posted to: [email protected][email protected][email protected]
arrow-up195arrow-down1external-linkReddit will block the Internet Archivewww.theverge.comSunshine@piefed.ca to Fedibridge@lemmy.dbzer0.comEnglish · edit-28 days agomessage-square7fedilinkfile-textcross-posted to: [email protected][email protected][email protected]
minus-squareRiskable@programming.devlinkfedilinkEnglisharrow-up20·8 days agoSo let me get this straight: Instead of wasting Reddit’s bandwidth, AI companies have been scraping the wayback machine. Because of this, Reddit is going to block the wayback machine from crawling it’s site which will ensure the AI companies crawl Reddit, directly. …Because if you think they’re suddenly going to stop crawling reddit—robots.txt be damned—you’re dreaming.
So let me get this straight: Instead of wasting Reddit’s bandwidth, AI companies have been scraping the wayback machine.
Because of this, Reddit is going to block the wayback machine from crawling it’s site which will ensure the AI companies crawl Reddit, directly.
…Because if you think they’re suddenly going to stop crawling reddit—robots.txt be damned—you’re dreaming.