damn i really hope they stay. this coming right after their spotify crawl and domain suspension doesn’t inspire hope.

  • hexagonwin@lemmy.sdf.org (OP)
    11 hours ago

    i think they mean they’ll provide direct access to data hosted by third parties (torrents?), without the captchas and throttling/rate limiting you normally hit when using the Anna’s Archive website

    they’re asking for text extraction and dedup in exchange for providing datasets. at least publicly, they claim this whole project is aimed at data preservation and wide access… they’re mostly aggregating/collecting data from other shadow libraries, and even if they have malicious(?) intent, i’d say they’re a net positive since their code and data are mostly(?) open sourced.