Researchers Scrape 2 Billion Discord Messages and Publish Them Online (www.404media.co)
Posted by cm0002@lemmy.world to Privacy@lemmy.dbzer0.com · 2 days ago · 17 comments · cross-posted to: [email protected]
cm0002@lemmy.world (OP) · 2 days ago (edited)
Well, the DOI is a digital identifier for papers and other data references for sciency stuff. But that DOI just points to the actual paper: https://www.arxiv.org/pdf/2502.00627
AnEilifintChorcra@sopuli.xyz · 1 day ago (edited)
Link to where the archive is: https://zenodo.org/records/15170676 but it's been restricted from downloading.
Note: Download access has been temporarily suspended at the request of the ICWSM program chairs.
EDIT: lol I love the Internet Archive. It's 120GiB if anyone wants to try to download it and see if it works. https://web.archive.org/web/20250521011912/https://zenodo.org/records/15170676/files/dataset.zst?download=1