- cross-posted to:
- [email protected]
- [email protected]
- [email protected]
We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB), grouped by popularity.
This release includes the largest publicly available music metadata database with 256 million tracks and 186 million unique ISRCs.
It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.



I’d wager 70% of what’s on Spotify is not worth preserving since it’s AI slop.
Interestingly enough, with the data they provide, figuring out how much of it is AI slop wouldn’t be that hard, I think.
Yeah, as with most of the internet, it’s only worth downloading anything uploaded before 2023.
So far, LLMs have done so much more harm than help.
I’m not convinced AI slop can compete with the backlog of organic slop, personally.
But yeah, a fuckton is probably slop either way.