We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB). It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.
They just need to say they are using the archive for AI training data. Then it’s legal.
Well, they do sell access to their data to train AI, so that’s a start.
They released the scraped data, though, and are openly against copyright laws.