Activist Group Claims to Have Scraped 86 Million Music Fi...
Tech Beetle briefing GB

Activist Group Claims to Have Scraped 86 Million Music Files from Spotify

Essential brief

Activist Group Claims to Have Scraped 86 Million Music Files from Spotify

Key facts

Anna’s Archive claims to have scraped 86 million music files and extensive metadata from Spotify using illicit methods.
Spotify is investigating the breach, has disabled offending accounts, and states the leak does not cover its full catalog.
The leaked music could be used to train AI models, raising concerns about copyright infringement in AI development.
The incident underscores ongoing debates over copyright law and AI training data usage, with governments considering new regulations.
Creative professionals largely oppose allowing AI companies to use copyrighted works without explicit permission.

Highlights

Anna’s Archive claims to have scraped 86 million music files and extensive metadata from Spotify using illicit methods.
Spotify is investigating the breach, has disabled offending accounts, and states the leak does not cover its full catalog.
The leaked music could be used to train AI models, raising concerns about copyright infringement in AI development.
The incident underscores ongoing debates over copyright law and AI training data usage, with governments considering new regulations.

An activist group named Anna’s Archive has announced that it scraped 86 million music files and 256 million rows of metadata from Spotify, one of the world's largest music streaming platforms.

Spotify, which hosts over 100 million tracks and serves more than 700 million users globally, confirmed that the leak does not encompass its entire catalog.

The company stated it has disabled the user accounts responsible for the unauthorized scraping and is actively investigating the incident.

According to Spotify, the third party used illicit methods to bypass digital rights management (DRM) protections to access some audio files, though the stolen music has not yet been publicly released.

Anna’s Archive, previously known for sharing pirated books, claims the scraped audio represents 99.6% of all music listened to on Spotify and intends to distribute the files via torrents to create a “preservation archive” for music.

The group frames its mission as protecting humanity’s musical heritage from threats such as natural disasters and censorship.

Experts warn that such a leak could accelerate the use of pirated music to train artificial intelligence (AI) models, a practice already common in the AI industry.

Ed Newton-Rex, a composer and copyright advocate, emphasized the need for transparency from AI companies regarding their training data.

The controversy over copyright and AI training data has intensified, with governments and creative professionals debating whether AI firms should be allowed to use copyrighted works without explicit permission.

In the UK, most respondents to a government consultation opposed proposals that would permit AI companies to use copyrighted content unless owners opt out.

The government plans to finalize AI and copyright policies by March 18, 2026.

This incident highlights ongoing tensions between protecting artists’ rights and the rapid development of AI technologies reliant on vast datasets, some of which may be obtained through unauthorized means.