GEEK HAUS
2026/06/20/the-atlantic-created-a-searchable-database-of-the

The Atlantic created a searchable database of the music used to train AI

The Verge

EDITOR BRIEF

Alex Reisner, a reporter at The Atlantic, identified four music datasets used to train AI systems and made them publicly searchable. Two contain millions of tracks, and the two smaller sets each include more than 100,000 songs. Google and Stability have cited their use of some of the datasets in research papers.

INSIGHTS

The database makes opaque AI training practices more visible, especially for artists trying to determine whether their work may have been included. It also adds pressure around music licensing and copyright as generative AI companies face growing scrutiny over training data sources.
