EPISODE · Nov 4, 2025 · 27 MIN
Remove The Beatles From The AI Training Data
from Thinking On Paper · host Mark Fielding and Jeremy Gilbertson
You're listening to AI-generated music and don't realize it. The musicians whose work trained those models know. They check their empty bank accounts daily.99,000 new songs upload to streaming platforms every day. One in five are AI-generated (Deezer). You wouldn't play a single one at your funeral.59% of musicians use AI in some aspect of their music (Ditto Music). The question: How do real musicians get paid when AI uses their work?We break down Water & Music's research on AI music attribution. Cherie Hu, Yung Spielburg, and Alexander Flores investigated what's at stake and which companies are trying to solve it.The problems:- Session musicians, producers, songwriters—how do they get paid when AI uses their beats?- Record labels can't track which training data influenced which outputs- Proving a musician's input in a model's output is nearly impossible- Copyright law wasn't built for this- Most AI music companies scraped without permission or paymentWe talk about:- How attribution technology could work (vector matching, contribution tracking)- Which companies are building payment systems- Why Suno and Udio's approach creates legal chaos- How ethical companies like Overtune license training data and split royalties- Whether streaming platforms can detect AI-generated music- What happens when the Beatles' catalog trains a modelThe stakes: If attribution fails, AI becomes theft at industrial scale. Music becomes disposable content optimized for algorithms, not humans.Based on Water & Music's research—the best reporting on AI music economics.Share with a music lover.---Research: Water & Music (Cherie Hu, Yung Spielburg, Alexander Flores)Topics: AI music, copyright, attribution, streaming, royalties, training data–(00:00) The Intersection of Music and AI(03:26) Understanding Music Attribution(03:51) Sonic Characteristics and AI Influence(06:39) The Complexity of AI Music Generation(07:36) The Value Equation in AI Music Creation(08:08) Understanding Influence Functions in Music AI(09:44) Challenges of Attribution in AI-Generated Music(11:38) Exploring Embeddings and Their Role in Music AI(14:17) Watermarking and Its Limitations in Music Attribution(15:30) Synthetic Data and Its Implications for Music AI(17:48) Innovative Solutions for Music Rights Attribution(18:01) Distinguishing Compositional vs. Recording Contributions(19:59) The Impact of AI on the Music Industry's Inequities(23:03) Trust and Technology in Music AttributionOther ways to connect with us:Listen to every podcastFollow us on InstagramFollow us on XFollow Mark on LinkedInFollow Jeremy on LinkedInRead our SubstackEmail: [email protected]
What this episode covers
You're listening to AI-generated music and don't realize it. The musicians whose work trained those models know. They check their empty bank accounts daily.99,000 new songs upload to streaming platforms every day. One in five are AI-generated (Deezer). You wouldn't play a single one at your funeral.59% of musicians use AI in some aspect of their music (Ditto Music). The question: How do real musicians get paid when AI uses their work?We break down Water & Music's research on AI music attribution. Cherie Hu, Yung Spielburg, and Alexander Flores investigated what's at stake and which companies are trying to solve it.The problems:- Session musicians, producers, songwriters—how do they get paid when AI uses their beats?- Record labels can't track which training data influenced which outputs- Proving a musician's input in a model's output is nearly impossible- Copyright law wasn't built for this- Most AI music companies scraped without permission or paymentWe talk about:- How attribution technology could work (vector matching, contribution tracking)- Which companies are building payment systems- Why Suno and Udio's approach creates legal chaos- How ethical companies like Overtune license training data and split royalties- Whether streaming platforms can detect AI-generated music- What happens when the Beatles' catalog trains a modelThe stakes: If attribution fails, AI becomes theft at industrial scale. Music becomes disposable content optimized for algorithms, not humans.Based on Water & Music's research—the best reporting on AI music economics.Share with a music lover.---Research: Water & Music (Cherie Hu, Yung Spielburg, Alexander Flores)Topics: AI music, copyright, attribution, streaming, royalties, training data–(00:00) The Intersection of Music and AI(03:26) Understanding Music Attribution(03:51) Sonic Characteristics and AI Influence(06:39) The Complexity of AI Music Generation(07:36) The Value Equation in AI Music Creation(08:08) Understanding Influence Functions in Music AI(09:44) Challenges of Attribution in AI-Generated Music(11:38) Exploring Embeddings and Their Role in Music AI(14:17) Watermarking and Its Limitations in Music Attribution(15:30) Synthetic Data and Its Implications for Music AI(17:48) Innovative Solutions for Music Rights Attribution(18:01) Distinguishing Compositional vs. Recording Contributions(19:59) The Impact of AI on the Music Industry's Inequities(23:03) Trust and Technology in Music AttributionOther ways to connect with us:Listen to every podcastFollow us on InstagramFollow us on XFollow Mark on LinkedInFollow Jeremy on LinkedInRead our SubstackEmail: [email protected]
NOW PLAYING
Remove The Beatles From The AI Training Data
No transcript for this episode yet
Similar Episodes
Apr 22, 2025 ·32m
Feb 27, 2025 ·0m
Sep 20, 2024 ·57m
Aug 7, 2024 ·16m