EPISODE · Jun 14, 2026 · 9 MIN
Why AI Models Are Now Training on Synthetic Data
from AI Business with Fexingo: Artificial Intelligence Companies, Models, and Enterprise Adoption · host Fexingo
Episode 51 of AI Business with Fexingo explores the shift from human-annotated to synthetic training data. Lucas and Luna break down a June 2026 report showing that 60% of AI model training data is now AI-generated. They discuss Anthropic's suspension of new model access in India, the KPMG hallucination scandal, and why companies like OpenAI and Meta are turning to synthetic data despite risks like model collapse. Specific examples include Microsoft's Phi-4 model and the concept of 'distillation loops.' The hosts also address the economics: synthetic data cuts annotation costs by up to 90%, but introduces new quality control challenges. Tune in for a grounded look at how AI is learning from itself. #SyntheticData #AITraining #Anthropic #KPMG #ModelCollapse #DataAnnotation #MicrosoftPhi4 #OpenAI #Meta #Distillation #AIQuality #TrainingData #Business #Technology #FexingoBusiness #BusinessPodcast #AI #June2026 Keep every episode free: buymeacoffee.com/fexingo
What this episode covers
Episode 51 of AI Business with Fexingo explores the shift from human-annotated to synthetic training data. Lucas and Luna break down a June 2026 report showing that 60% of AI model training data is now AI-generated. They discuss Anthropic's suspension of new model access in India, the KPMG hallucination scandal, and why companies like OpenAI and Meta are turning to synthetic data despite risks like model collapse. Specific examples include Microsoft's Phi-4 model and the concept of 'distillation loops.' The hosts also address the economics: synthetic data cuts annotation costs by up to 90%, but introduces new quality control challenges. Tune in for a grounded look at how AI is learning from itself. #SyntheticData #AITraining #Anthropic #KPMG #ModelCollapse #DataAnnotation #MicrosoftPhi4 #OpenAI #Meta #Distillation #AIQuality #TrainingData #Business #Technology #FexingoBusiness #BusinessPodcast #AI #June2026 Keep every episode free: buymeacoffee.com/fexingo
NOW PLAYING
Why AI Models Are Now Training on Synthetic Data
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m