EPISODE · Jul 21, 2025 · 29 MIN
AI Testing and Evaluation: Reflections
from Microsoft Research Podcast · host Researchers across the Microsoft research community
In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/
NOW PLAYING
AI Testing and Evaluation: Reflections
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Jan 2, 2026 ·47m
Dec 21, 2025 ·46m