PodParley
Are Evals Dead?

EPISODE · Sep 26, 2025 · 25 MIN

from MLOps.community · host Demetrios

AI Conversations Powered by Prosus Group

Your AI agent isn’t failing because it’s dumb; it’s failing because you refuse to test it. Chiara Caratelli cuts through the hype to show why evaluations, not bigger models or fancier prompts, decide whether agents succeed in the real world. If you’re not stress-testing, simulating, and iterating on failures, you’re not building AI; you’re shipping experiments disguised as products.

Guest speaker: Chiara Caratelli - Data Scientist @ Prosus Group

Host: Demetrios Brinkmann - Founder of MLOps Community

✌️ Connect With Us ✌️

Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore

Join our Slack community: https://go.mlops.community/slack

Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)

Sign up for the next meetup: https://go.mlops.community/register

MLOps Swag/Merch: https://shop.mlops.community/
