Software Engineering in the Age of Coding Agents: Testing, Evals, and Shipping Safely at Scale

EPISODE · Feb 10, 2026 · 57 MIN


from MLOps.community · host Demetrios

Ereli Eran is the Founding Engineer at 7AI, where he’s focused on building and scaling the company’s agentic AI-driven cybersecurity platform: developing autonomous AI agents that triage alerts, investigate threats, enrich security data, and enable end-to-end automated security operations so human teams can focus on higher-value strategic work.

Software Engineering in the Age of Coding Agents: Testing, Evals, and Shipping Safely at Scale // MLOps Podcast #361 with Ereli Eran, Founding Engineer at 7AI

Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter
MLOps GPU Guide: https://go.mlops.community/gpuguide

// Abstract
A conversation on how AI coding agents are changing the way we build and operate production systems. We explore the practical boundaries between agentic and deterministic code, strategies for shared responsibility across models, engineering teams, and customers, and how to evaluate agent performance at scale. Topics include production quality gates, safety and cost tradeoffs, managing long-tail failures, and deployment patterns that let you ship agents with confidence.

// Bio
Ereli Eran is a founding engineer at 7AI, where he builds agentic AI systems for security operations and the production infrastructure that powers them. His work spans the full stack - from designing experiment frameworks for LLM-based alert investigation to architecting secure multi-tenant systems with proper authentication boundaries. Previously, he worked in data science and software engineering roles at Stripe and VMware Carbon Black, and was an early employee of Ravelin and Normalyze.

// Related Links
Website: https://7ai.com/
Coding Agents Conference: https://luma.com/codingagents

~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~
Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore
Join our Slack community [https://go.mlops.community/slack]
Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)
Sign up for the next meetup: [https://go.mlops.community/register]
MLOps Swag/Merch: [https://shop.mlops.community/]
Connect with Demetrios on LinkedIn: /dpbrinkm
Connect with Ereli on LinkedIn: /erelieran/

Timestamps:
[00:00] Language Sensitivity in Reasoning
[00:25] Value of Claude Code
[01:54] AI in Security Workflows
[06:21] Agentic Systems Failures
[12:50] Progressive Disclosure in Voice Agents
[16:39] LLM vs Classic ML
[19:44] Hybrid Approach to Fraud
[25:58] Debugging with User Feedback
[33:52] Prompts as Code
[42:07] LLM Security Workflow
[45:10] Shared Memory in Security
[49:11] Common Agent Failure Modes
[53:34] Wrap up


