EPISODE · Jun 9, 2026 · 55 MIN
From Single-Player to Multi-Player: Operating AI Agents at Scale
from MLOps.community · host Demetrios
James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.From Single-Player to Multi-Player: Operating AI Agents at Scale // MLOps Podcast #383 with James Everingham, CEO and Co-founder of Guild.aiIn this episode, James unpacks what actually breaks when you move from a single AI agent to a fleet of them — and what engineering leaders need to build before it's too late.🎯 Single-Agent vs. Multi-Agent Systems — Why "single-player" AI workflows don't survive contact with production reality, and what the shift to multi-agent coordination actually demands from your infrastructure.🔍 The Agent Control Plane — What it is, why every engineering org needs one in 2026, and how Guild.ai is building the neutral layer to deploy, govern, and share agents across any framework or model.⚠️ Non-Determinism at Scale — Why AI agents behave like employees, not software, and why you need workforce-style governance — not just observability tooling — to manage them.💸 Token Spend & Cost Visibility — How teams running agents in production are flying blind on cost, and what Guild shows you that your current stack doesn't.🏗️ Lessons from Meta's DevMate — How Meta's AI coding agent went from experiment to submitting 50% of all diffs, and what that journey teaches every engineering leader about scaling agents safely.🚦 Agent Identity & Governance — Why every agent needs an identity, what happens when they don't have one, and how agent sprawl becomes a governance crisis fast.🔄 Sharing Agents as Infrastructure — Why Guild treats agents as shared production infrastructure rather than one-off scripts, and how that changes the economics of AI investment.🛠️ Framework Agnosticism — Why betting on a single agent framework is a losing strategy, and how to build for a multi-model, multi-framework world from day one.Essential viewing for engineering leaders, AI platform teams, and founders building production-grade agentic systems.🔗 Guild.ai: https://guild.ai🔗 James on X/Twitter: https://x.com/jevering🔗 James on LinkedIn: https://www.linkedin.com/in/jameseveringham🔗 Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/⏱️ Timestamps [00:00] Context Transfer Challenges[00:51] Control Plane for Agents[02:17] Effective Agent Policies[09:23] Agent Governance Policies[15:34] Developer Tool Adoption[22:02] Knowledge Sharing and Open Source[24:59] Simulated Deployments and Confidence[29:36] Agent Workloads vs Human Workloads[39:55] AI as a Customer[47:59] Agent Hub vs Autonomy[53:21] Wrap up#AgenticAI #AIAgents #AIEngineering
What this episode covers
James Everingham is the CEO and Co-founder of Guild.ai — the AI agent control plane for production teams. With roots at Netscape, Instagram (Head of Engineering), and Meta (Head of Dev Infra, leading a 1,000-person org), James brings rare, hard-won expertise to the challenge of operating AI agents at scale.From Single-Player to Multi-Player: Operating AI Agents at Scale // MLOps Podcast #383 with James Everingham, CEO and Co-founder of Guild.aiIn this episode, James unpacks what actually breaks when you move from a single AI agent to a fleet of them — and what engineering leaders need to build before it's too late.🎯 Single-Agent vs. Multi-Agent Systems — Why "single-player" AI workflows don't survive contact with production reality, and what the shift to multi-agent coordination actually demands from your infrastructure.🔍 The Agent Control Plane — What it is, why every engineering org needs one in 2026, and how Guild.ai is building the neutral layer to deploy, govern, and share agents across any framework or model.⚠️ Non-Determinism at Scale — Why AI agents behave like employees, not software, and why you need workforce-style governance — not just observability tooling — to manage them.💸 Token Spend & Cost Visibility — How teams running agents in production are flying blind on cost, and what Guild shows you that your current stack doesn't.🏗️ Lessons from Meta's DevMate — How Meta's AI coding agent went from experiment to submitting 50% of all diffs, and what that journey teaches every engineering leader about scaling agents safely.🚦 Agent Identity & Governance — Why every agent needs an identity, what happens when they don't have one, and how agent sprawl becomes a governance crisis fast.🔄 Sharing Agents as Infrastructure — Why Guild treats agents as shared production infrastructure rather than one-off scripts, and how that changes the economics of AI investment.🛠️ Framework Agnosticism — Why betting on a single agent framework is a losing strategy, and how to build for a multi-model, multi-framework world from day one.Essential viewing for engineering leaders, AI platform teams, and founders building production-grade agentic systems.🔗 Guild.ai: https://guild.ai🔗 James on X/Twitter: https://x.com/jevering🔗 James on LinkedIn: https://www.linkedin.com/in/jameseveringham🔗 Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/⏱️ Timestamps [00:00] Context Transfer Challenges[00:51] Control Plane for Agents[02:17] Effective Agent Policies[09:23] Agent Governance Policies[15:34] Developer Tool Adoption[22:02] Knowledge Sharing and Open Source[24:59] Simulated Deployments and Confidence[29:36] Agent Workloads vs Human Workloads[39:55] AI as a Customer[47:59] Agent Hub vs Autonomy[53:21] Wrap up#AgenticAI #AIAgents #AIEngineering
NOW PLAYING
From Single-Player to Multi-Player: Operating AI Agents at Scale
No transcript for this episode yet
Similar Episodes
Apr 21, 2026 ·13m
Apr 19, 2026 ·16m
Apr 17, 2026 ·13m
Apr 13, 2026 ·11m
Apr 11, 2026 ·16m