Building Enterprise RAG: Lessons from 2+ Years of Production Deployments episode artwork

EPISODE · Jul 1, 2025 · 37 MIN

Building Enterprise RAG: Lessons from 2+ Years of Production Deployments

from YAAP (Yet Another AI Podcast) · host AI21

Building production AI systems is hard — especially when you're pioneering entirely new categories. In this episode, Yuval speaks with Guy Becker, Group Product Manager at AI21, to trace the evolution from task-specific models to Agent planning and orchestration systems. Guy shares hard-won lessons from building some of the first RAG-as-a-service offerings when there were literally zero handbooks to follow. Key Topics: Task-specific models vs. general LLMs: Why focused, smaller models with pre and post-processing beat general purpose LLMs for business use cases. Building RAG before it was cool: Creating one of the first RAG-as-a-service platforms in early 2023 without any established patterns. The one-size-fits-all problem: Why chunking strategies, embedding models, and retrieval parameters need customization per use case. From SaaS to on-prem: Scaling deployment models for enterprise customers with sensitive data. When RAG breaks down: Multi-hop queries, metadata filtering, and why semantic search isn't always enough. Multi-agent orchestration: How AI21 Maestro uses automated planning to break complex queries into parallelizable subtasks. Production lessons: Evaluation strategies, quality guarantees, and building explainable AI systems for enterprise..

NOW PLAYING

Building Enterprise RAG: Lessons from 2+ Years of Production Deployments

0:00 37:57

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

That Hoarder: Overcome Compulsive Hoarding That Hoarder Hoarding disorder is stigmatised and people who hoard feel vast amounts of shame. This podcast began life as an audio diary, an anonymous outlet for somebody with this weird condition. That Hoarder speaks about her experiences living with compulsive hoarding, she interviews therapists, academics, researchers, children of hoarders, professional organisers and influencers, and she shares insight and tips for others with the problem. Listened to by people who hoard as well as those who love them and those who work with them, Overcome Compulsive Hoarding with That Hoarder aims to shatter the stigma, share the truth and speak openly and honestly to improve lives. The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting! DIOSA. Carolina Sanper This podcast is a sacred space created by Carolina Sanper where you connect with your inner wisdom and embody your magnetic feminine power.It is the realization that the mystical realm is where you plant the seeds of your desired reality.It is a portal to your true essence: awareness, presence, and receiving with ease. Welcome home, DIOSA. 🖤 XXX Tech by SOVRYN Dr. Brian Sovryn The crossroads between technology, sensuality, and metaphysics - and the longest running anarchist podcast in the world! Brought to you by Dr. Brian Sovryn.

Frequently Asked Questions

How long is this episode of YAAP (Yet Another AI Podcast)?

This episode is 37 minutes long.

When was this YAAP (Yet Another AI Podcast) episode published?

This episode was published on July 1, 2025.

What is this episode about?

Building production AI systems is hard — especially when you're pioneering entirely new categories. In this episode, Yuval speaks with Guy Becker, Group Product Manager at AI21, to trace the evolution from task-specific models to Agent planning and...

Can I download this YAAP (Yet Another AI Podcast) episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!