The End of RAG (with Donato Riccio) episode artwork

EPISODE · Feb 9, 2024 · 40 MIN

The End of RAG (with Donato Riccio)

from Thinking Machines: AI & Philosophy · host Daniel Reid Cahn

ML Engineer and tech writer Donato Riccio wrote an article entitled "The End of RAG?" discussing what might replace Retrieval Augmented Generation in the near future. The article was received as highly controversial within the AI echo chamber, so I brought Donato on the podcast to discuss RAG, why people are so obsessed with vector databases, and the upcoming research in AI that might replace it.Takeaways:RAG is necessary due to LLMs' limited context window and scalability issues, and the need to avoid hallucinations and outdated information.Larger/infinite context window models and linear-scaling models (e.g. RWKV, Eagle) may allow for learning through forward propagation, allowing for more efficient and effective knowledge acquisitionAgentic flows are likely far more powerful than RAG - and when they actually start working consistently, we may see the need for vector databases dramatically reduced.RAG libraries and abstracts can be helpful for getting off the ground but don't solve the hard problems in specific vertical LLM use cases.RAG vs Agents, and the complex ways that vertical AI approach RAG in practiceShare your thoughts with us at [email protected] or tweet us @slingshot_ai.

ML Engineer and tech writer Donato Riccio wrote an article entitled "The End of RAG?" discussing what might replace Retrieval Augmented Generation in the near future. The article was received as highly controversial within the AI echo chamber, so I brought Donato on the podcast to discuss RAG, why people are so obsessed with vector databases, and the upcoming research in AI that might replace it.Takeaways:RAG is necessary due to LLMs' limited context window and scalability issues, and the need to avoid hallucinations and outdated information.Larger/infinite context window models and linear-scaling models (e.g. RWKV, Eagle) may allow for learning through forward propagation, allowing for more efficient and effective knowledge acquisitionAgentic flows are likely far more powerful than RAG - and when they actually start working consistently, we may see the need for vector databases dramatically reduced.RAG libraries and abstracts can be helpful for getting off the ground but don't solve the hard problems in specific vertical LLM use cases.RAG vs Agents, and the complex ways that vertical AI approach RAG in practiceShare your thoughts with us at [email protected] or tweet us @slingshot_ai.

NOW PLAYING

The End of RAG (with Donato Riccio)

0:00 40:13

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

AI Erik's Podcast Audio Erik Conn The AI News Podcast where we talk AI. CISO Perspectives (public) N2K Networks This season on CISO Perspectives, host Kim Jones explores some of the challenges of leading through uncertainty. We explore the complexity of the changing nature of regulation and working with the federal government, the evolution of privacy and fraud, and how emerging technologies like AI and quantum computing are changing cyber. When you don’t know what questions to ask, you’re afraid to ask, or don’t know who to ask, CISO Perspectives provides the foundation for learning in this brave new world. Rich Dad's Guide to Investing II Robert T. Kiyosaki II Full Audiobook II Robert T. Kiyosaki Investing means different things to different people… and there is a huge difference between passive investing and becoming an active, engaged investor. Rich Dad’s Guide to Investing, one of the three core titles in the Rich Dad Series, covers the basic rules of investing, how to reduce your investment risk, how to convert your earned income into passive income… plus Rich Dad’s 10 Investor Controls.The Rich Dad philosophy makes a key distinction between managing your money and growing it… and understanding key principles of investing is the first step toward creating and growing wealth. This book delivers guidance, not guarantees, to help anyone begin the process of becoming an active investor on the road to financial freedom. Westenberg Joan Westenberg The Westenberg Podcast offers ideas, explainers, book notes, and reflections on technology, philosophy, and the human experience. Hosted by Joan Westenberg, each episode unpacks complex topics with clarity and depth, blending personal insights with thought-provoking analysis. It’s a space for exploring big questions and fresh perspectives in an accessible format.

Frequently Asked Questions

How long is this episode of Thinking Machines: AI & Philosophy?

This episode is 40 minutes long.

When was this Thinking Machines: AI & Philosophy episode published?

This episode was published on February 9, 2024.

What is this episode about?

ML Engineer and tech writer Donato Riccio wrote an article entitled "The End of RAG?" discussing what might replace Retrieval Augmented Generation in the near future. The article was received as highly controversial within the AI echo chamber, so I...

Can I download this Thinking Machines: AI & Philosophy episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!