Long-Context LLMs Meet RAG episode artwork

EPISODE · Oct 18, 2024 · 15 MIN

Long-Context LLMs Meet RAG

from LlamaCast · host Shahriar Shariati

📈 Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAGThis paper explores the challenges and opportunities of using long-context language models (LLMs) in retrieval-augmented generation (RAG) systems. While increasing the number of retrieved passages initially improves performance, the authors find that it eventually degrades due to the introduction of irrelevant information, or "hard negatives." To address this, the paper proposes three methods for enhancing the robustness of RAG with long-context LLMs: retrieval reordering, RAG-specific implicit LLM fine-tuning, and RAG-oriented LLM fine-tuning with intermediate reasoning. The paper also investigates the impact of various factors related to data distribution, retriever selection, and training context length on the effectiveness of RAG-specific tuning.📎 Link to paper

📈 Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAGThis paper explores the challenges and opportunities of using long-context language models (LLMs) in retrieval-augmented generation (RAG) systems. While increasing the number of retrieved passages initially improves performance, the authors find that it eventually degrades due to the introduction of irrelevant information, or "hard negatives." To address this, the paper proposes three methods for enhancing the robustness of RAG with long-context LLMs: retrieval reordering, RAG-specific implicit LLM fine-tuning, and RAG-oriented LLM fine-tuning with intermediate reasoning. The paper also investigates the impact of various factors related to data distribution, retriever selection, and training context length on the effectiveness of RAG-specific tuning.📎 Link to paper

NOW PLAYING

Long-Context LLMs Meet RAG

0:00 15:36

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

No similar episodes found.

No similar podcasts found.

Frequently Asked Questions

How long is this episode of LlamaCast?

This episode is 15 minutes long.

When was this LlamaCast episode published?

This episode was published on October 18, 2024.

What is this episode about?

📈 Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAGThis paper explores the challenges and opportunities of using long-context language models (LLMs) in retrieval-augmented generation (RAG) systems. While increasing the number...

Can I download this LlamaCast episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!