#77- Ring Attention and 1M context window, is RAG dead?
An episode of the Life with AI podcast, hosted by Filipe Lauar, titled "#77- Ring Attention and 1M context window, is RAG dead?" was published on March 7, 2024 and runs 12 minutes.
March 7, 2024 ·12m · Life with AI
Summary
Hello guys, in this episode I explain how we can scale the context window of an LLM to more than 1M tokens using Ring Attention. In the episode, I also discuss if RAG is dead or not based on these advancements in the context window. Paper Lost in the Middle: https://arxiv.org/pdf/2307.03172.pdf Gemini technical report: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf Paper Ring Attention: https://arxiv.org/pdf/2310.01889.pdf Instagram of the podcast: https://www.instagram.com/podcast.lifewithai Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Episode Description
Hello guys, in this episode I explain how we can scale the context window of an LLM to more than 1M tokens using Ring Attention. In the episode, I also discuss if RAG is dead or not based on these advancements in the context window.
Paper Lost in the Middle: https://arxiv.org/pdf/2307.03172.pdf
Gemini technical report: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf
Paper Ring Attention: https://arxiv.org/pdf/2310.01889.pdf
Instagram of the podcast: https://www.instagram.com/podcast.lifewithai
Linkedin of the podcast: https://www.linkedin.com/company/life-with-ai
Similar Episodes
Jan 15, 2025 ·15m
Jan 15, 2025 ·18m
Dec 8, 2023 ·20m
Oct 25, 2023 ·18m
Oct 21, 2023 ·19m
Sep 16, 2023 ·18m