Evaluating Retrieval Capabilities of Language Models [Microsoft]
Episode 105 of the Snacks Weekly on Data Science podcast, hosted by Pan Wu, titled "Evaluating Retrieval Capabilities of Language Models [Microsoft]" was published on September 29, 2025 and runs 10 minutes.
Summary
In this episode, we explore how to evaluate the retrieval-augmented generation (RAG) capabilities of small language models. On the business side, we discuss why RAG, long context windows, and small language models are critical for building scalable and reliable AI systems. On the technical side, we walk through the Needle-in-a-Haystack methodology and discuss key findings about retrieval performance across different models. For more details, see the tech blog post published by the Microsoft team: https://medium.com/data-science-at-microsoft/evaluating-rag-capabilities-of-small-language-models-e7531b3a5061
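For readers unfamiliar with the Needle-in-a-Haystack idea mentioned above, here is a minimal sketch of how such a test is commonly set up: a short fact (the "needle") is inserted at varying depths into filler text of varying length (the "haystack"), and the model is asked to retrieve it. This is an illustration under stated assumptions, not the setup used in the blog post. The names `build_haystack`, `run_grid`, and `toy_model`, the filler text, and the needle sentence are all hypothetical, and `toy_model` is a stand-in that only "sees" the first 200 words of its context so the example runs without any real model.

```python
from typing import Callable, List, Tuple

# Hypothetical filler text, needle, and question, chosen only for illustration.
FILLER = "The quick brown fox jumps over the lazy dog."
NEEDLE = "The magic number is 7481."
QUESTION = "What is the magic number?"
EXPECTED = "7481"


def build_haystack(filler: str, needle: str, n_words: int, depth: float) -> str:
    """Repeat the filler to n_words words and insert the needle at a relative
    depth (0.0 = very start of the context, 1.0 = very end)."""
    words = filler.split()
    repeated = (words * (n_words // len(words) + 1))[:n_words]
    pos = int(round(depth * len(repeated)))
    return " ".join(repeated[:pos] + [needle] + repeated[pos:])


def run_grid(
    ask_model: Callable[[str, str], str],
    filler: str,
    needle: str,
    question: str,
    expected: str,
    lengths: List[int],
    depths: List[float],
) -> List[Tuple[int, float, bool]]:
    """Query the model once per (context length, needle depth) pair and record
    whether the expected string appears in its answer."""
    results = []
    for n_words in lengths:
        for depth in depths:
            context = build_haystack(filler, needle, n_words, depth)
            answer = ask_model(context, question)
            results.append((n_words, depth, expected.lower() in answer.lower()))
    return results


def toy_model(context: str, question: str) -> str:
    """Stand-in for a real model call (an assumption for this sketch):
    it only 'reads' the first 200 words of the context."""
    visible = " ".join(context.split()[:200])
    return EXPECTED if EXPECTED in visible else "I don't know."


results = run_grid(toy_model, FILLER, NEEDLE, QUESTION, EXPECTED,
                   lengths=[100, 400], depths=[0.0, 0.5, 1.0])
for n_words, depth, found in results:
    print(f"length={n_words:4d} words  depth={depth:.1f}  retrieved={found}")
```

In a real evaluation, `toy_model` would be replaced by a call to the model under test, and the grid of pass/fail results is typically summarized as a heatmap over context length and needle depth.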