EPISODE · Mar 14, 2026 · 22 MIN
Why long context make AI dumber
from Build Wiz AI Show · host Build Wiz AI
Forget the needle in the haystack—can your AI actually sculpt an answer from a mountain of data? This episode explores the "Michelangelo" framework, a new evaluation that challenges models to "chisel away" irrelevant noise to reveal the latent structure hidden within massive contexts. Discover how frontier models like Gemini, GPT-4o, and Claude 3.5 Sonnet stack up in these grueling reasoning tasks and why even the "smartest" models face a sharp performance drop long before reaching the million-token mark.
What this episode covers
Forget the needle in the haystack—can your AI actually sculpt an answer from a mountain of data? This episode explores the "Michelangelo" framework, a new evaluation that challenges models to "chisel away" irrelevant noise to reveal the latent structure hidden within massive contexts. Discover how frontier models like Gemini, GPT-4o, and Claude 3.5 Sonnet stack up in these grueling reasoning tasks and why even the "smartest" models face a sharp performance drop long before reaching the million-token mark.
NOW PLAYING
Why long context make AI dumber
No transcript for this episode yet
Similar Episodes
No similar episodes found.