EPISODE · Jan 15, 2026 · 56 MIN
Why AI Leaderboards Miss the Point
from YAAP (Yet Another AI Podcast) · host AI21
Leaderboards reward “best average score.” Real users reward “answer fast, don’t hallucinate, don’t bankrupt me.” In this special deep-dive episode, AI21’s CTO Barak Lenz walks through four gaps between what models can do and what real AI systems deliver: validation, contextualization (picking the right approach per input), latency (parallelizing and stopping early), and decomposition (making those choices continuously inside long workflows). Less “best model.” More “best execution.”