EPISODE · Nov 14, 2025 · 32 MIN

The fastest agent in the race has the best evals

from The Stack Overflow Podcast

Ryan welcomes Benjamin Klieger, lead engineer at Groq, to explore the infrastructure behind AI agents, how you can turn a one-minute agent into a ten-second agent, and how the Groq team used fast inference and effective evals to build their efficient and reliable Compound agent. Episode notes: Groq delivers fast, low-cost inference using its custom-designed LPU, the first chip built for inference. Check out their agent, Compound, which can search the web and run code. Connect with Benjamin on LinkedIn and X. Congrats to user Bart Kiers for winning a Stellar Answer badge for their response to Regular expression to match a line that doesn't contain a word. See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Frequently Asked Questions

How long is this episode of The Stack Overflow Podcast?

This episode is 32 minutes long.

When was this episode of The Stack Overflow Podcast published?

This episode was published on November 14, 2025.

Can I download this episode of The Stack Overflow Podcast?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.