NVIDIA’s Open Software Trap: The Real Cost of the New Inference Stack episode artwork

EPISODE · Mar 22, 2026 · 28 MIN

NVIDIA’s Open Software Trap: The Real Cost of the New Inference Stack

from The Reasoning Show · host Massive Studios

SUMMARY: We dig into the NVIDIA GTC keynote and highlight three things - accelerated computing for everything, the complexity of the new inference stack, and NVIDIA’s “open” software stack including NemoClaw.SHOW: 1012SHOW TRANSCRIPT: The Reasoning Show #1012 TranscriptSHOW VIDEO: https://youtu.be/aXOr91q76yMSHOW SPONSORS:VENTION - Ready for expert developers who actually deliver?Visit ventionteams.comSHOW NOTES:NVIDIA GTC 2026 (Keynote)NVIDIA NemoClaw - OpenClaw + OpenShell + NVIDIA Agent ToolkitNVIDIA adds Groq LPU to their rack systemsNVIDIA to invest $26B in Open Weight ModelsInterview with Jensen about Accelerated Computing (Stratechery)Topic 1 - Jensen’s trying to paint the bigger picture of accelerated computing everywhere (robotics, autonomous driving, gen-ai, physical ai - but also just everyday enterprise apps). Everything is about keeping the stock price up, and margins high. The stock price provides the warchest to fight off all foes. Topic 2 - The inference architecture is a complex mix of GPUs, CPUs, ASICs/LPUs, high-speed networking and seems very different from the training architecture. How big is the burden on data center providers? What are the inference alternatives emerging? Topic 3 - Jensen talked a lot about OpenClaw and eventually about NVIDIA’s NemoClaw. How does his interest in Agentic AI tie into his interest in building NVIDIA’s own frontier modelFEEDBACK?Email: show @ the enterprise ai show dot comeBluesky: @EntAIShow.bsky.socialTwitter/X: @TheEntAIShowInstagram: @TheEntAIShow

NOW PLAYING

NVIDIA’s Open Software Trap: The Real Cost of the New Inference Stack

0:00 28:00

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of The Reasoning Show?

This episode is 28 minutes long.

When was this The Reasoning Show episode published?

This episode was published on March 22, 2026.

What is this episode about?

SUMMARY: We dig into the NVIDIA GTC keynote and highlight three things - accelerated computing for everything, the complexity of the new inference stack, and NVIDIA’s “open” software stack including NemoClaw.SHOW: 1012SHOW TRANSCRIPT: The Reasoning...

Can I download this The Reasoning Show episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!