Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55 episode artwork

EPISODE · May 21, 2025 · 1H 5M

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

from Cloud Foundry Weekly · host Nick Kuhn and Nicky Pike

Hot off the presses in model releases - we will explore the Qwen3-30b-a3b MoE model running on the Tanzu Platform. Early testing shows it performs exceptionally well on somewhat older enterprise-grade server CPUs (aka Cascade Lake). This show will provide some insights on how enterprises can use their existing server infrastructure to start their intelligent application modernization efforts.

NOW PLAYING

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

0:00 1:05:18

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Frequently Asked Questions

How long is this episode of Cloud Foundry Weekly?

This episode is 1 hour and 5 minutes long.

When was this Cloud Foundry Weekly episode published?

This episode was published on May 21, 2025.

What is this episode about?

Hot off the presses in model releases - we will explore the Qwen3-30b-a3b MoE model running on the Tanzu Platform. Early testing shows it performs exceptionally well on somewhat older enterprise-grade server CPUs (aka Cascade Lake). This show will...

Can I download this Cloud Foundry Weekly episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!