EPISODE · Jun 5, 2026 · 8 MIN
Why Linux AI Servers Need Real-Time Kernels Now
from The Linux Podcast with Fexingo: Open Source Operating Systems, Distros, and Server Stack · host Fexingo
Episode 32 of The Linux Podcast with Fexingo dives into a growing tension in the Linux ecosystem: AI inference at the edge and in data centers demands deterministic latency, but standard Linux kernels prioritize throughput over real-time guarantees. Lucas and Luna explore why the PREEMPT_RT patch set, merged into the mainline kernel in 2024, is suddenly getting serious attention from NVIDIA, Canonical, and Red Hat. They break down a concrete example: a self-driving car stack running on an NVIDIA Orin system-on-chip, where a jitter spike of just 10 milliseconds can mean a missed sensor fusion deadline. The episode explains how real-time Linux works under the hood, why the audio and industrial automation worlds have used it for years, and what changes when AI inference meets hard deadlines. No hype, just the architecture — and why this matters for anyone building Linux-based AI systems in 2026. #Linux #RealTimeLinux #PREEMPT_RT #AI #EdgeInference #NVIDIA #Canonical #RedHat #Orin #SelfDrivingCars #Kernel #Latency #Jitter #Technology #OpenSource #FexingoBusiness #BusinessPodcast #TechPodcast Keep every episode free: buymeacoffee.com/fexingo
What this episode covers
Episode 32 of The Linux Podcast with Fexingo dives into a growing tension in the Linux ecosystem: AI inference at the edge and in data centers demands deterministic latency, but standard Linux kernels prioritize throughput over real-time guarantees. Lucas and Luna explore why the PREEMPT_RT patch set, merged into the mainline kernel in 2024, is suddenly getting serious attention from NVIDIA, Canonical, and Red Hat. They break down a concrete example: a self-driving car stack running on an NVIDIA Orin system-on-chip, where a jitter spike of just 10 milliseconds can mean a missed sensor fusion deadline. The episode explains how real-time Linux works under the hood, why the audio and industrial automation worlds have used it for years, and what changes when AI inference meets hard deadlines. No hype, just the architecture — and why this matters for anyone building Linux-based AI systems in 2026. #Linux #RealTimeLinux #PREEMPT_RT #AI #EdgeInference #NVIDIA #Canonical #RedHat #Orin #SelfDrivingCars #Kernel #Latency #Jitter #Technology #OpenSource #FexingoBusiness #BusinessPodcast #TechPodcast Keep every episode free: buymeacoffee.com/fexingo
NOW PLAYING
Why Linux AI Servers Need Real-Time Kernels Now
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m