EPISODE · May 25, 2026 · 24 MIN
Local AI on Unraid - The Stuff Nobody Tells You
from The Uncast Show · host Unraid
Local AI on your Unraid server isn't just chatbots, it's image generation, music, voice cloning, photo recognition, automatic transcription, and so much more, all running on hardware you already own. But where do you even start? In this video I walk through the whole landscape of local AI for Unraid users. What it actually is, the two completely different types you can run, and what hardware you really need to get going. This isn't a setup guide. Think of it as your mental map of local AI, so when you do start installing things, you'll actually understand what you're doing and why.We cover generative vs predictive AI, the three reasons people run AI locally (privacy, cost, control), real working examples on my own server, and a full walkthrough of the hardware tiers from CPU to high-end GPU, including which card I'd actually recommend you buy first and why.Key Links🔗 Frigate (open source NVR with AI object detection).🔗 Frigate live demo.🔗 Speaches (Whisper + Kokoro container).🔗 Ollama.🔗 Open WebUI.🔗 ComfyUI.▶️ Related videos on the channel.🔗 A-Eye (local photo renaming with AI).🔗 Paperless AI (auto-tagging your documents).💬 Want me to make a follow-up video benchmarking all of those GPUs with proper AI benchmark software? Drop a comment below and let me know.Hardware RecsUSB Edge TPU ML Accelerator.NVIDIA RTX 3060 12GB.NVIDIA RTX 5070.⏱️ Timestamps0:00 - Cold open: what local AI can actually do2:13 - Welcome and what this video is (and isn't)2:20 - Everything you just saw was generated locally2:33 - What is local AI? Privacy, cost, control3:00 - Generative vs predictive AI explained3:25 - Frigate demo (predictive AI in action)5:02 - The Google Coral and edge AI5:22 - Quick tip: USB Coral vs PCIe Coral 5:50 - A-Eye: when predictive AI does need a GPU6:35 - Whisper + Kokoro in one container (Speaches)7:53 - Web UIs vs APIs: how local AI tools chain together8:18 - OpenClaw: my own voice assistant built on Signal9:50 - Ollama and Open WebUI for chatbots10:50 - ComfyUI for image generation11:34 - So what hardware do you actually need?12:00 - VRAM, RAM and RAM speed for AI14:02 - CPU tier15:08 - Integrated GPU tier16:07 - Dedicated GPUs: the lineup on the bench17:01 - Benchmark video CTA18:38 - Why the RTX 3060 12GB is the sweet spot20:22 - The other Nvidia cards (2060, 2080 Ti, 5070, 5090)22:06 - AMD and ROCm22:43 - Intel Arc Pro B-series for local AI23:30 - Edge AI recap24:15 - A little secret about this video...What will you build with Unraid? Get Started with Unraid in 15 minutes or less.Some of the links below are affiliate links, meaning we may earn a commission if you click through and make a purchase.
What this episode covers
Local AI on your Unraid server isn't just chatbots, it's image generation, music, voice cloning, photo recognition, automatic transcription, and so much more, all running on hardware you already own. But where do you even start? In this video I walk through the whole landscape of local AI for Unraid users. What it actually is, the two completely different types you can run, and what hardware you really need to get going. This isn't a setup guide. Think of it as your mental map of local AI, so when you do start installing things, you'll actually understand what you're doing and why.We cover generative vs predictive AI, the three reasons people run AI locally (privacy, cost, control), real working examples on my own server, and a full walkthrough of the hardware tiers from CPU to high-end GPU, including which card I'd actually recommend you buy first and why.Key Links🔗 Frigate (open source NVR with AI object detection).🔗 Frigate live demo.🔗 Speaches (Whisper + Kokoro container).🔗 Ollama.🔗 Open WebUI.🔗 ComfyUI.▶️ Related videos on the channel.🔗 A-Eye (local photo renaming with AI).🔗 Paperless AI (auto-tagging your documents).💬 Want me to make a follow-up video benchmarking all of those GPUs with proper AI benchmark software? Drop a comment below and let me know.Hardware RecsUSB Edge TPU ML Accelerator.NVIDIA RTX 3060 12GB.NVIDIA RTX 5070.⏱️ Timestamps0:00 - Cold open: what local AI can actually do2:13 - Welcome and what this video is (and isn't)2:20 - Everything you just saw was generated locally2:33 - What is local AI? Privacy, cost, control3:00 - Generative vs predictive AI explained3:25 - Frigate demo (predictive AI in action)5:02 - The Google Coral and edge AI5:22 - Quick tip: USB Coral vs PCIe Coral 5:50 - A-Eye: when predictive AI does need a GPU6:35 - Whisper + Kokoro in one container (Speaches)7:53 - Web UIs vs APIs: how local AI tools chain together8:18 - OpenClaw: my own voice assistant built on Signal9:50 - Ollama and Open WebUI for chatbots10:50 - ComfyUI for image generation11:34 - So what hardware do you actually need?12:00 - VRAM, RAM and RAM speed for AI14:02 - CPU tier15:08 - Integrated GPU tier16:07 - Dedicated GPUs: the lineup on the bench17:01 - Benchmark video CTA18:38 - Why the RTX 3060 12GB is the sweet spot20:22 - The other Nvidia cards (2060, 2080 Ti, 5070, 5090)22:06 - AMD and ROCm22:43 - Intel Arc Pro B-series for local AI23:30 - Edge AI recap24:15 - A little secret about this video...What will you build with Unraid? Get Started with Unraid in 15 minutes or less.Some of the links below are affiliate links, meaning we may earn a commission if you click through and make a purchase.
NOW PLAYING
Local AI on Unraid - The Stuff Nobody Tells You
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m