AI Inference Costs Are Crushing SaaS Gross Margins — Here's What to Do About It

EPISODE · Apr 21, 2026 · 5 MIN

from SaaS Metrics School · host Ben Murray

Is your AI SaaS company skating on thin ice because of exploding compute costs you're not tracking? In episode #365, Ben Murray tackles one of the most pressing financial challenges facing AI-first SaaS companies: the structural margin compression caused by LLM inference costs. Traditional SaaS was built on near-zero marginal cost per customer, and that era is over. If you're building on top of AI, every prompt, query, and agentic workflow is a hard COGS line that scales with revenue, and if you're not managing it, it will quietly destroy your unit economics.

- Why AI-first SaaS companies are running 50–60% gross margins (vs. 70–80% for legacy SaaS), and what Bessemer data shows about AI supernovas with margins as low as 25%
- How inference and compute costs differ fundamentally from traditional SaaS COGS, and why they won't scale down the way hosting costs did
- Why token costs vary wildly (from $1–2 per million tokens to $30–180+ for frontier models) and how that variability makes feature-level economics a CFO priority
- 5 tactical ways to reduce LLM spend: model routing, prompt caching, context compaction, semantic caching, and batch processing
- How to set up your GL accounts and COGS tracking to allocate inference costs by feature, so you actually understand the economics of what you've built

Tune in before your next board meeting, because if you're not tracking AI inference costs at the feature level, you're flying blind on your most important unit economics.

Resources Mentioned

- The SaaS CFO: https://www.thesaascfo.com/
- Ray Rike, AI to ROI Newsletter: https://ai2roi.substack.com/
- Tomasz Tunguz: https://tomtunguz.com/
- Fungies.io, 5 Ways to Save on LLM Costs: https://fungies.io
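To make the feature-level tracking idea from the episode summary concrete, here is a minimal sketch of allocating inference spend by feature and computing a per-feature gross margin. Everything in it is an illustrative assumption, not the method described in the episode: the model names, the per-million-token prices (picked from within the ranges quoted above), the feature names, and the revenue figures are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical prices in USD per million tokens (assumed for illustration only).
PRICE_PER_MILLION_TOKENS = {
    "small-model": 1.50,
    "frontier-model": 30.00,
}


@dataclass
class UsageEvent:
    feature: str  # product feature that triggered the LLM call (hypothetical names)
    model: str    # key into PRICE_PER_MILLION_TOKENS
    tokens: int   # total tokens billed for the call


def inference_cost_by_feature(events):
    """Sum inference cost in USD for each feature."""
    costs = defaultdict(float)
    for event in events:
        price = PRICE_PER_MILLION_TOKENS[event.model]
        costs[event.feature] += event.tokens / 1_000_000 * price
    return dict(costs)


def gross_margin(revenue, cogs):
    """Gross margin as a fraction of revenue."""
    return (revenue - cogs) / revenue


# Made-up monthly usage and revenue attributed to each feature.
events = [
    UsageEvent("chat_assistant", "frontier-model", 40_000_000),
    UsageEvent("chat_assistant", "small-model", 200_000_000),
    UsageEvent("doc_summary", "small-model", 100_000_000),
]
revenue_by_feature = {"chat_assistant": 3000.0, "doc_summary": 1000.0}

costs = inference_cost_by_feature(events)
margins = {
    feature: gross_margin(revenue_by_feature[feature], cost)
    for feature, cost in costs.items()
}

for feature, margin in margins.items():
    print(f"{feature}: cost=${costs[feature]:,.2f} margin={margin:.0%}")
```

The sketch counts only inference cost as COGS. A real allocation would also carry hosting, support, and other COGS lines, and would pull token counts from provider billing exports rather than hand-entered events.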
