EPISODE · Jun 13, 2026 · 25 MIN
Tokenmaxxing and the Corporate AI Pullback
from Down to Business English · host Skip Montreux
AI tools were expected to help companies work faster, spend less money, and become more productive. But what happens when employees use so much AI that costs become too high? In this episode, Skip Montreux and Dez Morgan look at tokenmaxxing — a new business problem where AI costs grow much more than expected and why some companies are reducing their AI use. They start by explaining what tokens are and why they are important. Many AI companies charge businesses based on the number of tokens their employees use. When employees use too many tokens, AI costs can increase very quickly. Skip then explains how agentic AI is different from normal AI prompts. Instead of doing one task, agentic AI can work more independently. It can search for information, make decisions, check results, and repeat tasks many times. This can be very useful, but it can also use a lot of computing power and become expensive. Next, they discuss several large companies. Uber reportedly spent its yearly AI budget in only four months, which led to strict monthly token limits for developers. Amazon stopped an internal AI leaderboard, and Microsoft canceled many internal Claude Code licenses after AI costs increased too quickly. Finally, Skip and Dez talk about the bigger business impact. Companies are no longer focusing only on how much AI employees use. Instead, they want to measure how much useful work AI produces. This idea is called Inference Yield. This change could have a big effect on AI companies, especially companies like Anthropic and OpenAI as they prepare for possible future IPOs. This episode helps listeners understand the business costs of using AI while building practical Business English skills. In this episode, you will learn: How token-based AI pricing can lead to unexpected costs for companies. Why agentic AI can use many more tokens than normal AI prompts. How companies like Uber, Amazon, and Microsoft are dealing with high AI usage. Why businesses are focusing more on useful AI results than on AI activity. How limits on AI spending could affect the future value of major AI companies. Do you like what you hear? Become a D2B Member today for to access to our -- NEW!!!-- interactive audio scripts, PDF Audio Script Library, Bonus Vocabulary episodes, and D2B Member-only episodes. Visit d2benglish.com/membership for more information. Follow Down to Business English on Apple podcasts, rate the show, and leave a comment. Contact Skip, Dez, and Samantha at [email protected] Follow Skip & Dez Skip Montreux on Linkedin Skip Montreux on Instagram Skip Montreux on Twitter Skip Montreux on Facebook Dez Morgan on Twitter RSS Feed
What this episode covers
AI tools were expected to help companies work faster, spend less money, and become more productive. But what happens when employees use so much AI that costs become too high? In this episode, Skip Montreux and Dez Morgan look at tokenmaxxing — a new business problem where AI costs grow much more than expected and why some companies are reducing their AI use. They start by explaining what tokens are and why they are important. Many AI companies charge businesses based on the number of tokens their employees use. When employees use too many tokens, AI costs can increase very quickly. Skip then explains how agentic AI is different from normal AI prompts. Instead of doing one task, agentic AI can work more independently. It can search for information, make decisions, check results, and repeat tasks many times. This can be very useful, but it can also use a lot of computing power and become expensive. Next, they discuss several large companies. Uber reportedly spent its yearly AI budget in only four months, which led to strict monthly token limits for developers. Amazon stopped an internal AI leaderboard, and Microsoft canceled many internal Claude Code licenses after AI costs increased too quickly. Finally, Skip and Dez talk about the bigger business impact. Companies are no longer focusing only on how much AI employees use. Instead, they want to measure how much useful work AI produces. This idea is called Inference Yield. This change could have a big effect on AI companies, especially companies like Anthropic and OpenAI as they prepare for possible future IPOs. D2B 414 explains how tokenmaxxing has become a serious warning sign for companies using AI at scale. What begins as a story about developers using too many tokens quickly becomes a larger question about budgets, productivity, return on investment (ROI), and whether AI tools are creating enough useful work to justify their cost. Do you like what you hear? Become a D2B Member today for to access to member-only episodes, our -- NEW!!! -- INTERACTIVE AUDIO SCRIPTS, PDF Audio Script Library, Bonus Vocabulary episodes, and D2B Member-only episodes. Visit d2benglish.com/membership for more information.
NOW PLAYING
Tokenmaxxing and the Corporate AI Pullback
No transcript for this episode yet
Similar Episodes
Mar 26, 2026 ·1m
Mar 19, 2026 ·34m
Feb 18, 2026 ·11m
Feb 11, 2026 ·45m