Tokenmaxxing and the Corporate AI Pullback

from Down to Business English · host Skip Montreux

AI tools were expected to help companies work faster, spend less money, and become more productive. But what happens when employees use so much AI that costs become too high? In this episode, Skip Montreux and Dez Morgan look at tokenmaxxing — a new business problem where AI costs grow much more than expected and why some companies are reducing their AI use. They start by explaining what tokens are and why they are important. Many AI companies charge businesses based on the number of tokens their employees use. When employees use too many tokens, AI costs can increase very quickly. Skip then explains how agentic AI is different from normal AI prompts. Instead of doing one task, agentic AI can work more independently. It can search for information, make decisions, check results, and repeat tasks many times. This can be very useful, but it can also use a lot of computing power and become expensive. Next, they discuss several large companies. Uber reportedly spent its yearly AI budget in only four months, which led to strict monthly token limits for developers. Amazon stopped an internal AI leaderboard, and Microsoft canceled many internal Claude Code licenses after AI costs increased too quickly. Finally, Skip and Dez talk about the bigger business impact. Companies are no longer focusing only on how much AI employees use. Instead, they want to measure how much useful work AI produces. This idea is called Inference Yield. This change could have a big effect on AI companies, especially companies like Anthropic and OpenAI as they prepare for possible future IPOs. This episode helps listeners understand the business costs of using AI while building practical Business English skills. In this episode, you will learn: How token-based AI pricing can lead to unexpected costs for companies. Why agentic AI can use many more tokens than normal AI prompts. How companies like Uber, Amazon, and Microsoft are dealing with high AI usage. Why businesses are focusing more on useful AI results than on AI activity. How limits on AI spending could affect the future value of major AI companies. Do you like what you hear? Become a D2B Member today for to access to our -- NEW!!!-- interactive audio scripts, PDF Audio Script Library, Bonus Vocabulary episodes, and D2B Member-only episodes. Visit d2benglish.com/membership for more information. Follow Down to Business English on Apple podcasts, rate the show, and leave a comment. Contact Skip, Dez, and Samantha at [email protected] Follow Skip & Dez Skip Montreux on Linkedin Skip Montreux on Instagram Skip Montreux on Twitter Skip Montreux on Facebook Dez Morgan on Twitter RSS Feed

What this episode covers

AI tools were expected to help companies work faster, spend less money, and become more productive. But what happens when employees use so much AI that costs become too high? In this episode, Skip Montreux and Dez Morgan look at tokenmaxxing — a new business problem where AI costs grow much more than expected and why some companies are reducing their AI use.  They start by explaining what tokens are and why they are important. Many AI companies charge businesses based on the number of tokens their employees use. When employees use too many tokens, AI costs can increase very quickly. Skip then explains how agentic AI is different from normal AI prompts. Instead of doing one task, agentic AI can work more independently. It can search for information, make decisions, check results, and repeat tasks many times. This can be very useful, but it can also use a lot of computing power and become expensive. Next, they discuss several large companies. Uber reportedly spent its yearly AI budget in only four months, which led to strict monthly token limits for developers. Amazon stopped an internal AI leaderboard, and Microsoft canceled many internal Claude Code licenses after AI costs increased too quickly. Finally, Skip and Dez talk about the bigger business impact. Companies are no longer focusing only on how much AI employees use. Instead, they want to measure how much useful work AI produces. This idea is called Inference Yield. This change could have a big effect on AI companies, especially companies like Anthropic and OpenAI as they prepare for possible future IPOs. D2B 414 explains how tokenmaxxing has become a serious warning sign for companies using AI at scale. What begins as a story about developers using too many tokens quickly becomes a larger question about budgets, productivity, return on investment (ROI), and whether AI tools are creating enough useful work to justify their cost. Do you like what you hear? Become a D2B Member today for to access to member-only episodes, our -- NEW!!! -- INTERACTIVE AUDIO SCRIPTS, PDF Audio Script Library, Bonus Vocabulary episodes, and D2B Member-only episodes. Visit d2benglish.com/membership for more information.

NOW PLAYING

Tokenmaxxing and the Corporate AI Pullback

0:00 25:16

1×

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

Share this episode

Similar Episodes

I'm ok

Mar 26, 2026 ·1m

Food Saved My Life

Mar 19, 2026 ·34m

Eat More Vegetables: The 4 Foods That Beat Ozempic (Naturally)

Feb 18, 2026 ·11m

How to End Heart Disease with Dr. Fuhrman

Feb 11, 2026 ·45m

Revolutionizing Breast Health: QT Imaging, Overdiagnosis, and What to Do Instead

Jan 27, 2026 ·35m

REMIX: Why we over-shop and compulsively acquire, and how to stop, with Dr Jan Eppingstall

Jan 9, 2026 ·61m

Similar Podcasts

MG Show MG Show The MG Show, hosted by Jeffrey Pedersen and Shannon Townsend, is a leading alternative media platform dedicated to uncovering the truth behind today’s most pressing political issues. Launched in 2019, the show has grown exponentially, offering unfiltered insights, comprehensive research, and real-time analysis. With a commitment to independent journalism and factual integrity, the MG Show empowers its audience with knowledge and encourages active participation in the political discourse. Breaking News Show | eTurboNews Juergen Thomas Steinmetz News is relevant to the global travel and tourism industry, human rights and global issues.Breaking news when it happens and only from the source. Eat to Live Jenna Fuhrman, Dr. Fuhrman Our health is our most precious gift and smart nutrition can change your life. Each month, join Dr. Fuhrman and his daughter, Jenna Fuhrman as they discuss important topics in the world of nutrition. Eat to Live will change the way you eat and think about food. French Your Way Jessica: Native French teacher founder of French Your Way Boost your French listening skills and test your comprehension with this one of a kind series of podcasts. Get the chance to listen to a real conversation between native speakers talking at normal speed AND customise your learning experience through carefully designed sets of questions (2 levels of difficulty) available for download at www.frenchvoicespodcast.com. All interviews also come with the transcript. French teacher Jessica interviews native speakers of French from around the world who share a bit of their life and passion. Where else would you meet in one same place a French yoga teacher based in Melbourne, a soap manufacturer from Provence, or a couple cycling around the world?

Frequently Asked Questions

How long is this episode of Down to Business English?

This episode is 25 minutes long.

When was this Down to Business English episode published?

This episode was published on June 13, 2026.

What is this episode about?

Can I download this Down to Business English episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.

URL copied to clipboard!