Tech

Tech Giants Face AI Token Budget Crisis as Costs Outpace Efficiency Gains

As autonomous agents drive token consumption up 18.6 times in nine months, enterprises are pivoting from rapid adoption to strict cost governance, prompting a new market for spend management tools and standardised metrics.

Author
Owen Mercer
Markets and Finance Editor
Published
Draft
Source: TechCrunch · original
The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
Uber, Microsoft, and Priceline among firms exceeding 2026 forecasts by April; Linux Foundation launches standards body to tame runaway spending

Major technology companies, including Uber, Microsoft, and Priceline, are grappling with significant budget overruns driven by escalating AI token consumption, marking a sharp pivot from the industry’s previous focus on rapid adoption to urgent cost control. Despite a decline in per-token prices, the proliferation of autonomous AI agents has caused usage volumes to surge, with some organisations exceeding their entire 2026 budgets by April. This financial strain has forced a recalibration of corporate strategy, as leaders move away from unlimited experimentation towards implementing guardrails and auditing expenditures.

The scale of the overspend has been stark, with Uber exhausting its full 2026 AI coding budget by April and Microsoft revoking developers’ Claude Code licenses months after initially enabling them. At Priceline, a routine contract renewal for the coding assistant Cursor returned at four to five times the expected cost. The situation has become so severe that one company reportedly incurred a $500 million bill for Claude after failing to set usage limits for employees, highlighting the risks of unmanaged access to powerful models.

Data from engineering management platforms underscores the disconnect between spending and productivity. A March survey by Faros AI of 20,000 developers found that while output was rising, so were bugs and rewrites. Similarly, Jellyfish data indicated that engineers using the most tokens were only about twice as productive as their peers but consumed 10 times the number of tokens to achieve that result. Nicholas Arcolano, head of research at Jellyfish, noted that per-developer AI consumption rose approximately 18.6 times over a nine-month period, largely due to agentic features.

In response to this opacity and financial pressure, the Linux Foundation is establishing the Tokenomics Foundation to create standardised metrics and frameworks for AI token economics. The initiative aims to instil the same cost discipline around AI tokens that FinOps practices brought to cloud spending. J.R. Storment, executive director of the FinOps Foundation, reported hearing from companies in April and May that they were already three times over their 2026 token budgets. The foundation plans a formal launch in July and will announce additional members at the upcoming FinOps X conference.

A market for AI spend management tools is rapidly emerging to meet enterprise demand for visibility and auditability. Vendors such as Pay-i, Paid, and existing players like Datadog and New Relic are introducing services to track token-level usage and optimise model selection. Salesforce chief availability officer Nishant Gupta described token economics as fundamentally more abstract and opaque than previous cloud cost management challenges, requiring a different operational approach. With Goldman Sachs projecting global token usage to multiply by 24 times by 2030, the industry is racing to build the necessary infrastructure to manage this exponential growth.

Continue reading

More from Tech

Read next: Apple opens developer access to iOS, iPadOS and macOS 27 betas
Read next: Apple confirms macOS 27 Golden Gate requires Apple Silicon, ending Intel support
Read next: Apple unveils watchOS 27 with Siri AI integration and hardware restrictions