Why AI token prices are about to plummet

9 hours ago 11

Nvidia CEO Jensen Huang shows off the company's Blackwell system on stage.

Nvidia CEO Jensen Huang shows off the company's Blackwell system on stage. Ann Wang/REUTERS

I had lunch with the CEO of an AI infrastructure company recently. I can't tell you their name, but they said something that really caught my attention: There will be a crop of new AI models later this year that will be a lot better and more efficient.

This will likely make AI tokens more abundant and radically cheaper. (Tokens are the basic units models use to process information, and the standard way AI use is measured and priced).

Hand-wringing about tokenmaxxing could die down. Or, users could go on another bender and burn even more tokens with abandon.

Either way, the price of tokens is probably about to plummet. This is why we already see some AI model providers slashing prices, and other players talking about doing so.

OpenAI CEO Sam Altman recently said AI costs had become a huge issue, adding that the startup will have "a lot of ways we can help people get more value for less spend."

This trend may already be showing up in the data. A closely watched token spending index run by Silicon Data peaked at around 2.06 in late May and fell to 1.75 as of June 10.

Carmen Li, the CEO of Silicon Data, told me this could mean token prices are dropping across many AI models.

Blackwell finally emerges

The main force driving token prices lower is a new wave of technology that's sweeping through AI data centers.

Nvidia's Blackwell GPUs are being installed in huge volumes right now. By the second half of this year, these systems, which are really supercomputers rather than chips, will be operating at scale, helping AI labs train new models and run them more efficiently.

These systems took a while to install properly, partly because they needed to be water-cooled and required other gnarly new data center setups. But the payoff could be huge.

50 x more, 35 x cheaper

SemiAnalysis, a respected AI research firm, compared Nvidia's top Blackwell system, the GB 300 NVL72, to Nvidia's previous system, called the Hopper HGX 200.

With the older system, each GPU generated 90 tokens per second, while the new Blackwell system generated 6,000. That's 65 times more.

These systems consume massive amounts of electricity, and the newer Blackwell offerings use even more. So SemiAnalysis also looked at how many tokens each system generated per megawatt. On this measure, Hopper churned out 54,000 tokens per second, while Blackwell generated 2.8 million. 50 times more.

Electricity prices are rising, due to all these energy-sipping AI data centers. So these days, GPU systems are assessed based on how much it costs to generate one million tokens.

SemiAnalysis tested this, too, and found that the older Hopper system cost $4.20 for every million tokens. The Blackwell system cost 12 cents. That's 35 times cheaper.

Again, new AI models will be increasingly trained and run on these new Blackwell systems as 2026 progresses. This is very likely to produce a massive increase in the number of cheaply-generated tokens.

This is why AI model providers will probably slash token prices: Because they can.

Sign up for BI's Tech Memo newsletter here. Reach out to me via email at [email protected].

Read next

Alistair Barr is the author of Business Insider's Tech Memo newsletter. Sign up here. Before that, he was BI's Global Tech Editor and the Big Tech team leader at Bloomberg, following a reporting career at The Wall Street Journal, USA Today, Reuters, and MarketWatch. Alistair won a Gerald Loeb Award in 2007 for coverage of short selling and was a finalist in 2013 for scoops on the Facebook IPO. More recently, he won a 2024 San Francisco Press Club award for commentary. Got a tip? Reach out using the secure messaging app Signal (+1 415-341-4927) or via email on [email protected].ExpertiseAlistair oversees all things Big Tech, along with startups and venture capital. He writes analysis and columns about topics including generative AI, large language models, cloud computing, semiconductors, online search, e-commerce, EVs, robotics, and autonomous vehicles.Popular StoriesArtificial Intelligence:It's getting harder to make big leaps at the frontier of AIOpenAI's AI-adjusted earnings numbers have echoes of Groupon and WeWorkDeath by LLM: Stack Overflow's decline, and its plan to survive, shows the future of free online data in an AI worldCloud computing:Amazon dominated the first cloud era. The AI boom has kicked off Cloud 2.0, and the company doesn't have a head start this time.In cloud, there's AI (which is hot) and everything else (which is not)Chips:Why Intel is still so important: Real countries have fabsApple's made-in-the-USA chips signal a turnaround for the US's big semiconductor betEVs and Tesla:Tesla's AI supercomputer has a Silicon Valley town rushing to meet surging electricity demandTesla's Cybertruck is outselling almost every other EV in the USOnline Search:Google is losing its status as a verbA simple way to fix search: Bright pink ads

Read Entire Article
| Opini Rakyat Politico | | |