• Some users have recently had their accounts hijacked. It seems that the now defunct EVGA forums might have compromised your password there and seems many are using the same PW here. We would suggest you UPDATE YOUR PASSWORD and TURN ON 2FA for your account here to further secure it. None of the compromised accounts had 2FA turned on.
    Once you have enabled 2FA, your account will be updated soon to show a badge, letting other members know that you use 2FA to protect your account. This should be beneficial for everyone that uses FSFT.

The AI hardware crunch: CPUs join the chip shortage

MrGuvernment

Fully [H]
2FA
Joined
Aug 3, 2004
Messages
23,936
Not surprised.. everything will be in shortage when it comes to what is used...

The AI hardware crunch: CPUs join the chip shortage
https://qz.com/ai-cpu-shortage-2026

For three years, the defining hardware story of artificial intelligence has been a GPU shortage. Companies hoarded Nvidia $NVDA -1.34%'s graphics processors like gold bars, willing to pay almost anything for the chips that power AI training. Entire business strategies revolved around who could get access and who couldn't.
Now a second shortage is emerging alongside it, and this one has caught the industry genuinely off guard. The humble CPU, the generalist processor that has powered computers for decades, is suddenly in desperately short supply.
Intel $INTC -5.25% warned Chinese customers in recent weeks that delivery lead times for some server CPUs have stretched to six months. AMD -3.34% has pushed its own lead times to eight to ten weeks. Server CPU prices in China have jumped more than 10 percent. In the US and Europe, PC prices are creeping upward too, as chipmakers divert manufacturing capacity from consumer products toward data centers hungry for more processors.

Intel's CFO David Zinsner admitted during the company's January earnings call that demand had blindsided them. Six months ago, he said, every major cloud customer was signaling they would need more powerful CPUs but not necessarily more of them. That forecast turned out to be wrong. Unit demand surged through the second half of 2025, and Intel now finds itself running its fabs “hand to mouth,” shipping processors as fast as they come off the line.

The agent problem​

The explanation sits at the intersection of two trends that converged faster than anyone in the semiconductor supply chain anticipated.
The first is straightforward. Microsoft $MSFT -0.32% ended support for Windows 10 last October, triggering a wave of PC upgrades. Many of those buyers opted for cheaper machines running older Intel chips rather than the pricier AI-enabled PCs that Intel and Microsoft had been pushing. That created unexpected demand for processors Intel had been winding down.
The second trend is more structurally interesting. The AI industry is shifting from building chatbots to deploying autonomous software agents, and this shift is fundamentally changing the ratio of hardware that data centers need.
When you ask ChatGPT a question, the CPU does very little. It converts your text into tokens, hands them to the GPU for processing, and converts the answer back. The GPU does perhaps 90 percent of the work.
But agentic AI systems behave differently. They plan, execute multi-step tasks, call APIs, query databases, write and run code, coordinate dozens of sub-processes, and evaluate whether they succeeded before starting over. All of that work happens on CPUs.
AMD CEO Lisa Su put a finer point on it during her earnings call. The more autonomous AI agents become, she said, the more they depend on the oldest, least glamorous chip in the server rack.
Translation: the server CPU market is about to have a very good year. Su predicted strong double-digit growth in 2026.

Squeezing the wrong chips​

Intel and AMD are short for entirely different reasons, which makes the problem harder to solve.
Intel has been struggling with manufacturing yields at its own fabrication plants, limiting how many usable chips it can produce from each silicon wafer. The company is investing in new tools and reallocating capacity from PC chips to server chips, but the improvements won't arrive until later this year at the earliest.
AMD doesn't make its own chips. It relies on TSMC $TSM -4.41% in Taiwan, the world's most advanced contract manufacturer. But TSMC is prioritizing its most advanced production lines for higher-margin AI accelerators and GPUs, leaving less room for CPU orders. TSMC's chairman has acknowledged that the company can only produce about a third of what its biggest customers want.
Meanwhile, a global memory chip shortage is making everything worse. When memory prices started climbing late last year, customers rushed to lock in CPU purchases too, hoping to assemble complete server systems before costs spiraled further. That panic buying deepened the CPU backlog.
Nvidia, sensing opportunity, is pushing aggressively into the market. Its server CPUs are already deployed at scale inside Meta $META -2.34%'s data centers for workloads that don't require a GPU at all, and a next-generation chip designed for agentic reasoning arrives next year. Jensen Huang said in January he sees Nvidia becoming a major CPU producer.
The irony is that the technology most likely to be affected by the CPU shortage is AI itself. Companies racing to deploy agents may find that the bottleneck isn't the expensive GPU they fought so hard to secure but the cheap, unglamorous processor they assumed would always be available.
 
Oh FFS! Who's buying cpus if you can't even get the ram for them?

More like every industry finding excuses to hike up prices

The same guys who are buying all of the RAM are buying the CPUs; datacenters. What you'll likely see is a diversion of silicon that would have made, say, a 9800x3D, over to silicon to make CPUs (or other components) that go into datacenters.
 
The same guys who are buying all of the RAM are buying the CPUs; datacenters. What you'll likely see is a diversion of silicon that would have made, say, a 9800x3D, over to silicon to make CPUs (or other components) that go into datacenters.
Only 70mm of tsmc 4 with a bit of old tsmc 6 to make one of those, it is under msrp on newegg and by a good amount on amazon, maybe the effect of the shift will appear on those eventually, but with ram price naturally reducing their demands, they could stay available and at msrp.

So what happens to all the unsold datacenter chips in inventory when the bubble crashes?
always a risk for costly product that can take ~5/6 months to make, they are probably all pre-sold to customer but they can always go bankrupt for the non google type. They would have made good money for years until loosing some on the last batch.

CPU are generalist device that can be used for many things, they could find someone at the right price for something else.
 
So what happens to all the unsold datacenter chips in inventory when the bubble crashes?

There's a fire sale and someone is left holding the bag. The only thing we can safely say is the bag holder won't be TSMC. Post covid, they got screwed over with customers making strategic reservations on capacity and then cancelling. They aren't doing that this time. It's going to be rough on the RAM makers as when they get stuck with cancelled delivery on HBM ram, it's not like they can sell it to us in a fire sale. And while they are getting premium prices, it's not like NVIDIA where it can be cut in third and still be worth more than they absolutely need to sell it for.

We KNOW that many of these orders are strategic in order to limit availability to competitors. We also know a lot of the buying is on borrowed money. And nobody is really making a profit yet, much less enough of one to cover the debt they'd need to finish what is currently in the pipeline.

We'll probably wind up with a lot fewer AI companies, lots of bigger companies buying up assets at bankruptcy pricing, the suppliers who cut us plebes off scrambling for revenue, and ultimately less hardware suppliers overall.
 
So what happens to all the unsold datacenter chips in inventory when the bubble crashes?

A lot of the guys running the AI companies are saying they don't care because they'll use the compute for something else.
 
A lot of the guys running the AI companies are saying they don't care because they'll use the compute for something else.
Curious how many secondary markets exist when everyone wants the newest hotness for their models.
 
Curious how many secondary markets exist when everyone wants the newest hotness for their models.

Eh tons of homelabbers run older stuff - can still see v3/v4/v5/v6 Xeons being unloaded online etc

My v2 Xeon I'm using in my HTPC ATM (haven't really been seeing these being unloaded since like 2019/2020) is at the end of the line for gaming but still capable there (though again this is the last gen it can be any sort of bearable/tolerable for gaming as it at the worst of times can only hit 30FPS but then with framegen can hit '60FPS' still) - but when it comes to running VMs in Win Server 2025 it's still fine and 'zippy' (have an image for that I go back and forth with vs the HTPC image to play around with)
 
Curious how many secondary markets exist when everyone wants the newest hotness for their models.

Google says they’re running 8 year old hardware at 100% right now, so no, companies aren’t simply writing off and discarding things every generation as some seem to want to insist when they want to claim things like accounting fraud predicated on a shorter-than-reality depreciation cycle.

https://www.datacenterdynamics.com/...old-hardware-iterations-have-100-utilization/
 
If AMD want to sales me cheap epyc platform and CPU I could be tempted....

but GPT 5.4 let me feel that it could never really pop, agents make it both quite commercially useful and easy to burn giant amount of tokens per minutes 24 hours a day.

the AI hype was a bit 1 year ahead of reality, 2025 was not the year of the agents, 2026 is, 2026 will not be the year of massive amount of real world robots that run advanced AI but late 2027/early 2028 could be, which will explode multi-modal use and commercial value.

IT would take quite the efficacy explosion or agents having issues working together on a problem (they seem not bad at scaling and co-working right now) to have too much compute anytime soon
 
Eh tons of homelabbers run older stuff - can still see v3/v4/v5/v6 Xeons being unloaded online etc

My v2 Xeon I'm using in my HTPC ATM (haven't really been seeing these being unloaded since like 2019/2020) is at the end of the line for gaming but still capable there (though again this is the last gen it can be any sort of bearable/tolerable for gaming as it at the worst of times can only hit 30FPS but then with framegen can hit '60FPS' still) - but when it comes to running VMs in Win Server 2025 it's still fine and 'zippy' (have an image for that I go back and forth with vs the HTPC image to play around with)
I'm selling most of my DDR4 hardware and for homelab stuff and going back to DDR3. v2 Xeons are just as good for my purposes. The lower latency DDR3 can actually be better in some cases I think.

If it gets bad enough I might buy one of those chinese dual x99 DDR3 boards and throw some V4 Xeons in it with 512GB DDR3..... but it's gotta get REAL bad before I do that.
 
I'm selling most of my DDR4 hardware and for homelab stuff and going back to DDR3. v2 Xeons are just as good for my purposes. The lower latency DDR3 can actually be better in some cases I think.

If it gets bad enough I might buy one of those chinese dual x99 DDR3 boards and throw some V4 Xeons in it with 512GB DDR3..... but it's gotta get REAL bad before I do that.

I'm a big proponent of buying gear + running it into the ground/repurposing for as long as you can/it makes sense - aside from the fact I mentioned my v2 system is on its last legs in terms of gaming now - the 10w Intel N150 embedded chips are ALMOST equal to it - next ASROCK consumer board that comes out with a 10w embedded Intel chip that exceeds it will get me to finally upgrade to/for that (will become my new NAS/server replacing my Synology 918+ which will in turn become redundant backup cold storage - and my current 5950x rig will take the place of the v2 system as a HTPC/test and mess around system) - then depending on how that goes/how many cores it has/how more powerful it is - I'll either keep the v2 setup for VMs or repurpose as a legacy system in the rack (runs Windows XP natively/has drivers for it).
 
Last edited:
I'm a big proponent of buying gear + running it into the ground/repurposing for as long as you can/it makes sense - aside from the fact I mentioned my v2 system is on its last legs in terms of gaming now - the 10w Intel N150 embedded chips are ALMOST equal to it - next ASROCK consumer board that comes out with a 10w embedded Intel chip that exceeds it will get me to finally upgrade to/for that (will become my new NAS/server replacing my Synology 918+ which will in turn become redundant backup cold storage - and my current 5950x rig will take the place of the v2 system as a HTPC/test and mess around system) - then depending on how that goes/how many cores it has/how more powerful it is - I'll either keep the v2 setup for VMs or repurpose as a legacy system in the rack (runs Windows XP natively/has drivers for it).
The system that I just swapped out was a v3 Xeon - I swapped to a v2 and performance wise it's easily as good. I gotta get the idle power consumption down with BIOS settings but it's great so far.
 
Back
Top