The Hidden Scale of AI: Why Power, Memory and Networks Now Dwarf GPU Spending

Expectations for AI infrastructure spending keep climbing. Analysts had pegged hyperscaler capital expenditures for 2026 at roughly $600 billion. After the latest earnings reports, that figure jumped to about $750 billion. Growth now sits at 67 percent year over year. Projections for 2027 already flirt with $1 trillion. Yahoo Finance laid out the revised numbers in early June.

Those sums represent more than chips. They cover land, substations, miles of fiber, advanced cooling loops and specialized memory stacks. The original TechRadar analysis captured this shift early. GPUs launched the boom. Everything else now dictates its limits. TechRadar made the case three hours ago that CPUs, memory bandwidth, networking fabric and workflow orchestration define the next phase.

Data center electricity demand surged 17 percent in 2025. AI-focused facilities grew even faster. Global electricity use rose just 3 percent in comparison. The International Energy Agency expects total data center consumption to double by 2030 while AI-specific demand triples. IEA released the update in April. Tech companies signed 40 percent of all corporate renewable power purchase agreements last year. Yet that pace cannot close the gap alone.

Power has become the binding constraint. A single modern AI rack draws 150 kilowatts today. Nvidia’s upcoming Rubin architecture pushes toward 300 kilowatts. Future designs could hit one megawatt per rack. That equals the electricity for roughly 750 homes. Traditional CPU-based data centers ran at 25 to 40 kilowatts per rack. The density jump forces complete redesigns of power delivery, cooling and building layouts. Bloomberg examined the physical reckoning in a June feature.

Thirty percent of power supplied to these facilities gets lost to cooling, voltage conversion and transmission inside the building. Engineers chase incremental gains. Liquid cooling delivers 15 percent better efficiency and trims emissions 10 percent. Sidecar AC-DC converters save space and boost efficiency another 20 percent. Solid-state transformers improve performance 27 percent. Some teams target 800-volt direct current architectures by 2030 to drop internal losses below one percent. Nvidia has invested in efficiency startups such as Emerald AI to accelerate progress.

But. Hardware tweaks only buy time. The real conversation has moved to nuclear power. The pipeline of conditional offtake agreements for small modular reactors jumped from 25 gigawatts at the end of 2024 to 45 gigawatts now. Tech giants pursue direct stakes in reactor developers. They sign long-term deals to restart retired plants. Onsite natural gas generation expands in the United States while battery storage grows critical for smoothing demand spikes.

Hyperscalers spent more than $400 billion on capital projects in 2025. That total climbs 75 percent this year. Five companies alone — Amazon, Microsoft, Alphabet, Meta and Oracle — account for the bulk. Oracle’s AI-related capital expenditure hit $50 billion with a 76 percent capex-to-revenue ratio. These outlays dwarf previous technology buildouts. They rival the GDP of mid-sized nations.

Memory emerged as the next choke point. High-bandwidth memory suppliers report explosive orders. Training clusters demand ever-larger shared memory pools measured in petabytes. Inference workloads shift emphasis toward fast access and low latency rather than raw floating-point operations. CPUs regain relevance for orchestration, scheduling and agentic systems that coordinate multiple models. Morgan Stanley sized the incremental CPU opportunity at $32.5 billion to $60 billion by 2030 within a broader server CPU market exceeding $100 billion.

Networking fabric receives equal attention. Google detailed its Virgo network in April. The collapsed fabric design quadruples bandwidth over prior generations and removes much of the scaling penalty. One data center can now link 134,000 TPUs or 80,000 Nvidia GPUs with near-linear efficiency. Clusters spanning multiple sites reach one million accelerators. Google Cloud Blog described the architecture.

Land and grid connections add further friction. Securing sites with adequate transmission capacity can take years. Utilities cite five-to-ten-year lead times for new substations. Some hyperscalers explore co-location next to power plants. Others fund transmission upgrades directly. Private equity joins the fray. KKR, Nvidia, Vistra and the Kuwait Investment Authority committed more than $10 billion in June to Helix Digital Infrastructure. The vehicle coordinates data centers, power procurement and connectivity for hyperscalers. The Wall Street Journal reported the launch. Reuters covered the same deal days later.

China pursues its own centralized push. Beijing plans a $295 billion five-year project to knit data centers into a national compute backbone. The effort relies on sovereign debt and state-backed funds. It contrasts with the private-sector frenzy in the United States where construction spending on data centers topped $50 billion in a single month this spring. Bloomberg compared the two approaches in mid-June.

Stephen Balaban, co-founder and CTO of Lambda, described the situation bluntly in a recent podcast. Most leadership teams at neoclouds believe the industry remains massively underbuilding. Standing up a gigawatt-scale AI factory requires far more than racks of GPUs. Land, reliable power measured in billions of watts, sophisticated cooling, ultra-low-latency networking and novel financing structures all matter. The conversation has shifted from token generation to full AI factories. The MAD Podcast with Matt Turck featured his remarks one week ago.

Enterprise adoption reveals another layer. Early experiments with generative tools produced impressive drafts yet carried hidden costs. Microsoft dialed back certain Claude Code licenses that ran $500 to $2,000 per engineer each month. Uber rolled the same tool to 5,000 engineers, generated 70 percent of committed code in some projects, then exhausted its AI coding budget inside four months. These episodes illustrate why unstructured AI usage proves expensive. Agentic workflows that perform actual business processes demand clean data pipelines, governance layers, integration frameworks and monitoring systems. The infrastructure conversation now includes those elements too. TechRadar returned to this theme in its latest perspective.

Financial markets have noticed the broadening opportunity. Flash memory provider SanDisk shares rose 464 percent this year. Bloom Energy, a fuel-cell specialist, gained 198 percent. Hard-disk makers Seagate and Western Digital each climbed around 180 percent. Intel posted 197 percent gains amid renewed interest in its Gaudi accelerators and foundry services. The winners extend well past traditional semiconductor names. Morningstar tracked the performance in May.

Yet risks accumulate. Overbuilding remains possible if utilization rates disappoint. Grid congestion could delay projects or raise electricity prices for surrounding communities. Public opposition to new nuclear or gas plants may surface even as sentiment tilts toward nuclear over sprawling data center campuses in some locales. OpenAI outlined its Stargate Community plan in January to ensure its data centers pay their own way on energy and avoid burdening local rates. Reuters detailed the initiative.

The boom continues. Hyperscalers show no sign of slowing. Their combined 2026 spending equates to 2.2 percent of U.S. GDP. That share will likely expand. What began as a race for accelerators has become a contest for energy dominance, land suitable for industrial-scale computing, memory hierarchies that feed hungry models and networks that bind everything together without friction.

Shortages in metals, skilled labor for construction and components for transformers already appear in project timelines. Battery storage, long-duration energy assets and flexible data center designs that ramp with grid signals gain traction. The industry experiments with locating compute next to renewable or nuclear sources rather than forcing power across hundreds of miles.

And the numbers keep rising. What looked enormous six months ago appears conservative today. The physical plant required to train and run the next generation of models will test supply chains, regulatory processes and capital markets in ways few anticipated when the first large language models captured public attention. Power no longer serves as mere input. It defines the ceiling.

The Hidden Scale of AI: Why Power, Memory and Networks Now Dwarf GPU Spending

Notice an error?

Ready to get started?