As AI demand surges, tiny distributed inference data centers tap into unused grid capacity, helping avoid costly infrastructure and grid connection delays.

IEEE Spectrum's platform is a central hub for technology enthusiasts and professionals, offering insights into  technologies, engineering innovations, and scientific discoveries. Through articles, reports, and interviews, IEEE Spectrum offers insights into emerging technologies, research breakthroughs, and industry trends across various domains. Readers can stay updated with the latest advancements in technology and explore the impact of technology on society and the environment.

IEEE Spectrum

Nvidia, EPRI, InfraPartners, and Prologis are piloting a fleet of roughly 25 micro data centers (5–20 MW each) co-located at utility substations across five U.S. utilities. The strategy, called 'distributed inference,' exploits the fact that AI inference workloads can be dynamically routed between locations, allowing compute to shift to whichever substation has spare capacity. This sidesteps the decade-long grid connection queues facing large data centers, reduces the need for new transmission infrastructure, and taps the ~47% of U.S. generation capacity that sits idle outside peak demand hours. Construction of the pilot fleet is targeted for late 2026, with workload rerouting expected to be needed only about 0.1% of the time.

Grid Flexibility and Distributed Inference Data Centers

Building energy flexibility into data centers