Google has unveiled the 8th generation of its Tensor Processing Units (TPUs), introducing two specialized chips: TPU 8t for large-scale model training and TPU 8i for low-latency inference. The TPU 8t delivers nearly 3x the compute performance of the previous generation, scales to 9,600 chips in a single superpod with 2 petabytes of shared HBM, and can grow to a million chips in a single cluster. The TPU 8i targets agentic workloads with 288GB of memory, doubled ICI bandwidth at 19.2 Tb/s, and 80% better performance per dollar. Google's design philosophy of co-designing silicon with software and networking remains central to these gains.
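The superpod figures above imply a per-chip share of the pooled HBM. A quick sanity check, assuming decimal units (1 PB = 10^15 bytes); the per-chip number is an inference from the stated totals, not a figure from the announcement:

```python
# Back-of-the-envelope check of the TPU 8t superpod figures.
# Assumption: 2 PB means 2e15 bytes (decimal), divided evenly across chips.
superpod_chips = 9_600
shared_hbm_bytes = 2e15  # 2 petabytes of shared HBM across the superpod

hbm_per_chip_gb = shared_hbm_bytes / superpod_chips / 1e9
print(f"~{hbm_per_chip_gb:.0f} GB of HBM per chip")
```

This works out to roughly 208 GB per training chip, in the same ballpark as the 288GB quoted for the inference-oriented TPU 8i.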

From infoq.com