Google is announcing Rapid Bucket, a high-performance object storage solution for Google Cloud that integrates with PyTorch via the gcsfs fsspec interface. By replacing REST-based storage access with persistent bidirectional gRPC streams backed by Google's Colossus architecture, it delivers a 23% training speed improvement and 4.8x read throughput gains. The integration requires zero code changes — developers only need to switch to a Rapid Bucket and upgrade gcsfs to version 2026.3.0. Benchmarks were run on 16 GKE nodes with A4 GPUs using PyTorch Lightning across 134M rows of data.
Table of contents
The challenge: Keeping GPUs fedRapid Bucket: Rapid Storage via bi-di gRPCFsspec - PyTorch’s Pythonic file interfaceUnder the hood: Leveraging ColossusResultsGet startedSort: