OpenAI described how it scaled PostgreSQL to support ChatGPT and its API platform, handling millions of queries per second for hundreds of millions of users. By running a single-primary PostgreSQL dep

InfoQ is a leading online platform for software developers, architects, and technical leaders, providing news, articles, presentations, and interviews on a wide range of topics, including agile practices, DevOps, microservices, and emerging technologies. With a focus on quality content and expert insights, InfoQ helps professionals stay informed about the latest trends, best practices, and industry developments. Developers can learn from real-world experiences, gain  knowledge, and connect with peers in the global software community through InfoQ's diverse and engaging content.

InfoQ

OpenAI scaled a single-primary PostgreSQL instance to handle millions of queries per second for ChatGPT's 800 million users by deploying nearly 50 geo-distributed read replicas on Azure, optimizing query patterns, and offloading write-heavy workloads to sharded systems like Azure Cosmos DB. Key strategies included connection pooling with PgBouncer, reducing write pressure through application-level tuning, implementing cascading replication to reduce primary load, and isolating critical workloads to maintain low-latency performance under global traffic spikes.

OpenAI Scales Single Primary Postgresql to Millions of Queries per Second for ChatGPT