Reddit faced challenges handling scattered metadata across multiple systems. To address this, they built a unified media metadata store using AWS Aurora Postgres. This solution supports over 100K read requests per second with low latency. The setup included dual writes, data backfill, and robust data validation using Kafka for Change Data Capture (CDC). They also implemented range-based partitioning to ensure performance and scalability, enabling Reddit to handle expected volume growth efficiently.

5m read timeFrom newsletter.systemdesigncodex.com
Post cover image
Table of contents
Requirements of the Metadata StoreData Migration ProcessDealing with IssuesWhat about Scaling?Shoutout
2 Comments

Sort: