ClickPy, a Python package download analytics platform powered by ClickHouse, has reached 2 trillion rows of historical data. The team redesigned their ingestion pipeline by replacing a custom cron-based script with ClickPipes, using a staged approach with cloned databases to validate the migration without disrupting production.

8m read time From clickhouse.com
Post cover image
Table of contents
Replacing the legacy ingestion pipeline #Hot swap with ClickPipes at scale #Discovering historical ingestion gaps #Fix the past #Where this leaves ClickPy #

Sort: