Replication issues in ClickHouse are silent by nature and can accumulate unnoticed until a failover exposes stale data. The key monitoring source is the system.replicas table, which exposes per-table metrics such as queue_size, inserts_in_queue, and merges_in_queue, along with the log_max_index and log_pointer columns from which a replication lag can be derived (replication_lag = log_max_index - log_pointer). Alerts
Table of contents
How Replication Works in ClickHouse
What to Monitor
Diagnosing and Fixing Replication Lag
Key Takeaways
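
The lag computation mentioned in the summary can be sketched in a few lines of Python. This is a minimal illustration, not a definitive implementation: it assumes the rows of system.replicas have already been fetched as dictionaries by whatever client you use, and the helper names (`replication_lag`, `lagging_tables`) and the thresholds (`max_lag`, `max_queue`) are hypothetical values chosen for the example, not recommendations from the article.

```python
from typing import Dict, List

# Query text only; running it requires a ClickHouse client of your choice.
# All columns listed here exist in system.replicas.
REPLICAS_QUERY = """
SELECT database, table, queue_size, inserts_in_queue, merges_in_queue,
       log_max_index, log_pointer
FROM system.replicas
"""


def replication_lag(row: Dict) -> int:
    """Derive lag as log_max_index - log_pointer, clamped at zero."""
    return max(0, int(row["log_max_index"]) - int(row["log_pointer"]))


def lagging_tables(rows: List[Dict], max_lag: int = 10, max_queue: int = 20) -> List[str]:
    """Return 'database.table' names whose lag or queue size exceeds a threshold.

    max_lag and max_queue are hypothetical defaults; tune them per workload.
    """
    flagged = []
    for row in rows:
        if replication_lag(row) > max_lag or int(row["queue_size"]) > max_queue:
            flagged.append(f'{row["database"]}.{row["table"]}')
    return flagged


# Example rows shaped like the result of REPLICAS_QUERY (made-up values).
sample_rows = [
    {"database": "db", "table": "events", "queue_size": 3,
     "inserts_in_queue": 1, "merges_in_queue": 2,
     "log_max_index": 120, "log_pointer": 100},
    {"database": "db", "table": "users", "queue_size": 0,
     "inserts_in_queue": 0, "merges_in_queue": 0,
     "log_max_index": 50, "log_pointer": 50},
]
```

Calling `lagging_tables(sample_rows)` on these made-up rows flags only `db.events`, since its derived lag of 20 exceeds the example threshold of 10 while `db.users` is fully caught up.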