Kafka time lag metrics can silently underreport consumer delays due to two features: log compaction and retention deletion. When a consumer's committed offset points to a deleted message, Kafka returns the next available message with a newer timestamp, making lag appear smaller than it really is. In extreme cases, reported lag

7m read timeFrom softwaremill.com
Post cover image
Table of contents
Quick Recap: How Time Lag WorksLog Compaction ProblemRetention Deletion: When Your Offsets Fall Off the CliffDetection: How klag-exporter Catches These LiesWhat to Do About ItFor Monitoring Dashboards

Sort: