Best of PrometheusOctober 2024

  1. 1
    Article
    Avatar of attiAT&T Israel·2y

    Memory Leak Profiling and Pinpointing for Node.js

    We identified and fixed a memory leak in a Nest.js application running on Kubernetes, which was caused by open handles not being properly closed during proxy requests to Grafana. Using Prometheus and Grafana for monitoring, along with custom modifications to the wtf-node dependency, we pinpointed the issue and adjusted the middleware to prevent the memory leak. Key steps include integrating prom-client with Grafana and using wtf-node to diagnose active handle issues.

  2. 2
    Article
    Avatar of faunFaun·2y

    Learning Go by Instrumenting a Go Application for Prometheus Metrics

    A beginner's guide to learning Go by instrumenting a Go application for Prometheus metrics. This tutorial covers building a Prometheus metrics exporter to consolidate analytics and metrics from Datadog's SLO product. Key steps include parsing the Datadog API response, creating necessary structs in Go, declaring Prometheus metrics, initializing the Datadog client, fetching SLO data, and pushing the metrics to Prometheus.

  3. 3
    Article
    Avatar of grafanaGrafana Labs·2y

    How to use Prometheus to efficiently detect anomalies at scale

    Discover how an effective anomaly detection framework was built using Prometheus and PromQL. Learn about setting up average and standard deviation recording rules, tuning parameters like time windows and multipliers, and addressing challenges such as extreme outliers, low sensitivity, and seasonality. The reusable framework works for any metric and can integrate with your existing Prometheus setup to enhance incident investigation and root-cause analysis.

  4. 4
    Article
    Avatar of last9Last9·2y

    Prometheus Alertmanager: What You Need to Know

    Prometheus Alertmanager helps manage alerts in a production environment by organizing, routing, and deduplicating alerts, thereby reducing alert fatigue. It supports features like alert grouping, silencing, inhibition, and high availability setups. To effectively use Prometheus Alertmanager, ensure to configure alert conditions properly, use grouping and inhibition to avoid notification spam, and implement security best practices such as authentication and TLS encryption. Periodically review and audit alerts to keep configurations relevant and improve upon past incident learnings.