Best of Spacelift, August 2025

  1.
    Article
    Spacelift · 43w

    5 Methods to Keep Docker Container Running for Debugging

    A Docker container stops as soon as its main (foreground) process exits, which makes a crashing or short-lived container hard to inspect. Five methods can keep a container running for debugging: overriding the command with an interactive shell, running 'tail -f /dev/null', running 'sleep infinity', committing the container's state to a new image, and wrapping the entrypoint in a custom script. Additional debugging practices include checking container logs and exit codes, opening a shell session to investigate the environment, setting up monitoring, and considering a process manager such as supervisord for advanced scenarios.
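
As a rough illustration of the first three methods, the sketch below builds the corresponding `docker run` argument lists in Python without executing them. The image name `my-app:latest` and the helper `keep_alive_commands` are hypothetical, and the sketch assumes the image ships `/bin/sh`, `tail`, and a `sleep` that accepts `infinity` (GNU coreutils does; minimal images may not).

```python
import shlex

IMAGE = "my-app:latest"  # hypothetical image name, used only for illustration


def keep_alive_commands(image):
    """Return `docker run` argument lists for three keep-alive techniques."""
    return {
        # Replace the image's entrypoint with an interactive shell.
        "interactive_shell": [
            "docker", "run", "--rm", "-it", "--entrypoint", "/bin/sh", image,
        ],
        # Run detached with a process that never exits on its own.
        "tail": [
            "docker", "run", "-d", "--entrypoint", "tail", image, "-f", "/dev/null",
        ],
        # Same idea; assumes the image's `sleep` understands `infinity`.
        "sleep": [
            "docker", "run", "-d", "--entrypoint", "sleep", image, "infinity",
        ],
    }


if __name__ == "__main__":
    # Print each command as a copy-pasteable shell line.
    for name, argv in keep_alive_commands(IMAGE).items():
        print(f"{name}: {shlex.join(argv)}")
```

Setting `--entrypoint` explicitly is a design choice made here so the override behaves the same whether the image defines `ENTRYPOINT`, `CMD`, or both. Once a detached container is up, `docker exec -it <container> /bin/sh` opens a shell inside it.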

  2.
    Article
    Spacelift · 39w

    12 Biggest DevOps Challenges in 2025 (and How to Fix Them)

    DevOps teams in 2025 face 12 major challenges: cultural resistance, security vulnerabilities, tool selection paralysis, developer access barriers, poor visibility, governance difficulties, cost overruns, CI/CD performance issues, scalability problems, metric selection confusion, skill shortages, and over-focusing on tools rather than people. The proposed fixes, in the same order, are clear communication to drive cultural change, DevSecOps practices, a structured tool evaluation process, internal developer platforms, a comprehensive monitoring strategy, policy-as-code governance, centralized cost tracking, pipeline optimization, scalability planning, a focus on DORA metrics, continuous upskilling programs, and a people-first approach to DevOps.

  3.
    Article
    Spacelift · 42w

    Kubernetes Observability: Pillars, Tools & Best Practices

    Kubernetes observability means collecting metrics, logs, and traces to understand a cluster's internal state and performance. The three pillars are metrics for quantitative data, logs for timestamped events, and traces for request paths through microservices. Key tools include Metrics Server for basic resource monitoring, kube-prometheus-stack for comprehensive metrics and visualization, the ELK stack for log management, and OpenTelemetry for distributed tracing. Implementation challenges include managing multiple data types, monitoring dynamic resources, handling large data volumes, and preventing data silos. Best practices emphasize setting up alerts, labeling resources consistently, instrumenting applications, collecting data selectively, and aligning with compliance requirements.
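
One way to make the consistent-labeling practice concrete is a small check that reports which expected labels a manifest lacks. The sketch below is an assumption-laden illustration, not something taken from the article: the helper `missing_labels` is hypothetical, and the required set is an arbitrary subset of the `app.kubernetes.io/*` recommended labels from the Kubernetes documentation.

```python
# Assumed policy: every workload must carry these recommended labels.
REQUIRED_LABELS = (
    "app.kubernetes.io/name",
    "app.kubernetes.io/instance",
    "app.kubernetes.io/part-of",
)


def missing_labels(manifest, required=REQUIRED_LABELS):
    """Return the required label keys absent from a manifest's metadata.labels."""
    # Tolerate manifests where metadata or labels is missing or null.
    labels = (manifest.get("metadata") or {}).get("labels") or {}
    return [key for key in required if key not in labels]


if __name__ == "__main__":
    # A manifest already parsed into a dict (for example from YAML).
    deployment = {
        "kind": "Deployment",
        "metadata": {
            "name": "checkout",
            "labels": {"app.kubernetes.io/name": "checkout"},
        },
    }
    print(missing_labels(deployment))
```

A check like this could run in CI before manifests are applied, so that metrics, logs, and traces can later be grouped by the same label keys.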