dial9 is a new runtime telemetry tool for Tokio that captures a full timeline of runtime events — individual polls, parks, wakes, and Linux kernel events — rather than just aggregate metrics. Built to diagnose production-only performance issues, it helped identify kernel scheduling delays of 10ms+ on an AWS service, fd_table

6m read timeFrom tokio.rs
Post cover image
Table of contents
What is dial9?Spotting kernel scheduling delayFinding fd_table contentionTasks are constantly movingA global lock in backtrace::traceGetting started
2 Comments

Sort: