Diagnose intermittent latency spikes where p99 diverges from p50 by 100x, using distributed tracing, GC analysis, lock contention profiling, and network diagnostics to isolate the cause.
## Problem
Your service has a healthy p50 latency of 5ms, but p99 has spiked to 500ms — a 100x divergence. The spikes are intermittent, occurring every 2-5 minutes and lasting 1-3 seconds. The SLO requires p99 under 50ms. Diagnose the root cause of these latency spikes.
Sign up to access the full problem
Design canvas, rubric, hints, and model solutions.