Design a distributed tracing system that propagates context across 5,000+ microservices, ingests 500K spans/sec, and supports trace lookup in under 2 seconds.
## Problem
Design a distributed tracing system similar to Jaeger or Zipkin that captures request flows across 5,000+ microservices. The system must propagate trace context through every hop, collect and store spans, support intelligent sampling to control costs, and enable engineers to query and visualize full traces for debugging production incidents.
Sign up to access the full problem
Design canvas, rubric, hints, and model solutions.