Diagnose a DNS outage affecting service discovery, analyzing the resolver chain, TTL behavior during failures, negative caching effects, and designing DNS failover strategies.
## Problem
All inter-service communication in your infrastructure has stopped. Services report "connection refused" or "unknown host" errors when trying to reach each other. External domains (google.com, etc.) resolve normally. The issue started 15 minutes ago and affects all services. Diagnose the DNS failure, identify the failing component, and restore service communication.
Sign up to access the full problem
Design canvas, rubric, hints, and model solutions.