Design an error budget policy for a platform serving 100M requests/day, including budget calculation, burn rate alerts, exhaustion responses, and a stakeholder buy-in framework.
## Problem
Your organization runs a platform serving 100 million requests per day with a 99.95% availability SLO. Leadership has approved the concept of error budgets but needs a concrete policy: how budgets are calculated, how burn rates trigger alerts, what happens when the budget is exhausted, and how to get cross-functional buy-in. Design this error budget policy end-to-end.
Sign up to access the full problem
Design canvas, rubric, hints, and model solutions.
Explain SLO Trade-offs at Staff Level
Staff · Conceptual