Skip to main content
Definition

Error Budget

Definition

Error Budget

An error budget is the allowed margin for failures within a defined time period. It quantifies how much unreliability a service can tolerate before the team must prioritise stability over new features.

In detail

Error budgets are derived from SLOs. If a service has a 99.9% availability SLO, the error budget is 0.1% of the measurement period. For a 30-day month, that is approximately 43 minutes of allowed downtime.

When the budget is exhausted, the team shifts focus to reliability work. When the budget is full, the team can deploy faster and take more risks. This creates a self-regulating system that balances innovation with stability.

How Tallence helps

Tallence helps teams define SLOs and implement error budget policies as part of SRE adoption.

Learn more about SRE