Main Takeaways
- Look for steps in the data
- Investigate known incidents and regressions to find the right indicators
- Get granular, but stay significant
- Observe coarse data on a meta level
- Link related metrics
- Send SMART alerts, taking workload into consideration
- Track alerting effectiveness
- Continually research & experiment
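
As an illustration of the first takeaway, here is a minimal sketch of one way to look for steps in a metric series. It is an assumption made for illustration, not a method prescribed above: the function name `find_steps` and the parameters `window` and `min_sigma` are hypothetical. It compares the mean of the points before each position with the mean of the points after it, and flags the position when the gap is large relative to the noise in either window.

```python
from statistics import mean, pstdev


def find_steps(values, window=5, min_sigma=3.0):
    """Return indices where the series appears to jump to a new level.

    Hypothetical helper: an index i is flagged when the mean of
    values[i:i + window] differs from the mean of values[i - window:i]
    by more than min_sigma times the larger of the two windows'
    standard deviations.
    """
    steps = []
    for i in range(window, len(values) - window + 1):
        before = values[i - window:i]
        after = values[i:i + window]
        # Use the noisier window as the yardstick; the small floor
        # avoids a zero threshold on perfectly flat data.
        noise = max(pstdev(before), pstdev(after), 1e-9)
        if abs(mean(after) - mean(before)) > min_sigma * noise:
            steps.append(i)
    return steps


# Made-up sample: a metric that sits near 10, then shifts to about 15.
series = [10.0, 10.2, 9.8, 10.1, 9.9, 15.0, 15.2, 14.8, 15.1, 14.9]
print(find_steps(series))  # → [5]
```

Larger values of `window` and `min_sigma` trade sensitivity for fewer false positives, which is one way to stay granular while remaining significant.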