Alerting rule recommendations: Suggest tuning to reduce noise #13
Open
Labels
area:deduplication (Alert dedup and noise reduction)
area:observability (Observability tool integrations)
phase:3-remediation (Phase 3: Advanced Remediation & Output)
priority:high (Important, do if time allows)
type:feature (New feature or capability)
Summary
After resolving incidents, the agent should recommend alerting rule changes to prevent the same noise from recurring. This could be threshold adjustments, grouping changes, or silencing flapping alerts.
Why This Matters
Fixing the incident is half the battle. Fixing the alerting rules that generated noise is how you prevent it from happening again. This closes the feedback loop.
Examples
- "`HighMemoryUsage` fired 47 times this week but never led to an incident. Recommend raising threshold from 80% to 90%."
- "`PodRestart` and `CrashLoopBackOff` always fire together. Recommend grouping into a single alert rule."
- "`HighLatency` on service-X fires every deploy and auto-resolves in 2 minutes. Recommend adding a 3-minute pending period."
Acceptance Criteria
- `nightops alerts tune` to show recommendations
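
The three example recommendations above map onto three simple heuristics over alert history. The following is only a rough sketch of how they might be detected, not the project's implementation: the `AlertFiring` and `Recommendation` record shapes, the `recommend` function, and every threshold default (`noisy_count`, `flap_s`, `cofire_window_s`, `cofire_share`) are hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class AlertFiring:
    """One firing of an alert (hypothetical record shape)."""
    name: str
    started_at: float       # seconds since epoch
    duration_s: float       # time until the alert resolved
    led_to_incident: bool


@dataclass(frozen=True)
class Recommendation:
    alerts: tuple           # alert names the recommendation applies to
    kind: str               # "raise_threshold", "add_pending_period", or "group"
    reason: str


def _cofire_share(xs, ys, window_s):
    """Fraction of firings in xs that have a firing in ys within window_s."""
    hits = sum(
        any(abs(x.started_at - y.started_at) <= window_s for y in ys)
        for x in xs
    )
    return hits / len(xs)


def recommend(firings, noisy_count=20, flap_s=180.0,
              cofire_window_s=60.0, cofire_share=0.9):
    """Return tuning recommendations; all defaults are assumed values."""
    by_name = {}
    for f in firings:
        by_name.setdefault(f.name, []).append(f)

    recs = []
    for name, group in sorted(by_name.items()):
        # Fires often but never led to an incident: threshold is likely too low.
        if len(group) >= noisy_count and not any(f.led_to_incident for f in group):
            recs.append(Recommendation(
                (name,), "raise_threshold",
                f"fired {len(group)} times without an incident"))
        # Every firing resolved quickly on its own: a pending period would absorb it.
        if len(group) >= 2 and all(f.duration_s < flap_s for f in group):
            recs.append(Recommendation(
                (name,), "add_pending_period",
                f"all {len(group)} firings resolved in under {flap_s:.0f}s"))

    # Two alerts that almost always start together are candidates for grouping.
    for a, b in combinations(sorted(by_name), 2):
        xs, ys = by_name[a], by_name[b]
        if (min(len(xs), len(ys)) >= 2
                and _cofire_share(xs, ys, cofire_window_s) >= cofire_share
                and _cofire_share(ys, xs, cofire_window_s) >= cofire_share):
            recs.append(Recommendation(
                (a, b), "group", f"fire within {cofire_window_s:.0f}s of each other"))
    return recs
```

A command such as `nightops alerts tune` could print one line per returned `Recommendation`. The pairwise co-firing check is quadratic in the number of firings, which is acceptable for a weekly window but would need indexing for larger histories.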