Summary
Create an end-to-end demo scenario that showcases the complete Phase 1-3 pipeline: alert from Grafana → agent queries logs/traces/metrics → RCA with full evidence → auto-generated fix PR.
Why This Matters
This is the "wow" demo for meetups, conferences, and README recordings. It shows TheNightOps going from alert to fix PR autonomously.
Scenario
- Deploy demo app with a memory leak (existing memory-leak scenario)
- Grafana alert fires → TheNightOps receives webhook
- Agent checks K8s (OOMKill detected) → queries Grafana for memory trends → queries Sentry for the error stack trace
- Agent builds RCA: "Memory leak in BatchProcessor introduced in v2.4.1"
- Agent generates PR: bump memory limit + link to code fix recommendation
- Dashboard shows full investigation timeline with observability evidence
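The steps above can be sketched as an ordered investigation timeline. This is only an illustrative model, not TheNightOps code: the `Evidence` and `Investigation` classes, the `run_demo_investigation` helper, and the alert name are hypothetical, and the findings are canned placeholder strings rather than results of real Kubernetes, Grafana, or Sentry calls.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Evidence:
    source: str   # where the finding came from, e.g. "kubernetes"
    finding: str  # human-readable summary shown on the timeline


@dataclass
class Investigation:
    alert_name: str
    timeline: List[Evidence] = field(default_factory=list)

    def record(self, source: str, finding: str) -> None:
        """Append one piece of evidence in the order it was gathered."""
        self.timeline.append(Evidence(source, finding))

    def sources(self) -> List[str]:
        """Return the evidence sources in timeline order."""
        return [item.source for item in self.timeline]


def run_demo_investigation(alert_name: str) -> Investigation:
    """Walk the scenario steps with canned findings (no real API calls)."""
    inv = Investigation(alert_name)
    # Placeholder findings that mirror the scenario bullets above.
    inv.record("kubernetes", "OOMKill detected on demo app pod")
    inv.record("grafana", "memory trend queried for the demo app")
    inv.record("sentry", "error stack trace retrieved")
    inv.record("rca", "Memory leak in BatchProcessor introduced in v2.4.1")
    inv.record("pr", "bump memory limit + link to code fix recommendation")
    return inv


if __name__ == "__main__":
    # "demo-app-memory" is a made-up alert name for illustration.
    investigation = run_demo_investigation("demo-app-memory")
    for step, item in enumerate(investigation.timeline, start=1):
        print(f"{step}. [{item.source}] {item.finding}")
```

Keeping the evidence in an ordered list is one simple way to give the dashboard a timeline to render; a real implementation would likely also attach timestamps and links back to each observability source.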
Acceptance Criteria
- scripts/demo-gke.sh or local mode