Description
The manifesto describes AI in operations: "incident detection, root cause analysis, and remediation suggestions." Git-ape currently stops at post-deployment health checks. This issue establishes Day-2 operational foundations.
Related: #18 (drift detection — another Day-2 capability)
Scope
- TTL Reaper workflow: git-ape-ttl-reaper.yml (or a gh-aw agentic workflow) that checks deployment TTL (set in metadata.json) and auto-destroys expired resources after notification.
- Monitoring setup: During deployment, auto-configure Azure Monitor alerts for key metrics (availability, errors, latency).
- Post-deploy monitoring skill: /azure-monitor-checker that queries Azure Monitor for resource health status. Enables @git-ape status <deployment-id>.
- Runbook generation: Auto-generate operational runbooks from deployment architecture (what to check, how to restart, escalation paths).
- Azure SRE Agent compatibility: Ensure deployment artifacts (architecture diagrams, runbooks) are consumable by Azure SRE Agent.
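To make the TTL Reaper's "notify, then destroy" behaviour concrete, here is a minimal Python sketch of the decision it might make for each deployment. The issue does not define the metadata.json schema, so the field names (deployedAt, ttlHours, notifiedAt) and the 24-hour grace period are assumptions for illustration only, not the project's actual format.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical metadata.json fields (not defined by the issue):
#   "deployedAt": ISO 8601 timestamp of the deployment
#   "ttlHours":   lifetime in hours; absent means "never expire"
#   "notifiedAt": ISO 8601 timestamp set once an expiry notice was sent
GRACE_PERIOD = timedelta(hours=24)  # assumed delay between notice and destroy


def reaper_action(metadata: dict, now: datetime) -> str:
    """Return "keep", "notify", "wait", or "destroy" for one deployment."""
    ttl_hours = metadata.get("ttlHours")
    if ttl_hours is None:
        return "keep"  # no TTL set, so the reaper leaves it alone

    deployed_at = datetime.fromisoformat(metadata["deployedAt"])
    expires_at = deployed_at + timedelta(hours=ttl_hours)
    if now < expires_at:
        return "keep"  # still within its lifetime

    notified_at = metadata.get("notifiedAt")
    if notified_at is None:
        return "notify"  # expired, but owners have not been warned yet

    if now - datetime.fromisoformat(notified_at) >= GRACE_PERIOD:
        return "destroy"  # warned and grace period elapsed
    return "wait"  # warned, grace period still running


if __name__ == "__main__":
    now = datetime(2025, 1, 10, tzinfo=timezone.utc)
    meta = {"deployedAt": "2025-01-01T00:00:00+00:00", "ttlHours": 48}
    print(reaper_action(meta, now))
```

Keeping the decision a pure function of the metadata and the current time means the workflow can be dry-run and unit-tested without touching any Azure resources.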
Acceptance Criteria
- @git-ape status <deployment-id> shows resource health.
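As a rough illustration of what the status reply could contain, the Python sketch below rolls per-resource health states up into one overall state. It assumes the states have already been fetched from Azure Monitor; the function name, the state names, the output format, and the example deployment ID are all hypothetical rather than part of git-ape.

```python
# Worst-first ordering of availability states. The names are modelled on
# the ones Azure Resource Health reports; treat the exact set as an assumption.
SEVERITY = ["Unavailable", "Degraded", "Unknown", "Available"]


def summarize_status(deployment_id: str, states: dict[str, str]) -> str:
    """Format a status reply for one deployment from per-resource states."""
    if not states:
        return f"{deployment_id}: no resources found"

    # Map any unrecognized state to "Unknown" so the ranking never fails.
    normalized = {
        name: (state if state in SEVERITY else "Unknown")
        for name, state in states.items()
    }

    # The overall state is the worst state among all resources.
    overall = min(normalized.values(), key=SEVERITY.index)

    lines = [f"{deployment_id}: {overall}"]
    for name in sorted(normalized):
        lines.append(f"- {name}: {normalized[name]}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(summarize_status("deploy-123", {"web": "Available", "db": "Degraded"}))
```

Reporting the worst state first lets a reader see at a glance whether anything needs attention, with the per-resource lines available for detail.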