AnonGoldup/database-runbooks

Database Runbooks

Operational runbooks for SQL Server database administration: step-by-step procedures for incident response, disaster recovery, backup and recovery, planned maintenance, and performance troubleshooting, plus standardized templates.

Runbooks

Incident Response

  • database-incident-response.md - General incident response framework with severity classification
  • RUNBOOK-DatabaseOffline.md - Database in SUSPECT, RECOVERY_PENDING, or OFFLINE state
  • RUNBOOK-DiskSpaceCritical.md - Disk space triage: identify consumers, emergency recovery, root cause
  • RUNBOOK-HighCPU.md - CPU diagnosis: query identification, plan analysis, immediate mitigation
  • RUNBOOK-BlockingChain.md - Head blocker identification, kill decision criteria, root cause analysis
  • RUNBOOK-CorruptionDetected.md - DBCC CHECKDB errors: severity assessment, page restore, emergency repair

Disaster Recovery

  • RUNBOOK-FullDRFailover.md - Complete DR failover: database activation, DNS updates, app validation, failback
  • RUNBOOK-DRTest.md - Quarterly DR test procedure with RTO/RPO measurement and sign-off
  • DR-ContactSheet.md - Emergency contacts, escalation matrix, network info template

Backup & Recovery

  • backup-recovery-procedure.md - Full, differential, and log backup procedures with point-in-time and page-level restore

Planned Maintenance

  • patching-checklist.md - Pre-patch, patching, and post-patch verification for SQL Server cumulative updates

Performance Troubleshooting

  • performance-troubleshooting.md - Systematic bottleneck diagnosis: wait stats, I/O, CPU, memory, locking

Troubleshooting Guides

  • TSHOOT-SlowApplication.md - Decision tree: SQL vs application vs network performance issues
  • TSHOOT-ConnectivityIssues.md - SQL Server connectivity diagnosis: network, auth, firewall, SSL/TLS
  • TSHOOT-JobFailures.md - Common SQL Agent job failure patterns and resolution

Templates

  • TEMPLATE-Runbook.md - Standard runbook format with severity, procedure, escalation, rollback
  • TEMPLATE-ChangeRequest.md - Database change request with approvals and rollback plan
  • TEMPLATE-PostIncidentReview.md - Post-mortem template with timeline, root cause, and action items

Structure

Each runbook follows a consistent format:

  1. Summary - What the procedure covers
  2. Severity - Classification and response time
  3. Prerequisites - Required access and tools
  4. Procedure - Step-by-step instructions with embedded SQL
  5. Escalation - When to escalate and to whom
  6. Rollback - Recovery steps if something goes wrong
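
The six-part format above lends itself to an automated check. The following is a minimal sketch, not part of this repository: it assumes each section appears as a Markdown heading (for example `## Summary` or `## 1. Summary`), which the runbooks may or may not do, and the names `REQUIRED_SECTIONS` and `check_runbook` are hypothetical.

```python
import re

# Section names taken from the format list above, in the expected order.
REQUIRED_SECTIONS = [
    "Summary", "Severity", "Prerequisites",
    "Procedure", "Escalation", "Rollback",
]

# Assumption: sections are Markdown ATX headings, optionally numbered,
# e.g. "## Summary" or "## 1. Summary".
HEADING = re.compile(r"^#{1,6}[ \t]+(?:\d+\.[ \t]*)?(.+?)[ \t]*$", re.MULTILINE)


def check_runbook(text):
    """Return (missing, in_order) for a runbook's Markdown text.

    missing  -- required section names with no matching heading
    in_order -- True if the sections that are present follow the expected order
    """
    found = [title.lower() for title in HEADING.findall(text)]
    missing = [name for name in REQUIRED_SECTIONS if name.lower() not in found]
    # Index of the first heading for each required section that is present.
    positions = [
        found.index(name.lower())
        for name in REQUIRED_SECTIONS
        if name.lower() in found
    ]
    in_order = positions == sorted(positions)
    return missing, in_order
```

A caller could read each runbook file and report any for which `missing` is non-empty or `in_order` is false. Lines inside fenced code blocks that begin with `#` would also be treated as headings by this sketch, so a stricter check would need to skip fenced regions first.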

Usage

These runbooks are designed to be followed during real incidents and maintenance windows. They assume the reader has SQL Server administrative access and basic T-SQL proficiency.

License

MIT
