Skip to content

Add safe node restart runbook for jiva-ctrl nodes (PGM-223)#35

Merged
pgmac merged 1 commit into
mainfrom
paulymac/pgm-223-create-safe-rolling-node-restart-runbook-for-nodes-hosting
May 30, 2026
Merged

Add safe node restart runbook for jiva-ctrl nodes (PGM-223)#35
pgmac merged 1 commit into
mainfrom
paulymac/pgm-223-create-safe-rolling-node-restart-runbook-for-nodes-hosting

Conversation

@pgmac
Copy link
Copy Markdown
Contributor

@pgmac pgmac commented May 30, 2026

Summary

  • New runbook src/runbooks/jiva-ctrl-node-rolling-restart.md — safe pre-restart procedure for nodes hosting jiva-ctrl (iSCSI target) pods
  • Migrate workload pods and verify iSCSI sessions clear before restarting, preventing the EXT4 read-only cascade from the 2026-05-28 incident
  • Replaces the "to be written" PGM-223 placeholder in the jiva-ctrl eviction runbook with an actual link
  • Cross-referenced from: jiva-ctrl-eviction-iscsi-ro-filesystem.md, kubelet-silent-stall.md, PIR 2026-05-28
  • Added to mkdocs nav and runbooks index

Test plan

  • Review runbook commands for correctness (Step 2 iSCSI session check, Step 3 PVC lookup)
  • Verify all cross-reference links resolve in mkdocs
  • Confirm nav renders the new runbook in the Runbooks section

🤖 Generated with Claude Code

Documents the pre-restart migration procedure for nodes hosting jiva-ctrl
pods, preventing the iSCSI session drop → EXT4 read-only cascade identified
in the 2026-05-28 incident.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@pgmac pgmac merged commit 24c2931 into main May 30, 2026
1 check passed
@pgmac pgmac deleted the paulymac/pgm-223-create-safe-rolling-node-restart-runbook-for-nodes-hosting branch May 30, 2026 10:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant