Feat/webhook levels prompt gurads by 403ENDer · Pull Request #4999 · superplanehq/superplane

403ENDer · 2026-05-25T18:38:55Z

Summary

This PR introduces two major workstreams:

Webhook reconciliation and subscription binding enhancements
AI prompt guardrails and execution safety enforcement

Webhook Enhancements

The webhook-related changes introduce:

Scope-based reconciliation support
Subscription binding persistence
Webhook operation tracking
Shadow-mode drift detection infrastructure

These changes lay the foundation for scalable and deterministic webhook lifecycle management across integrations.

AI Prompt Guardrails

The guardrails-related changes introduce:

Post-interpolation prompt scanning
Policy-based enforcement
Audit persistence
Soft/hard block execution handling
Human-review workflow support for AI-powered workflow components

Additional Documentation

Branch change documentation:
https://docs.google.com/document/d/1mYKxcyXxF-kckvtUkxRICF08MJMOsbcD-lGthbkOaxA/edit?usp=sharing
MCP server roadmap and suggestions document:
https://docs.google.com/document/d/1kMkTjmXY0yI0TvPEV6LgMyAEGbElqw1VpyHeql_DqWw/edit?usp=sharing

Notes

Webhook reconciliation features are rollout-gated through environment flags and currently operate in shadow/observe-only mode.
Guardrails default to audit-only behavior unless enforcement policies are explicitly configured.
Some guardrail-related changes may be split into follow-up PRs based on review feedback.

Implements a 5-phase prompt guardrail system to detect and block injection attacks, secret leakage, and unsafe instructions in AI node prompts before they reach the LLM provider. Phase 0 – Schema: Two DB migrations adding 5 guardrail tables (prompt_guardrail_policies, prompt_scan_results, prompt_classifier_results, prompt_override_approvals, prompt_guardrail_bypass_tokens) and 2 execution columns. Field metadata extended with PromptField/SystemPromptField markers. Phase 1 – Dark-launch rule engine: pkg/guardrails/ package with 8 detection rules (6 secret, 2 injection), audit-only default policy, and integration in node_executor via FeaturePromptGuardrails flag. Phase 2 – Warn-only tier: ScanConfiguration returns ScanOutcome; warn_only executions write guardrail_warning to execution Metadata JSON without blocking. Phase 3 – Soft-block + GuardianWorker: GuardrailGuardianWorker polls blocked executions every 30s; resumes on override_approved or times out with guardrail_override_timeout after SoftBlockTimeoutSeconds. Phase 4 – Classifier infrastructure: Classifier interface, NoOpClassifier, ClassifierWorker polling pending jobs in batches, and policy service layer (GetOrgPolicy, UpsertOrgPolicy, ListPendingOverrides, ApproveOverride). Phase 5 – Full enforcement + admin API: AnthropicClassifier calls claude-haiku-4-5-20251001 to confirm findings and refine risk scores. Six admin HTTP handlers under /admin/api for managing org/workflow policies and approving soft-block overrides. Enable in production via env vars: START_GUARDRAIL_GUARDIAN_WORKER=yes START_CLASSIFIER_WORKER=yes ANTHROPIC_CLASSIFIER_API_KEY=<key> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…o webhook-levels-prompt-guardils

superplanehq-integration · 2026-05-25T18:38:59Z

👋 Commands for maintainers:

/sp start - Start an ephemeral machine (takes ~30s)
/sp stop - Stop a running machine (auto-executed on pr close)

403ENDer and others added 3 commits May 25, 2026 22:26

webhook levels added

79ae445

Merge branch 'main' of https://github.com/superplanehq/superplane int…

4a302a1

…o webhook-levels-prompt-guardils

superplane-policy-bot Bot requested review from forestileao, lucaspin and shiroyasha May 25, 2026 18:39

Merge branch 'main' into feat/webhook-levels-prompt-gurads

b623a1a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/webhook levels prompt gurads#4999

Feat/webhook levels prompt gurads#4999
403ENDer wants to merge 4 commits into
superplanehq:mainfrom
403ENDer:feat/webhook-levels-prompt-gurads

403ENDer commented May 25, 2026

Uh oh!

superplanehq-integration Bot commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

403ENDer commented May 25, 2026

Summary

Webhook Enhancements

AI Prompt Guardrails

Additional Documentation

Notes

Uh oh!

superplanehq-integration Bot commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant