Skip to content

docs(verify): CSO fix, Ctrl+F mental model, sandbox extraction#21

Merged
bensonwong merged 1 commit intomainfrom
docs/verify-cso-ctrlf-sandbox
Apr 12, 2026
Merged

docs(verify): CSO fix, Ctrl+F mental model, sandbox extraction#21
bensonwong merged 1 commit intomainfrom
docs/verify-cso-ctrlf-sandbox

Conversation

@bensonwong
Copy link
Copy Markdown
Contributor

Summary

  • CSO description fix: rewrote the skill description field to be trigger-surface only (removes workflow content that violated the CSO "no workflow in description" rule); now lists semantic triggers like OCR, evidence, citations, source validation
  • Ctrl+F mental model for sourceMatch: replaced the ≤4-word counting heuristic with a "What would I Ctrl+F?" framing + a fact-type quick-reference table (dollar amounts, time limits, party names, priority tiers, trigger verbs, tax thresholds); root cause was models choosing the wrong thing to count, not miscounting
  • Sandbox extraction: moved the 25-line cloud sandbox constraint block to cloud-sandbox-constraints.md and replaced the inline section with a one-line reference — keeps SKILL.md focused on the workflow
  • Auth recovery simplification: collapsed the 15-line auth block to one sentence pointing to CLI-printed recovery options; removed hardcoded script -q -c command (CLI now emits Option A/B itself)
  • Parallel generation block: condensed 400-word sub-agent bullet into structured sub-bullets with Format 1 / Format 2 clearly separated
  • STOP AND CHECK gate: added a pre-verify checklist before Step 3 to catch common citation mismatches before running the CLI
  • Misc fixes: mkdir -p removed (CLI auto-creates dir), p field corrected ("1_0""page_number_1_index_0"), dangling "triage row above" reference made explicit

Test plan

  • Load the verify skill and confirm the description triggers on semantic phrases (OCR, verify claims, cite sources, find evidence) without naming the workflow
  • Run a verify session in a cloud sandbox ($CLAUDE_CODE_REMOTE=true) and confirm the agent reads cloud-sandbox-constraints.md before invoking deepcitation
  • Verify a doc with Format 2 citations and confirm p field uses page_number_N_index_I format (not N_I)
  • Confirm auth recovery no longer hardcodes the script command — agent should follow CLI-printed options
  • Confirm the STOP AND CHECK gate appears in agent output before verify is invoked

- Description: strip workflow content, reframe as trigger-only with semantic
  surface (OCR, evidence, citations, proof, source validation); fixes CSO
  "no workflow in description" rule
- In-text markers: replace ≤4-word counting with Ctrl+F search-key mental model
  and fact-type quick-reference table; counting failed because models chose the
  wrong thing to count, not because they miscounted
- Parallel generation block: condense 400-word bullet into structured sub-bullets
- Add STOP AND CHECK gate before Step 3 Verify
- Cloud sandbox: extract 25-line constraint set to cloud-sandbox-constraints.md,
  replace inline section with one-line reference
- Remove mkdir -p from prepare command (CLI creates .deepcitation automatically)
- Collapse 15-line auth recovery block to one sentence pointing to CLI output
- Fix p field format: "1_0" → "page_number_1_index_0" in Format 2 example
- Replace sub-agent word-count gate with Ctrl+F test + inline examples
- Fix "triage row above" dangling reference → explicit table name

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@bensonwong bensonwong merged commit 38dc3de into main Apr 12, 2026
1 check passed
@bensonwong bensonwong deleted the docs/verify-cso-ctrlf-sandbox branch April 12, 2026 21:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant