Skip to content

docs(hubv3): human-in-the-loop acceptance testing runbook (§6 test 12)#2200

Open
Mikecranesync wants to merge 4 commits into
feat/plc-mapper-guifrom
docs/hubv3-hitl-testing
Open

docs(hubv3): human-in-the-loop acceptance testing runbook (§6 test 12)#2200
Mikecranesync wants to merge 4 commits into
feat/plc-mapper-guifrom
docs/hubv3-hitl-testing

Conversation

@Mikecranesync

Copy link
Copy Markdown
Owner

What

Adds the human-in-the-loop acceptance testing runbook for HubV3 — the witness procedure for PRD §6 test 12 (the cross-stack Garage Conveyor demo), the one acceptance item that cannot be automated.

Ships as markdown source + rendered PDF:

  • docs/runbooks/hubv3-human-in-the-loop-testing.md
  • docs/runbooks/hubv3-human-in-the-loop-testing.pdf (6 pages)

Why

HubV3 (docs/plans/2026-06-20-hubv3-contextualization-intake-prd.md) has a 12-item §6 matrix. 11 are automated-green — verified on feat/plc-mapper-gui @ 4eaa2dec: 174 tests across four lanes (Python Telegram intake 29, Hub TS unit 60, Hub TS import-integration vs live Postgres 4, mira-contextualizer 81). Item 12 spans two stacks (offline Contextualizer + cloud Hub), needs real fixtures, and ends in a human approval that publishes the model — the doctrinal core of "Hub owns truth." A person must witness it.

Contents

  • Automated-coverage floor + full §6 matrix mapping (so the tester doesn't re-test the green items by hand).
  • Prerequisites: dev/staging only, migrations 055+056, UUID tenant, the §7 fixtures (Micro820_v4.1.9_Program.st, MbSrvConf_v4.xml, gs10usermanual.pdf).
  • 11-step checklist with 12 blocking MUST boxes + per-step expected result and evidence (screenshot rule → docs/promo-screenshots/).
  • Optional Telegram leg that exercises §6 test 9 live (photo/PDF → same Hub import queue → proposed).
  • Sign-off table + defect log + env/safety reminders (no prod, read-only MIRA, apply-migrations.yml).

Docs-only; no code paths touched.

🤖 Generated with Claude Code

PRD §6 has 12 acceptance items; 11 are automated-green (174 tests across
Python + Hub TS unit + import integration + contextualizer lanes). Item 12 —
the cross-stack Garage Conveyor demo (offline → bundle → Hub import → batch/
dedupe/match/stage → human approval → publish → available to MIRA) — cannot be
fully automated: two stacks, real fixtures, and a human approval that publishes
the model. This runbook is the witness procedure for it.

Includes: automated-coverage floor + §6 matrix mapping, prerequisites (dev/
staging only, migs 055+056, UUID tenant, §7 fixtures), an 11-step checklist
with 12 blocking MUST boxes + per-step evidence (screenshot rule), an optional
Telegram leg exercising §6 test 9 live, and a sign-off table. Ships as markdown
source + rendered PDF (markdown→HTML→headless-Chrome).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@Mikecranesync

Copy link
Copy Markdown
Owner Author

Agent Pre-Flight Complete — Ready for Human Testing

An automated pre-flight of the HubV3 HITL acceptance testing runbook has been completed on branch feat/plc-mapper-gui.

Full report: docs/runbooks/hubv3-hitl-agent-preflight-report.md (commit 6677b2fd)

Summary

Check Result
Prerequisites ✅ All met
Automated test floor (172+ tests) ✅ All green
Offline bundle (Steps 1–4) ✅ Well-formed, all MUST boxes
Hub import/match/approve logic (Steps 6–10) ✅ Proven by integration tests
Hub UI flow ⏳ Needs human with browser
MIRA grounded answer (Step 11) ⏳ Needs full MIRA stack

Issues filed during pre-flight

The automated foundation is solid. A human tester can pick up from here: start the Hub dev server, walk through the UI import → review → approve flow, and verify MIRA answers a grounded question about the conveyor.

Mikecranesync pushed a commit that referenced this pull request Jun 27, 2026
… update hot.md + runbooks README

- docs/runbooks/hubv3-hitl-agent-preflight-report.md: add PR/issue cross-references,
  Round 13 finding pointers, and 'where this fits' context for the §6 test matrix.
- docs/runbooks/README.md: add HubV3 contextualization testing section linking the
  HITL runbook, pre-flight report, demo runbook, and rollback runbook.
- wiki/hot.md: update header to 2026-06-26 with HITL pre-flight status summary.

GitHub issues filed: #2320, #2321, #2322 (already filed).
Tracking issue: #2324.
Comments posted on: #2110, #2111, #2200.

Refs: #2134, #2141, #2142, #2324
Mike Harper added 2 commits June 26, 2026 20:37
- Add docs/runbooks/hubv3-hitl-agent-preflight-report.md: automated pre-flight
  results for the HubV3 human-in-the-loop acceptance testing runbook.
  Verifies prerequisites, automated test floor (172+ tests green), offline
  Contextualizer Steps 1-4 (bundle well-formed), and Hub import/match/approve
  logic via integration tests. Flags remaining human-only steps.

- Regenerate mira-hub/docs/SITEMAP.md + sitemap.snapshot.json: contextualization
  routes (4 pages + 10 API routes) were missing from the committed snapshot,
  causing sitemap-drift.test.ts to fail. Fixed by running 'bun run sitemap'.

Refs: #2142 (testing guide), #2134 (HubV3 P0-P6 merge)
… update hot.md + runbooks README

- docs/runbooks/hubv3-hitl-agent-preflight-report.md: add PR/issue cross-references,
  Round 13 finding pointers, and 'where this fits' context for the §6 test matrix.
- docs/runbooks/README.md: add HubV3 contextualization testing section linking the
  HITL runbook, pre-flight report, demo runbook, and rollback runbook.
- wiki/hot.md: update header to 2026-06-26 with HITL pre-flight status summary.

GitHub issues filed: #2320, #2321, #2322 (already filed).
Tracking issue: #2324.
Comments posted on: #2110, #2111, #2200.

Refs: #2134, #2141, #2142, #2324
@Mikecranesync

Copy link
Copy Markdown
Owner Author

Pre-flight report + sitemap fix cherry-picked into this PR

Two commits from feat/plc-mapper-gui have been cherry-picked into docs/hubv3-hitl-testing:

  1. f920e535docs(hubv3): HITL agent pre-flight report + fix sitemap snapshot drift

    • Adds docs/runbooks/hubv3-hitl-agent-preflight-report.md (the full pre-flight evidence)
    • Fixes sitemap drift (regenerates docs/SITEMAP.md + docs/sitemap.snapshot.json)
  2. 10faf08bdocs(hubv3): cross-reference HITL pre-flight report with issues/PRs + update hot.md + runbooks README

    • Cross-references PR/issue numbers in the report
    • Adds HubV3 testing section to docs/runbooks/README.md
    • Updates wiki/hot.md with 2026-06-26 session status

Pre-flight summary

Check Result
Prerequisites ✅ All met
Automated test floor (172+ tests, 4 lanes) ✅ All green
Offline bundle — Steps 1–4 (all MUST boxes) ✅ Well-formed
Hub import/match/approve logic — Steps 6–10 ✅ Proven by integration tests
Sitemap drift test ✅ Fixed and passing

Issues closed during pre-flight

Issues still open (docs improvements for this PR)

Tracking issue

This PR now carries both the human witness procedure AND the automated evidence that the foundation is solid. A human tester can start from Step 5.

…e 3 (#2321) + CHANGELOG entry

- hubv3-human-in-the-loop-testing.md §2: add 'How to run the automated floor'
  section with exact commands for all 4 lanes, including the vitest-not-bun-test
  warning (#2320) and the local Postgres setup for Lane 3 (#2321).
- docs/CHANGELOG.md: add HubV3 HITL pre-flight + sitemap fix entry documenting
  the agent pre-flight results, sitemap fix (#2322), and issues closed/opened.

Closes #2320, closes #2321.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant