Skip to content

Latest commit

 

History

History
57 lines (29 loc) · 5.53 KB

File metadata and controls

57 lines (29 loc) · 5.53 KB

Glossary

Definitions of terms used across the documentation to avoid ambiguity.

Lab terminology (hospital lab, pathology lab, blood sciences lab)

Documentation uses a three-level hierarchy for lab scope:

  • Hospital lab (broad): Any lab operated within or for a hospital—specimen handling, diagnostics, point-of-care, etc. LabTrust-Gym is framed as an environment for hospital lab automation in this broad sense.

  • Pathology lab (more precise): A lab that performs diagnostic pathology (histopathology, clinical chemistry, haematology, immunology, microbiology, etc.). Blood sciences is one branch of pathology.

  • Blood sciences lab / blood sciences lane (most precise): The specific workflow modeled in LabTrust-Gym: specimen reception, accessioning, pre-analytics, routine and STAT analytics, QC, critical result notification, release, and multi-site transport with chain-of-custody. The domain adapter is registered as hospital_lab; the simulated workflow is a blood sciences lane (a type of pathology lab within the broader hospital lab category).

When describing what is implemented, use "blood sciences lane" or "blood sciences lab." When describing applicability or motivation, use "hospital labs" or "pathology labs" as appropriate.

Baseline

The word "baseline" is used in two distinct ways in this repo:

  • Official baselines (v0.2): The frozen result set produced by labtrust generate-official-baselines. Used for regression: the baseline regression guard compares exact metrics against these results. Stored under benchmarks/baselines_official/v0.2/. Tasks are defined in benchmarks/baseline_registry.v0.1.yaml (core tasks and optionally coord_scale/coord_risk). Regenerate with labtrust generate-official-baselines --out benchmarks/baselines_official/v0.2/ --episodes N --seed N --force.

  • Baseline ID / coordination method (for coord tasks) — For coord_scale and coord_risk, the pack YAML key baselines maps each task to a coordination method ID (for example kernel_scheduler_or). The pack often uses a versioned name (for example kernel_scheduler_or_v0); the runner strips the _v0 suffix to obtain the method_id. In documentation, prefer coordination method or coord method when referring to this identifier so readers distinguish it from the official baselines (v0.2) result set.

See Official benchmark pack and Benchmark card for how these are used in practice.

RBAC

Role-based access control uses policy-driven roles and permissions (for example in policy/rbac/) that restrict which agents can perform which actions. Validated by labtrust validate-policy and enforced by the engine.

Simulation

The LabTrust-Gym benchmark and env runs: deterministic or stochastic runs under policy. Used for development, regression, and security/safety evaluation.

Production

Live deployment with real users, keys, and adversaries. Production assurance remains the integrator's responsibility; simulation results inform but do not replace site-specific validation. See Threat model and State of the art and limits.

UI bundle and coordination artifacts

  • UI bundle: A zip produced by labtrust ui-export --run <dir> --out <zip>. Contains index.json, events.json, receipts_index.json, reason_codes.json, and when the run has coordination pack output, a coordination_artifacts list and files under coordination/ (tables and HTML charts). See UI data contract, Frontend handoff.

  • coordination_artifacts: In index.json, an optional list of { path, label } entries. Each file is stored in the zip at coordination/ + path. Includes pack_summary, SOTA leaderboard (main and full), method-class comparison, and coordination/graphs/ HTML charts (SOTA key metrics, throughput, violations, resilience, method-class).

  • SOTA leaderboard: State-of-the-art (SOTA) summary table(s) produced by summarize-coordination from pack_summary or summary_coord: main table (key metrics, run metadata), full table (all numeric aggregates), and method-class comparison. These are written to summary/ and included in the UI bundle when present. See Hospital lab key metrics.

Proof-Carrying Science (PCS)

  • PCS (Proof-Carrying Science): A pattern where a workflow exports versioned JSON artifacts (trace, runtime receipt, science claim bundle, handoffs, certificates) validated by pcs-core. LabTrust-Gym ships a reference QC-release workflow under examples/pcs_qc_release/. See PCS overview.

  • pcs-core / pcs-bench: Upstream schema and benchmark tooling repositories. LabTrust is a producer of reproducibility ingest bundles consumable by pcs-bench.

Evidence bundle and release directory

  • Evidence bundle (EvidenceBundle.v0.1): A directory produced by labtrust export-receipts from an episode log; contains manifest, schema, hashchain, and invariant trace. Verified by labtrust verify-bundle. See Trust verification.

  • Release directory: Output directory of labtrust package-release; contains MANIFEST.v0.1.json, receipts/, FIGURES/, and optionally RISK_REGISTER_BUNDLE. After labtrust build-release-manifest --release-dir <dir> and labtrust verify-release --release-dir <dir>, the directory is a verifiable release artifact.