Definitions of terms used across the documentation to avoid ambiguity.
Documentation uses a three-level hierarchy for lab scope:
-
Hospital lab (broad): Any lab operated within or for a hospital—specimen handling, diagnostics, point-of-care, etc. LabTrust-Gym is framed as an environment for hospital lab automation in this broad sense.
-
Pathology lab (more precise): A lab that performs diagnostic pathology (histopathology, clinical chemistry, haematology, immunology, microbiology, etc.). Blood sciences is one branch of pathology.
-
Blood sciences lab / blood sciences lane (most precise): The specific workflow modeled in LabTrust-Gym: specimen reception, accessioning, pre-analytics, routine and STAT analytics, QC, critical result notification, release, and multi-site transport with chain-of-custody. The domain adapter is registered as
hospital_lab; the simulated workflow is a blood sciences lane (a type of pathology lab within the broader hospital lab category).
When describing what is implemented, use "blood sciences lane" or "blood sciences lab." When describing applicability or motivation, use "hospital labs" or "pathology labs" as appropriate.
The word "baseline" is used in two distinct ways in this repo:
-
Official baselines (v0.2): The frozen result set produced by
labtrust generate-official-baselines. Used for regression: the baseline regression guard compares exact metrics against these results. Stored underbenchmarks/baselines_official/v0.2/. Tasks are defined inbenchmarks/baseline_registry.v0.1.yaml(core tasks and optionally coord_scale/coord_risk). Regenerate withlabtrust generate-official-baselines --out benchmarks/baselines_official/v0.2/ --episodes N --seed N --force. -
Baseline ID / coordination method (for coord tasks) — For coord_scale and coord_risk, the pack YAML key
baselinesmaps each task to a coordination method ID (for examplekernel_scheduler_or). The pack often uses a versioned name (for examplekernel_scheduler_or_v0); the runner strips the_v0suffix to obtain the method_id. In documentation, prefer coordination method or coord method when referring to this identifier so readers distinguish it from the official baselines (v0.2) result set.
See Official benchmark pack and Benchmark card for how these are used in practice.
Role-based access control uses policy-driven roles and permissions (for example in policy/rbac/) that restrict which agents can perform which actions. Validated by labtrust validate-policy and enforced by the engine.
The LabTrust-Gym benchmark and env runs: deterministic or stochastic runs under policy. Used for development, regression, and security/safety evaluation.
Live deployment with real users, keys, and adversaries. Production assurance remains the integrator's responsibility; simulation results inform but do not replace site-specific validation. See Threat model and State of the art and limits.
-
UI bundle: A zip produced by
labtrust ui-export --run <dir> --out <zip>. Containsindex.json,events.json,receipts_index.json,reason_codes.json, and when the run has coordination pack output, a coordination_artifacts list and files undercoordination/(tables and HTML charts). See UI data contract, Frontend handoff. -
coordination_artifacts: In
index.json, an optional list of{ path, label }entries. Each file is stored in the zip atcoordination/+path. Includes pack_summary, SOTA leaderboard (main and full), method-class comparison, and coordination/graphs/ HTML charts (SOTA key metrics, throughput, violations, resilience, method-class). -
SOTA leaderboard: State-of-the-art (SOTA) summary table(s) produced by
summarize-coordinationfrom pack_summary or summary_coord: main table (key metrics, run metadata), full table (all numeric aggregates), and method-class comparison. These are written tosummary/and included in the UI bundle when present. See Hospital lab key metrics.
-
PCS (Proof-Carrying Science): A pattern where a workflow exports versioned JSON artifacts (trace, runtime receipt, science claim bundle, handoffs, certificates) validated by pcs-core. LabTrust-Gym ships a reference QC-release workflow under
examples/pcs_qc_release/. See PCS overview. -
pcs-core / pcs-bench: Upstream schema and benchmark tooling repositories. LabTrust is a producer of reproducibility ingest bundles consumable by pcs-bench.
-
Evidence bundle (EvidenceBundle.v0.1): A directory produced by
labtrust export-receiptsfrom an episode log; contains manifest, schema, hashchain, and invariant trace. Verified bylabtrust verify-bundle. See Trust verification. -
Release directory: Output directory of
labtrust package-release; contains MANIFEST.v0.1.json, receipts/, FIGURES/, and optionally RISK_REGISTER_BUNDLE. Afterlabtrust build-release-manifest --release-dir <dir>andlabtrust verify-release --release-dir <dir>, the directory is a verifiable release artifact.