RobotEscapeRoom

_{Every cost function, one self-solving escape game — map + camera + 3D sim.
Three panels: stacked-floor topology map (2D),
robot camera · rgb (first-person furnished interior),
and isometric 3D sim · furnished rooms (imported OBJ meshes).
Colour legend: cyan = traveled, pink = planned, red = locked.
Every GIF frame is one real A* planner step — room quests and puzzle
captions show items, riddles, and the Floor-3 decoy twist. Live stack:
Gazebo + AMCL + Nav2 + escape_room_runner dynamic replan.
Interactive Foxglove replay:
docs/foxglove/robot_escape_room_demo.mcap.
Robot T-0 recomposes
block_edges · block_edge_types ·
avoid_restricted · prefer_elevator ·
resolve_goal each turn — no scripted route. The lit
EMERGENCY EXIT on Floor 3 is a decoy; the real way out is
the sublevel. Play it with
python examples/robot_escape_room.py.
Regenerate the hero:
PYTHONPATH=. python3 examples/generate_escape_room_meshes.py then
scripts/foxglove_hero/build_escape_room_gif.sh
(Foxglove MCAP first:
PYTHONPATH=. python3 examples/export_escape_room_foxglove_mcap.py).
Other variants:
python examples/record_escape_room_sim.py → dashboard GIF;
python examples/record_escape_room.py → three-panel analytics GIF;
./scripts/record_escape_room_gz_sim.sh → Gazebo overview MP4
(docs/images/robot_escape_room_gz.mp4).
Gallery write-up.}

Grounded middle planning layer for robot navigation. Bridges dense maps (SLAM / occupancy / HD) and motion executors (Nav2 / Autoware / MPPI / learned policies) with a graph-level layer that decides where to go, why, and who first — under language goals, calendar-aware closures, soft preferences, deadlines, and multi-agent reservations. Pure-Python core, zero hard dependencies, full Protocol conformance suites.

Use it when a robot stack already has local motion, but still needs:

language goals grounded into stable topology node ids;
semantic A* routes over rooms, corridors, elevators, stairs, closures, and preferences;
multi-robot reservation/admission decisions with explainable denial reasons.

What it does

Three orthogonal axes, all composable:

Plan

Routes on semantic graphs with composable cost rules. compose_costs stacks avoid_stairs / prefer_elevator / block_edges / time_aware / preference_aware / reservation_aware / floor_change_penalty and a dozen others into a single A* call. No re-implementations per scenario — declare what you want, the planner honors it. Hand the graph to the ROS 2 Nav2 Route Server with semantic-toponav export-nav2 (topology_to_nav2_geojson) — this is the planning tier that feeds Nav2, not a rival to it.

Coordinate

Multi-agent fleets with atomic reservations and seven strategies: greedy / priority / deadline / joint / bnb (branch-and-bound, 3 objectives) / exhaustive (MIS upper bound) / insert (insertion-based repair). Hard deadline admission with a structured reason_code so denials are explainable. Optional in-process or HTTP scheduler for fan-out across processes.

Resolve

Natural-language goals → node ids. The deterministic floor (bag-of-words + floor parsing) always runs first; an LLM may rewrite prose or re-rank the top-k pool but cannot invent node ids — out-of-pool picks silently fall back, a property adversarially audited at a 0.00 leak rate (hallucinated ids, prompt-injection, payloads, near-misses — see eval/no_invent.py). Multi-turn DialogSession for ambiguous queries; optional CLIP / VLM cosine retrieval for embedding-grounded resolves.

See each axis run. Four worked demos in one style — input → a scored decision → the result, every bar and route from real API output:

🎮 Escape room (top) — puzzles as planner primitives, emergent six-turn escape;
🗣️ Language grounding → route — a sentence → resolve_goal scores → the A* route up the elevator;
📷 Visual localization → navigation — a camera frame → CLIP cosine → route progress to the goal;
🚦 Multi-agent coordination — fleet requests → the strategy decision → who gets the chain.

Quick start

pip install -e .
semantic-toponav plan          examples/indoor_office.yaml entrance meeting_room
semantic-toponav waypoints     examples/indoor_office.yaml entrance office_2f --avoid-stairs --prefer-elevator
semantic-toponav describe-path examples/indoor_office.yaml entrance office_2f --avoid-stairs --prefer-elevator

from semantic_toponav.graph.serialization import load_graph
from semantic_toponav.planner import (
    plan_astar, avoid_stairs, prefer_elevator, compose_costs,
)
from semantic_toponav.waypoint import path_to_semantic_waypoints

graph = load_graph("examples/indoor_office.yaml")
path = plan_astar(graph, "entrance", "office_2f",
                  cost_fn=compose_costs(avoid_stairs, prefer_elevator))
for wp in path_to_semantic_waypoints(graph, path):
    print(wp.instruction)

New here? Run the ten-minute tour (python examples/ten_minute_tour.py) for a single-file walkthrough of the three axes — Resolve, Plan, Coordinate — on the shipped multi_floor_office.yaml graph. No plotting, no LLM credentials, runs in under a second.

For a deeper read, walk through the three-floor tutorial end-to-end.

Gallery

Language grounding → route

The language twin of the page hero: where that one grounds a camera frame to a place, this grounds a sentence. resolve_goal scores every node by a bag-of-words + floor-aware match, the top node becomes the goal, and plan_astar rides the elevator up to it.

_{The query "executive office on 3F" parses to
floor 3 + tokens {executive, office};
resolve_goal scores Executive Office at 7
(floor + both labels) clear of the four floor-only 3F candidates at 3,
and plan_astar(..., prefer_elevator) climbs
entrance → corridor → elevator 1F→2F→3F → executive office.
Every bar and green leg is real output from the deterministic resolver
and planner — no model, no API key. Regenerate with
python examples/record_language_hero.py.}

Cost composition

The same graph re-planned under different cost stacks. The path changes; nothing about the graph does.

default A*	+ avoid_stairs + prefer_elevator

default to meeting room	+ restricted-edge avoidance

Multi-floor planning

floor_change_penalty, prefer_floor, same_floor_only, and a floor_aware_heuristic make multi-storey layouts a first-class target — no per-floor sub-graphs needed.

default (cheapest stairs route)	prefer_elevator

prefer_floor=2 (bias toward 2F)	floor_change_penalty (avoid hopping floors)

Escape room — every cost function in one self-solving game

The page hero is the 3D Foxglove replay GIF above (export_escape_room_foxglove_mcap.py + build_escape_room_gif.sh). A Foxglove dashboard variant lives at docs/images/robot_escape_room_dashboard.gif; a three-panel analytics variant at docs/images/robot_escape_room_panels.gif; a Gazebo overview MP4 at docs/images/robot_escape_room_gz.mp4 (./scripts/record_escape_room_gz_sim.sh). The gallery above shows each feature in isolation; the escape room ties them together. A robot, T-0, wakes in a locked-down facility and has to reason its way out. Each puzzle is a thin narrative skin over a real planner primitive:

Puzzle	Primitive
Keycard lock	`block_edges` until the matching item is held
Dark corridor	`block_edge_types("unpowered")` until the power core is collected
Laser shortcut	`avoid_restricted` — shown via reckless-vs-safe briefing at startup
Stairs vs lift	`prefer_elevator` — cheaper stairs exist, T-0 rides the lift
Riddle terminal	`resolve_goal` grounds the clue and reveals hidden items

There is no scripted route — each turn the runner recomposes the current cost stack, asks A* what is reachable now, walks to the nearest objective, and re-plans. The twist is structural: a lit EMERGENCY EXIT on Floor 3 is welded shut (master_seal — no key exists); a control-room riddle grounds "maintenance exit" to the sublevel and hands over the hatch code, flipping the route from all-the-way-up to all-the-way-down.

Gazebo / gz-sim: furnished facility mesh + interior collision boxes — open in Harmonic with:

PYTHONPATH=. python3 examples/generate_escape_room_meshes.py
PYTHONPATH=. python3 examples/generate_escape_room_gazebo_world.py
export GZ_SIM_RESOURCE_PATH="$(pwd)/examples/meshes/escape_room/gazebo/models:$GZ_SIM_RESOURCE_PATH"
gz sim examples/meshes/escape_room/gazebo/escape_room.world

See examples/meshes/escape_room/gazebo/README.md.

Nav2: export the topology with python examples/export_escape_room_nav2_route.py → examples/data/nav2/escape_room_graph.geojson.

Full sim stack: ./scripts/run_escape_room_gz_nav2.sh launches Gazebo + ros_gz_bridge + Nav2 + semantic waypoint following — see ros2/README.md.

Record Gazebo MP4: ./scripts/record_escape_room_gz_sim.sh replays the shipped timeline, drives T-0, and captures the overview camera → docs/images/robot_escape_room_gz.mp4. When gz-sim camera output is blank (headless / no GPU), the script falls back to a CPU overview renderer that matches the Gazebo camera pose.

Conversion pipeline

Topology graphs can be authored by hand or generated from existing artifacts: occupancy grids via skeletonization + clearance-aware door detection + region segmentation, or trajectory logs (CSV / rosbag2) via greedy clustering.

occupancy grid → topology	path on the auto-generated graph

trajectory log → topology	CSV trajectory (no pandas)

VLM region embedding

After annotate_regions carves a graph into rooms, embed_region_patches stamps an encoder vector onto every node in each region (CLIP, Hashing, or any Backend-conforming adapter). At query time the same vector can be used to retrieve nodes by cosine similarity — the same wire format the LLM resolver consumes as embedding_score= context.

_{Three query regions, three different highlight patterns. The
example uses the dependency-free HashingBackend; swap in
CLIPBackend + an AlignedRgbSource to ground
text queries on real photographs. Reproduce via
python examples/vlm_region_embedding_demo.py.}

Visual localization & navigation

The perception twin of the language hero: where that one grounds a sentence to a place, this grounds a camera frame. localize_by_image embeds the frame with a real CLIP encoder and ranks per-node gallery vectors by cosine similarity; stacking it with the planner closes an LM-Nav-style loop — plan_visual_route, A* to a goal, monotonic progress via VisualRouteFollower.

_{Camera frame → CLIP cosine bars → route progress
(1/5 → 5/5). Every bar and green leg is real
CLIPBackend output on Gazebo Depot frames — not a
mock-up. Regenerate with python examples/record_visual_hero.py.
On the five-place benchmark every drive frame grounds at
precision@1 = 1.00
(report).
Locomotion stays out of repo — ViNT / NoMaD or Nav2 owns how to
move, this owns where on the plan the robot is
(related_work.md). Also see the
per-frame primitive:
python examples/visual_localization_demo.py /
examples/visual_navigation_demo.py.}

Multi-agent coordination

The same scheduler under four ordering strategies. The scenario is intentionally adversarial — a long-haul agent is submitted first, so naive greedy locks every other agent out (1/5 granted). Branch-and- bound and the exhaustive MIS baseline reorder the queue and fit four short-haul agents into disjoint segments (4/5 granted).

_{The Coordinate twin of the visual and language
heroes, in the same three-panel style: the requests (five
agents on one chain, the long-haul listed first), the decision
(agents granted per strategy), and the outcome for the strategy
in focus. greedy / priority → 1/5 (submission order locks the long-haul
in, denying everyone else); bnb / exhaustive → 4/5 (hold the long-haul
back, four shorts tile disjoint segments). Every number is real output
from plan_fleet_with_strategy on an identically-seeded
SharedScheduler. Regenerate via
python examples/record_coordination_hero.py; the cycling
per-strategy graph view (17_coordination_cycle.gif) and the
static 2×2 (16_coordination_strategies.png) come from
examples/coordination_strategies_demo.py.}

Features

Area	What's there	Docs
Map / log conversion	Occupancy grid, door detection, region segmentation, graph compaction, trajectories, CSV / rosbag2 / ROS map_server	conversion.md
Cost composition	`avoid_` / `prefer_` / `block_*`, time-of-day windows, calendar-aware closures, soft preferences (node / edge), static reservations, multi-floor heuristics	cost_composition.md
Multi-agent coordination	`SharedScheduler` + RPC shim (HTTP / custom), `plan_fleet_with_strategy` (7 strategies), branch-and-bound + fairness objectives, exhaustive-MIS upper bound, insertion-based repair, deadline admission, scheduler persistence, synthetic eval suite	coordination.md
Semantic queries + LLM/VLM	`find_nodes` / `nearest_*` / `resolve_goal`, embedding retrieval, CLIP backend, `llm_resolve_goal` + `DialogSession` (multi-turn), mid-traversal describer rewrite, visit-history memory	queries.md
CLI reference	All subcommands and flags	cli.md
Visualization	matplotlib `plot`, interactive pyvis HTML viewer, live-reloading viewer	see below
Schema	YAML v1 graph format + six v1-locked JSON wire schemas (waypoint array, plan / fleet result, conflict explanation, resolve trace, preference metadata)	schema_v1.md · waypoint_schema.md
Protocol conformance	Reusable suites under `semantic_toponav.testing.conformance` for `LLMBackend` / encoder `Backend` / `AlignedRgbSource` / `SchedulerProtocol` / `Transport` / `ConflictPolicy` with failure-mode depth	conformance.md
Language-grounding eval	YAML gold-corpus driver for `resolve_goal` / `llm_resolve_goal` (precision@1, top-k recall, clarification / fp-resolve / abstention rates) + describer-rewrite safety invariants for `llm_describe_path`	eval_grounding.md · sample report
ROS2 integration	`graph_loader` / `waypoint_publisher` / `nav2_demo` nodes	ros2/README.md

Visualization

pip install -e '.[viz]'
semantic-toponav plot examples/indoor_office.yaml \
    --start entrance --goal office_2f \
    --avoid-stairs --prefer-elevator --save route.png

pip install -e '.[viz_web]'
semantic-toponav viewer examples/multi_floor_office.yaml \
    --start entrance --goal exec_office_3f --prefer-elevator \
    --output viewer.html

semantic-toponav live-viewer examples/multi_floor_office.yaml

The web viewer is a fully offline self-contained HTML file — nodes are draggable, hovering surfaces type / cost / property tooltips, and the highlighted path is overlaid in pink. live-viewer adds a file-watch loop so edits to the YAML reload the browser tab.

Foxglove replay

_{Replay of real planner output — semantic topology, robot pose, route, and waypoint stream — rendered headless in open-source Foxglove (Lichtblick) from the shipped MCAP. Open it yourself: drop docs/foxglove/semantic_toponav_demo.mcap into Foxglove Studio — see docs/foxglove/README.md for the panel setup, or scripts/foxglove_hero/ to regenerate this GIF.}

pip install -e '.[foxglove]'
python examples/export_foxglove_mcap.py

Open docs/foxglove/semantic_toponav_demo.mcap in Foxglove Studio. It contains /semantic_toponav/scene as foxglove.SceneUpdate, /tf as foxglove.FrameTransforms, /semantic_toponav/pose as foxglove.PoseInFrame, /semantic_toponav/markers as visualization_msgs/MarkerArray, and semantic route / waypoint / resolve topics from the same planner run shown in the README demo.

Graph schema (v1)

version: 1
metadata: {name: indoor_office, frame_id: map}
nodes:
  - id: entrance
    label: Entrance
    type: entrance
    pose: {x: 0.0, y: 0.0, yaw: 0.0, frame_id: map}
    properties: {}
edges:
  - id: entrance_to_corridor
    source: entrance
    target: corridor_main
    type: traversable
    cost: 1.0
    bidirectional: true
    properties: {}

Node type examples: corridor, room, intersection, elevator, stairs, entrance. Edge type examples: traversable, stairs_up, stairs_down, elevator_connection, restricted, one_way. pose is optional — without it A* degrades to Dijkstra.

For a fluent builder API, see semantic_toponav.graph.GraphBuilder (documented in tutorial.md).

What this project is not

Deliberately out of scope (use existing systems):

Low-level control (MPC / MPPI)
Obstacle avoidance / SLAM / dense occupancy planning
Behavior trees
Head-to-head MAPF solver on gridworld (that's CBS / EECBS / MAPF-LNS2 territory; this layer sits above pure grid MAPF and adds semantic / time / language constraints instead)

The split is where to go (this repo) vs how to move locally (Nav2 / Autoware / your motion executor):

Layer	Responsibility	Owned by
Global semantic-topological planning	where / why / who first	this repository
Local motion execution	how to move locally	Nav2 / MPPI / policy

Status

Feature-complete across the original roadmap and the 25-PR post-MVP arc: synthetic eval suite, branch-and-bound + fairness objectives, HTTP transport, exhaustive MIS baseline, scheduler persistence, public Protocol conformance suites with failure-mode depth, calendar-aware closures, soft preferences (edge + node defaults), mid-traversal LLM rewrites, insertion-based fleet repair, language-grounding eval suite, and v1.0 schema lock across six wire formats. See docs/decisions.md for design notes, docs/experiments.md for the full feature index, and docs/paper_outline.md for the working outline of the paper that organizes the post-MVP arc.

Six public wire formats are v1-locked under schemas/: SemanticWaypointArray (waypoint publisher), PlanWithSchedulerResult + FleetPlanResult (fleet admission), ConflictExplanation (CBS-lite diagnostics), ResolveTrace (language grounding), and the preferences metadata convention. See docs/schema_v1.md for the freeze policy and CHANGELOG.md for the consolidated v1.0 release notes spanning PR #1–#62.

Tests

pytest -q                              # 875 tests, ~20s
ruff check .

License

Apache-2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
.github		.github
docs		docs
examples		examples
ros2		ros2
schemas		schemas
scripts		scripts
semantic_toponav		semantic_toponav
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
STATUS_FOR_ADVICE.md		STATUS_FOR_ADVICE.md
plan.md		plan.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RobotEscapeRoom

What it does

Plan

Coordinate

Resolve

Quick start

Gallery

Language grounding → route

Cost composition

Multi-floor planning

Escape room — every cost function in one self-solving game

Conversion pipeline

VLM region embedding

Visual localization & navigation

Multi-agent coordination

Features

Visualization

Foxglove replay

Graph schema (v1)

What this project is not

Status

Tests

License

About

Uh oh!

Releases 6

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RobotEscapeRoom

What it does

Plan

Coordinate

Resolve

Quick start

Gallery

Language grounding → route

Cost composition

Multi-floor planning

Escape room — every cost function in one self-solving game

Conversion pipeline

VLM region embedding

Visual localization & navigation

Multi-agent coordination

Features

Visualization

Foxglove replay

Graph schema (v1)

What this project is not

Status

Tests

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages