Move model provisioning policies out of the simulator by James-QiuHaoran · Pull Request #311 · Azure/realtimevideogen

James-QiuHaoran · 2026-05-15T23:00:32Z

This PR refactored the code to decouple the model provisioning policies (e.g., greedy, MILP) from the simulator. In addition, in the StreamWise dashboard, the user can auto-deploy a workflow with resource-optimized allocation plan.

Delete 17 shim files from simulator/ that re-exported from streamwise.model_provisioner. Update simulator/__init__.py to add streamwise/ to sys.path so model_provisioner is importable. Update imports in simulator/provisioning.py, multirequests.py, and plot_utils.py to use model_provisioner.* prefixed imports. Update all 19 test files in tests/simulator/ to: - Pass both 'simulator' and 'streamwise' to temp_sys_path - Use model_provisioner.* prefixed imports for moved modules - Fix patch.dict target in test_models.py - Fix inline import in test_hexgen.py Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Move actions, auto_model_allocator, constants, data_loading, evaluator, model_allocator, models, sim_types, sim_types_json, utils, and workflows from streamwise/model_provisioner/ back to simulator/. Only 6 policy files remain in model_provisioner: greedy, helix, hexgen, milp, naive_baseline, and policies. Import changes: - Moved files use bare imports (from sim_types import ...) instead of relative imports (from .sim_types import ...) - Policy files use bare imports for moved modules and keep relative imports for sibling policy modules - simulator/ and streamwise/allocator_bridge.py updated accordingly - All test files updated to match new import paths - Added tests/simulator/conftest.py to set PYTHONPATH for child processes spawned by ProcessPoolExecutor Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

This PR refactors GPU allocation policies (greedy, MILP, HexGen, Helix, naive) out of simulator/ into a new streamwise/model_provisioner/ package, and adds an "Auto Deploy" feature in the StreamWise dashboard that runs the allocator against a user-supplied GPU budget and deploys the resulting plan via pod_manager.

Changes:

New streamwise/model_provisioner/ package containing the policies and allocators (greedy/naive/MILP/HexGen/Helix); imports throughout simulator/tests updated to reference the new location.
New streamwise/allocator_bridge.py that translates allocator Result objects into DeploymentSpecs for pod_manager.add_pod, plus three new endpoints in streamwise.py (/api/auto_deploy, /api/auto_deploy/confirm, /api/auto_deploy/workflows) and corresponding UI in add_pod.html.
Tests added under tests/streamwise/ for the bridge and endpoints; sys.path plumbing added in simulator/__init__.py, simulator/provisioning.py, and tests/simulator/conftest.py so the cross-package imports work in subprocess workers.

Reviewed changes

Copilot reviewed 40 out of 42 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
streamwise/model_provisioner/init.py, policies.py, greedy.py, naive_baseline.py, milp.py, hexgen.py, helix.py	New package containing the relocated allocation policies/allocators with relative imports.
streamwise/allocator_bridge.py	New bridge mapping allocator Results to deployment specs and exposing run_allocator/get_available_* helpers.
streamwise/streamwise.py	Adds three auto-deploy API endpoints.
streamwise/templates/add_pod.html	Adds the Auto Deploy form, results table, and JS to call the new endpoints.
simulator/init.py, simulator/provisioning.py	Adds sys.path/PYTHONPATH plumbing so model_provisioner is importable in worker processes.
simulator/data_loading.py	Switches default `data_dir` to an absolute path computed from `__file__` (path computation appears off by one).
simulator/auto_model_allocator.py, actions.py, model_allocator.py, multirequests.py	Updated import paths to `model_provisioner.*`.
tests/simulator/*.py	Adjusted `temp_sys_path` and import paths to the new package layout.
tests/simulator/conftest.py, tests/streamwise/conftest.py	New conftests adding paths so child processes/lazy imports resolve.
tests/streamwise/test_allocator_bridge.py, test_streamwise_auto_deploy.py	New tests for bridge logic and auto-deploy endpoints.
.gitignore	Ignore `.venv/`.

Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

goiri · 2026-05-15T23:07:12Z

                    });
                });
            }
+            // Auto-Deploy


Maybe we should split the PR into two and have one just for the module and then one for the setting?

github-actions · 2026-05-15T23:50:16Z

Test Results

1 169 tests +25 1 169 ✅ +25 14m 56s ⏱️ -2s
13 suites ± 0 0 💤 ± 0
13 files ± 0 0 ❌ ± 0

Results for commit a0636f8. ± Comparison against base commit c20eb25.

♻️ This comment has been updated with latest results.

kdh0102

Thanks for the PR. Most of the changes look good to me, but currently having some issues with importing the model_provisioner package inside the simulator. Especially, I've been testing the two jupyter notebooks (cost_estimator_*.ipynb), but failing to import model_provisioner properly.
I'm first leaving this review and will do another pass.

kdh0102 · 2026-05-15T23:35:40Z

 from workflows import PODCAST_WORKFLOW

-from policies import STREAMWISE_POLICY
+from model_provisioner.policies import STREAMWISE_POLICY


Cannot import model_provisioner when running inside the cost_estimator_multirequests.ipynb

github-actions · 2026-05-16T00:09:29Z

Lint Results

Check	Status
Python	✅
Shell	✅
YAML	✅
JSON	✅
Markdown	✅
Bicep	✅

github-actions · 2026-05-16T00:12:16Z

Mypy Type Checking

✅ No issues found

Metric	Count
📂 Files	299
❌ Errors	0
⚠️ Warnings	0
📝 Notes	0

Full mypy output

Success: no issues found in 299 source files

github-actions · 2026-05-16T00:28:03Z

Diff Coverage

Diff: origin/main...HEAD, staged and unstaged changes

simulator/init.py (0.0%): Missing lines 7-8,11,14-15
simulator/actions.py (100%)
simulator/auto_model_allocator.py (100%)
simulator/data_loading.py (100%)
simulator/model_allocator.py (100%)
simulator/multirequests.py (100%)
simulator/provisioning.py (100%)
streamwise/allocator_bridge.py (100%)
streamwise/model_provisioner/init.py (100%)
streamwise/model_provisioner/greedy.py (100%)
streamwise/model_provisioner/helix.py (100%)
streamwise/model_provisioner/hexgen.py (100%)
streamwise/model_provisioner/milp.py (100%)
streamwise/model_provisioner/naive_baseline.py (100%)
streamwise/streamwise.py (76.5%): Missing lines 763-765,805-806,821-824,833-835
tests/simulator/test_auto_model_allocator.py (100%)
tests/simulator/test_data_loading.py (100%)
tests/simulator/test_evaluator.py (100%)
tests/simulator/test_greedy.py (100%)
tests/simulator/test_helix.py (100%)
tests/simulator/test_hexgen.py (100%)
tests/simulator/test_milp.py (100%)
tests/simulator/test_models.py (100%)
tests/simulator/test_multirequests_derive.py (100%)
tests/simulator/test_simulator.py (100%)
tests/simulator/test_simulator_actions.py (100%)
tests/simulator/test_simulator_baseline.py (100%)
tests/simulator/test_simulator_energy.py (100%)
tests/simulator/test_simulator_multirequests.py (100%)
tests/simulator/test_simulator_plotutils.py (100%)
tests/simulator/test_simulator_policies.py (100%)
tests/simulator/test_simulator_provisioning.py (100%)
tests/simulator/test_simulator_types.py (100%)
tests/simulator/test_simulator_utils.py (100%)
tests/simulator/test_workflows.py (100%)
tests/streamwise/test_allocator_bridge.py (100%)
tests/streamwise/test_streamwise_auto_deploy.py (100%)
wrapper/run_httpserver.py (0.0%): Missing lines 1269-1270

Summary

Total: 418 lines
Missing: 19 lines
Coverage: 95%

simulator/init.py

Lines 3-16

   3 on top of the model_provisioner allocation policies.
   4 
   5 The allocation policy implementations live in ``streamwise/model_provisioner/``.
   6 """
!  7 import os
!  8 import sys
   9 
  10 # Make model_provisioner importable for simulator modules.
! 11 _STREAMWISE_DIR = os.path.normpath(
  12     os.path.join(os.path.dirname(os.path.abspath(__file__)), "..", "streamwise")
  13 )
! 14 if _STREAMWISE_DIR not in sys.path:
! 15     sys.path.insert(0, _STREAMWISE_DIR)

streamwise/streamwise.py

Lines 759-769

  759         return jsonify(allocator_bridge.deployment_plan_to_json(plan)), HTTPStatus.OK
  760 
  761     except ValueError as ve:
  762         return jsonify({"error": str(ve)}), HTTPStatus.BAD_REQUEST
! 763     except Exception as ex:
! 764         logging.exception("Error in auto_deploy: %s", ex)
! 765         return jsonify({"error": str(ex)}), HTTPStatus.INTERNAL_SERVER_ERROR
  766 
  767 
  768 @route("/api/auto_deploy/confirm", methods=["POST"])
  769 async def api_auto_deploy_confirm() -> QuartReturn:

Lines 801-810

  801 
  802         for spec in specs:
  803             container_name = spec.get("container_name")
  804             if not container_name:
! 805                 errors.append("Spec missing 'container_name'")
! 806                 continue
  807 
  808             try:
  809                 await pod_manager.add_pod(
  810                     container_name=container_name,

Lines 817-828

  817                     namespace=NAMESPACE,
  818                     k8s_cluster=k8s_cluster,
  819                 )
  820                 deployed.append(container_name)
! 821             except Exception as pod_ex:
! 822                 msg = f"Failed to deploy '{container_name}': {pod_ex}"
! 823                 logging.error(msg)
! 824                 errors.append(msg)
  825 
  826         status = HTTPStatus.OK if not errors else HTTPStatus.MULTI_STATUS
  827         return jsonify({
  828             "deployed": deployed,

Lines 829-839

  829             "errors": errors,
  830             "message": f"Deployed {len(deployed)}/{len(specs)} containers.",
  831         }), status
  832 
! 833     except Exception as ex:
! 834         logging.exception("Error in auto_deploy/confirm: %s", ex)
! 835         return jsonify({"error": str(ex)}), HTTPStatus.INTERNAL_SERVER_ERROR
  836 
  837 
  838 @route("/api/auto_deploy/workflows", methods=["GET"])
  839 async def api_auto_deploy_workflows() -> QuartReturn:

wrapper/run_httpserver.py

Lines 1265-1274

  1265     last_ping_time = time.time()
  1266 
  1267     try:
  1268         payload_bytes = await asyncio.to_thread(pickle.dumps, gen_task)
! 1269         payload_buffer = bytearray(payload_bytes)
! 1270         payload_tensor = torch.frombuffer(payload_buffer, dtype=torch.uint8).to("cuda")
  1271         payload_size = torch.tensor([payload_tensor.numel()], dtype=torch.int64, device="cuda")
  1272 
  1273         if payload_size.item() > MAX_PAYLOAD_BYTES:
  1274             logging.error(f"[{rank}] Payload too large: {payload_size.item()} bytes.")

github-actions · 2026-05-16T00:28:23Z

Package	Line Rate	Health
.	80%	✔
apps	67%	✔
apps.streamanimate	89%	✔
apps.streamcast	91%	✔
apps.streamchat	88%	✔
apps.streamdub	64%	✔
apps.streamedit	79%	✔
apps.streamlecture	94%	✔
apps.streammovie	89%	✔
apps.streampersona	64%	✔
apps.streamshort	71%	✔
simulator	90%	✔
streamwise	71%	✔
streamwise.model_provisioner	83%	✔
tests	99%	✔
tests.simulator	100%	✔
tests.streamwise	100%	✔
tests.streamwise_app	99%	✔
wrapper	67%	✔
wrapper.4kagent	94%	✔
wrapper.bagel	37%	❌
wrapper.cogview	45%	➖
wrapper.fantasytalking	55%	➖
wrapper.flux	74%	✔
wrapper.flux2	100%	✔
wrapper.flux2klein	99%	✔
wrapper.fluxkontext	100%	✔
wrapper.fluxkrea	99%	✔
wrapper.fluxupscaler	68%	✔
wrapper.hidream	88%	✔
wrapper.hunyuanavatar	74%	✔
wrapper.hunyuanframepack	52%	➖
wrapper.hunyuanframepackf1	59%	➖
wrapper.hunyuanframepackvae	63%	✔
wrapper.hunyuanimage	84%	✔
wrapper.imageresize	100%	✔
wrapper.januspro	91%	✔
wrapper.kokoro	87%	✔
wrapper.llamagen	65%	✔
wrapper.mock	86%	✔
wrapper.podcasttranscript	45%	➖
wrapper.qwenimage	89%	✔
wrapper.qwenimageedit	88%	✔
wrapper.realesrgan	77%	✔
wrapper.slidetranscript	59%	➖
wrapper.vibevoice	31%	❌
wrapper.vibevoice.schedule	50%	➖
wrapper.wan	34%	❌
wrapper.wan22	75%	✔
wrapper.xtts	75%	✔
wrapper.yolo	69%	✔
Summary	79% (27075 / 34464)	✔

James-QiuHaoran · 2026-05-16T00:35:03Z

Superseded by two focused PRs:

Refactor: move model provisioning policies to streamwise/model_provisioner/ #312 — Refactor: move model provisioning policies to streamwise/model_provisioner/
Add auto-deploy feature to StreamWise dashboard #313 — Add auto-deploy feature to StreamWise dashboard (depends on Refactor: move model provisioning policies to streamwise/model_provisioner/ #312)

Haoran Qiu and others added 3 commits May 15, 2026 13:11

Update tests

bccbbd2

James-QiuHaoran requested a review from Copilot May 15, 2026 23:01

James-QiuHaoran self-assigned this May 15, 2026

github-code-quality Bot found potential problems May 15, 2026

View reviewed changes

Comment thread tests/streamwise/test_streamwise_auto_deploy.py Fixed

Copilot started reviewing on behalf of James-QiuHaoran May 15, 2026 23:03 View session

Copilot AI reviewed May 15, 2026

View reviewed changes

Comment thread simulator/data_loading.py Outdated

Fix unused import

4227e1f

Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com>

github-code-quality Bot found potential problems May 15, 2026

View reviewed changes

Comment thread tests/streamwise/test_streamwise_auto_deploy.py Fixed

Fix mypy issues

10313cb

goiri reviewed May 15, 2026

View reviewed changes

Haoran Qiu added 2 commits May 15, 2026 16:18

Fix directory path

0fbdf67

Fix lint

d968027

github-code-quality Bot found potential problems May 15, 2026

View reviewed changes

Comment thread tests/streamwise/test_streamwise_auto_deploy.py Dismissed

kdh0102 suggested changes May 15, 2026

View reviewed changes

Simplify imports

3483606

github-code-quality Bot found potential problems May 16, 2026

View reviewed changes

Comment thread streamwise/allocator_bridge.py Dismissed

Use Path

a0636f8

James-QiuHaoran closed this May 16, 2026

Conversation

James-QiuHaoran commented May 15, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

goiri May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Results

Uh oh!

kdh0102 left a comment

Choose a reason for hiding this comment

Uh oh!

kdh0102 May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented May 16, 2026

Lint Results

Uh oh!

github-actions Bot commented May 16, 2026

Mypy Type Checking

Uh oh!

github-actions Bot commented May 16, 2026

Diff Coverage

Diff: origin/main...HEAD, staged and unstaged changes

Summary

simulator/init.py

streamwise/streamwise.py

wrapper/run_httpserver.py

Uh oh!

github-actions Bot commented May 16, 2026

Uh oh!

James-QiuHaoran commented May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

github-actions Bot commented May 15, 2026 •

edited

Loading