Add `--test` HF CLI path for 2-layer random model configs, `olive run` and ModelBuilder support, Qwen how-to/layer-types fix, and merge conflict resolution by Copilot · Pull Request #2459 · microsoft/Olive

Copilot · 2026-05-11T08:18:13Z

Describe your changes

Adds a CLI test path for Hugging Face models so generated config.json can carry a lightweight random-model definition instead of always using pretrained weights. When --test is passed, Olive now preserves the source architecture, instantiates a random model with 2 hidden layers, and can persist that test model for reuse.

CLI/config support
- Added --test to HF-backed CLI commands using shared input-model options.
- --test now accepts an optional folder path where the generated test model is saved for reuse.
- Emitted input_model.test_model_config into generated run configs, and now also emits input_model.test_model_path when a save folder is provided or derived.
```
{
  "input_model": {
    "type": "HfModel",
    "model_path": "model-id",
    "test_model_config": { "hidden_layers": 2 },
    "test_model_path": "path/to/test_model"
  }
}
```
- When --test is used without an explicit folder, Olive uses <output_path>/test_model.
- If --test is used in a context where no output path is available, Olive now fails clearly instead of silently skipping persistence.
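
The folder rules above can be sketched as follows. This is a minimal illustration, not Olive's actual implementation: `build_parser`, `resolve_test_model_path`, and `build_input_model` are hypothetical names, and modelling the optional folder with argparse `nargs="?"` is an assumption. The `-m`, `-o`, and `--test` options and the emitted keys mirror the examples in this description.

```python
import argparse
from pathlib import Path


def build_parser():
    # Hypothetical parser; only the options shown in this PR description.
    parser = argparse.ArgumentParser(prog="sketch")
    parser.add_argument("-m", dest="model", required=True)
    parser.add_argument("-o", dest="output_path", default=None)
    # nargs="?" lets --test appear bare (value is const) or with a folder.
    parser.add_argument("--test", nargs="?", const=True, default=False)
    return parser


def resolve_test_model_path(test_arg, output_path):
    """Return the folder for the saved test model, or None if --test is off."""
    if test_arg is False or test_arg is None:
        return None
    if isinstance(test_arg, str):
        # Explicit folder wins.
        return Path(test_arg).as_posix()
    if output_path is None:
        # Fail clearly instead of silently skipping persistence.
        raise ValueError("--test needs an explicit folder when no output path is available")
    return (Path(output_path) / "test_model").as_posix()


def build_input_model(args):
    input_model = {"type": "HfModel", "model_path": args.model}
    test_path = resolve_test_model_path(args.test, args.output_path)
    if test_path is not None:
        input_model["test_model_config"] = {"hidden_layers": 2}
        input_model["test_model_path"] = test_path
    return input_model
```

Calling `build_input_model(build_parser().parse_args(["-m", "model-id", "--test", "-o", "out"]))` yields an `input_model` whose `test_model_path` is derived from the output path.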

olive run support
- Extended olive run --test so it can apply the same lightweight HF test-model override to an existing Hugging Face input_model already present in a workflow config.
- When olive run --test is used without an explicit folder, it derives the saved test-model location from the effective workflow output path.
- olive run --test now fails clearly when the workflow config does not contain a Hugging Face input_model.
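
A minimal sketch of the override described above, operating on a workflow config loaded as a plain dict. `apply_test_override` is a hypothetical helper, and two details are assumptions rather than confirmed behavior: that the effective output path can be read from a top-level `output_dir` key, and that the derived folder is `<output>/test_model` as in the CLI case.

```python
import copy
from pathlib import Path


def apply_test_override(workflow_config, test_folder=None, output_path=None):
    """Return a copy of the config with the HF test-model override applied."""
    config = copy.deepcopy(workflow_config)
    input_model = config.get("input_model") or {}
    if str(input_model.get("type", "")).lower() != "hfmodel":
        raise ValueError("olive run --test requires a Hugging Face input_model in the workflow config")

    if test_folder is None:
        # Assumption: a command-line output path takes precedence over the config.
        effective_output = output_path or config.get("output_dir")
        if effective_output is None:
            raise ValueError("--test needs an explicit folder when no output path is available")
        test_folder = Path(effective_output) / "test_model"

    input_model["test_model_config"] = {"hidden_layers": 2}
    input_model["test_model_path"] = Path(test_folder).as_posix()
    return config
```

Working on a deep copy keeps the caller's config untouched, so the same workflow file can be rerun later without `--test`.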

HF model loading
- Extended HF config loading to accept test_model_config.
- Derived a lightweight config from the original model config by overriding the architecture-specific hidden-layer field (num_hidden_layers, num_layers, n_layer, n_layers).
- For Qwen-style configs that carry per-layer metadata, Olive now also trims layer_types to match the reduced hidden-layer count so the saved reduced config remains valid when reloaded.
- Switched test-model loading to instantiate from config (from_config) so the model is random-initialized rather than loaded from pretrained weights.
- Updated the test-model path to fail fast if the selected model class cannot be instantiated from the reduced config, instead of falling back to another candidate class that could produce a misleading larger model.
- Refactored the from_config loading path to avoid nested try/except handling by only passing trust_remote_code when the model class signature supports it.
- Added persistence/reuse support for test models: if test_model_path already contains a saved HF model, Olive loads that model instead of recreating it; otherwise it creates the reduced model once and saves it there.
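
The config reduction and the conditional `trust_remote_code` handling can be illustrated with plain dicts and `inspect.signature`. This is a sketch under stated assumptions, not the code in `olive/common/hf/utils.py`: `reduce_layer_config` and `from_config_kwargs` are hypothetical names, the config is modelled as a dict rather than a Transformers config object, and treating a `**kwargs` catch-all as "supports `trust_remote_code`" is a guess at the intended rule.

```python
import inspect

# Hidden-layer field names listed in this PR description.
LAYER_FIELDS = ("num_hidden_layers", "num_layers", "n_layer", "n_layers")


def reduce_layer_config(config, hidden_layers=2):
    """Return a copy of `config` with the layer count overridden."""
    reduced = dict(config)
    field = next((name for name in LAYER_FIELDS if name in reduced), None)
    if field is None:
        raise ValueError("config has no recognized hidden-layer field")
    reduced[field] = hidden_layers
    # Qwen-style configs carry one entry per layer; trim it to match.
    layer_types = reduced.get("layer_types")
    if layer_types is not None:
        reduced["layer_types"] = list(layer_types[:hidden_layers])
    return reduced


def from_config_kwargs(from_config, trust_remote_code=None):
    """Build kwargs for `from_config`, adding trust_remote_code only if accepted."""
    if trust_remote_code is None:
        return {}
    params = inspect.signature(from_config).parameters
    accepts = "trust_remote_code" in params or any(
        p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()
    )
    return {"trust_remote_code": trust_remote_code} if accepts else {}
```

Checking the signature up front replaces a try/except retry around the call, which is the nesting the refactor above removes.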

ModelBuilder support
- Updated the ModelBuilder pass so --test workflows export from the saved reduced Hugging Face test checkpoint instead of still using the original full checkpoint.
- When test_model_config is present, ModelBuilder now materializes or reuses test_model_path before export and passes that saved checkpoint to the builder.
- This fixes the smoke-test flow so it avoids the original full-model dtype path instead of only deferring the same failure.
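
The "materialize or reuse" step can be sketched as a small helper that runs before export. `materialize_test_model` and the `create_and_save` callback are hypothetical, and using the presence of `config.json` to detect an already saved HF model is an assumption about how the check might work.

```python
from pathlib import Path


def materialize_test_model(test_model_path, create_and_save):
    """Return (folder, reused) for the reduced test checkpoint.

    `create_and_save(folder)` is expected to build the reduced random model
    and write it into `folder`; it is only called when nothing is saved yet.
    """
    folder = Path(test_model_path)
    # Assumption: a saved HF model always has a config.json at its root.
    if (folder / "config.json").is_file():
        return folder, True
    folder.mkdir(parents=True, exist_ok=True)
    create_and_save(folder)
    return folder, False
```

The returned folder, not the original model id, is what would be handed to the builder in a `--test` workflow.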

IO config / dummy input propagation
- Threaded test_model_config through HF IO-config and dummy-input generation so the reduced-layer model shape metadata stays consistent with the generated test model.
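
To see why the layer count has to reach IO-config generation, consider key/value cache inputs, which scale with the number of layers. The sketch below is illustrative only: `kv_cache_input_names` is a hypothetical helper, and the `past_key_values.{i}.key` naming is a common ONNX export convention assumed here, not something this PR description specifies.

```python
def kv_cache_input_names(num_hidden_layers):
    """List one key and one value cache input per hidden layer."""
    names = []
    for layer in range(num_hidden_layers):
        names.append(f"past_key_values.{layer}.key")
        names.append(f"past_key_values.{layer}.value")
    return names
```

With a hypothetical 28-layer source config this produces 56 names, while the 2-layer test model only has 4, so IO metadata built from the original config would not match the reduced model.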

Documentation
- Added a new how-to page showing how to convert a Qwen LLM with a quick --test smoke check first, then rerun the full conversion.
- Linked the new how-to page from the docs How Tos index.
- Updated the how-to flow to use olive optimize --dry_run followed by olive run --test, matching the new CLI support.
- Clarified the smoke-test commands so the generated ONNX artifacts are written to a dedicated output folder and are easy to find.
- Updated the example model from Phi to Qwen/Qwen3-0.6B and renamed the how-to page and index entry to match.

Merge conflict resolution
- Merged origin/main into this PR branch and resolved the conflict in test/passes/onnx/test_model_builder.py.
- Preserved both the upstream ModelBuilder fallback/multi-file output test coverage and this PR's saved test-model-path coverage.
- Updated the upstream mock-based ModelBuilder tests to define the new Hugging Face test-model attributes used by this PR (test_model_config and test_model_path).

Targeted coverage
- Added focused tests for:
  - CLI config generation with --test
  - input-model config serialization of test_model_config
  - input-model config serialization of test_model_path
  - validation when --test needs an explicit folder
  - olive run --test overriding an existing HF input_model from a workflow config
  - validation when olive run --test is used on a non-HF workflow config
  - HF random-model instantiation for multiple config naming conventions
  - fail-fast behavior when test-model instantiation cannot use the expected model class
  - conditional trust_remote_code handling for supported, omitted, and unsupported from_config signatures
  - saving and reusing a persisted HF test model
  - ModelBuilder exporting from the saved reduced test-model checkpoint when test_model_config is active
  - a CLI smoke-flow test that follows the documented olive optimize --dry_run then olive run --test commands with hf-internal-testing/tiny-random-LlamaForCausalLM and verifies an ONNX artifact is produced
  - a Qwen3 regression test that verifies reduced test-model configs keep layer_types aligned with the reduced hidden-layer count and can be reloaded successfully
  - conflict-resolved ModelBuilder tests covering saved test-model reuse, single-file annotation fallback, and multi-file output component naming

Examples:

```
olive optimize \
  -m Qwen/Qwen3-0.6B \
  --test out/test_model \
  --dry_run \
  -o out

olive run \
  --config out/config.json \
  --test out/test_model \
  --output_path out/qwen-smoke-run
```

Checklist before requesting a review

- Add unit tests for this change.
- Make sure all tests can pass.
- Update documents if necessary.
- Lint and apply fixes to your code by running lintrunner -a
- Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Release notes: Added a --test option for Hugging Face CLI workflows. The option accepts an optional folder path, writes a lightweight 2-layer random-model config, and saves and reuses the generated HF test model from that folder. Test-model loading fails fast if the reduced model cannot be instantiated from the expected model class, passes trust_remote_code only when the target from_config supports it, and keeps Qwen-style layer_types metadata aligned with the reduced layer count so saved reduced configs reload cleanly. Also added olive run --test support for workflow configs with Hugging Face input models, updated ModelBuilder to export from the saved reduced test checkpoint in --test flows, and added a Qwen3 0.6B how-to page that runs a quick smoke test, with an explicit output path for the generated ONNX files, before the full conversion.

(Optional) Issue link

xadupre · 2026-05-25T15:42:18Z

/azp run

azure-pipelines · 2026-05-25T15:42:30Z

Azure Pipelines successfully started running 1 pipeline(s).

xadupre · 2026-05-25T16:19:39Z

/azp run

azure-pipelines · 2026-05-25T16:19:51Z

Azure Pipelines successfully started running 1 pipeline(s).

xadupre · 2026-05-26T08:55:10Z

/azp run

azure-pipelines · 2026-05-26T08:55:24Z

Azure Pipelines successfully started running 1 pipeline(s).

Copilot

Pull request overview

Copilot reviewed 16 out of 16 changed files in this pull request and generated 4 comments.

xadupre · 2026-05-26T10:59:10Z

/azp run

azure-pipelines · 2026-05-26T10:59:24Z

Azure Pipelines successfully started running 1 pipeline(s).

xadupre · 2026-05-26T14:04:37Z

/azp run

azure-pipelines · 2026-05-26T14:04:51Z

Azure Pipelines successfully started running 1 pipeline(s).

xadupre · 2026-05-27T10:07:31Z

/azp run

azure-pipelines · 2026-05-27T10:07:48Z

Azure Pipelines successfully started running 1 pipeline(s).

Initial plan

ef42e47

Copilot AI assigned Copilot and xadupre May 11, 2026

Copilot AI linked an issue May 11, 2026 that may be closed by this pull request

[FR]: add a section in config.json which defines a random model following the same architecture with 2 hidden layers #2458

Open

7 tasks

Copilot started work on behalf of xadupre May 11, 2026 08:18

Copilot AI and others added 3 commits May 11, 2026 08:33

feat: add CLI test-model config for HF inputs

00571f0

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: broaden HF test-model coverage

485dfbf

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

chore: polish test model config handling

a6fa34a

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add section in config.json for random model with 2 hidden layers~~ Add --test HF CLI path for 2-layer random model configs May 11, 2026

Copilot AI requested a review from xadupre May 11, 2026 08:39

Copilot finished work on behalf of xadupre May 11, 2026 08:39

xadupre reviewed May 11, 2026

Comment thread olive/common/hf/utils.py Outdated

Copilot started work on behalf of xadupre May 11, 2026 09:13

fix: fail fast for HF test model loading

273850c

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/d4221aaf-b4a3-4bed-85fe-30a12dcdfd6e Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~Add --test HF CLI path for 2-layer random model configs~~ Add --test HF CLI path for 2-layer random model configs with fail-fast loading May 11, 2026

Copilot finished work on behalf of xadupre May 11, 2026 09:22

Copilot AI requested a review from xadupre May 11, 2026 09:22

Copilot started work on behalf of xadupre May 11, 2026 09:28

Copilot AI and others added 2 commits May 11, 2026 09:35

refactor: remove nested try from HF test loading

318fcbe

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/57fd060e-5678-4015-b277-c7d14ec52145 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: cover trust_remote_code helper behavior

40b0740

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/57fd060e-5678-4015-b277-c7d14ec52145 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot finished work on behalf of xadupre May 11, 2026 09:39

xadupre reviewed May 11, 2026

Comment thread olive/cli/base.py

Copilot started work on behalf of xadupre May 11, 2026 10:47

Copilot AI and others added 3 commits May 11, 2026 10:56

feat: persist reusable HF test model path

386ff01

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

fix: tighten HF test model path handling

09fac8c

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

refactor: simplify test model path handling

09df0a7

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~Add --test HF CLI path for 2-layer random model configs with fail-fast loading~~ Add --test HF CLI path for 2-layer random model configs with reusable saved test models May 11, 2026

Copilot finished work on behalf of xadupre May 11, 2026 11:03

Copilot AI requested a review from xadupre May 11, 2026 11:03

github-advanced-security AI found potential problems May 11, 2026

Copilot AI requested a review from xadupre May 25, 2026 14:58

rename

acd8c8a

rename

1239db7

xadupre reviewed May 25, 2026

Comment thread docs/source/how-to/cli/cli-fast-test.md

Protect test output reuse

03d7528

github-advanced-security AI found potential problems May 25, 2026

xadupre added 2 commits May 26, 2026 10:19

fix remaining unit tests

9d5f85b

fix lint issues

70625b9

github-advanced-security AI found potential problems May 26, 2026

Comment thread olive/common/hf/utils.py Fixed

Potential fix for pull request finding 'CodeQL / Wrong number of arguments in a call'

6f876df

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

Copilot AI reviewed May 26, 2026

Comment thread olive/cli/run.py Outdated

Comment thread olive/common/hf/utils.py Outdated

Comment thread test/cli/test_cli_test_model_smoke.py Outdated

Comment thread olive/cli/base.py

xadupre and others added 4 commits May 26, 2026 12:28

Potential fix for pull request finding

2cb2f41

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Relax smoke test output file assertions

9a73e09

Fix --test dry-run output marking

4408a05

Potential fix for pull request finding

e0e15ee

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

xadupre added 2 commits May 26, 2026 16:03

comment

8040341

Merge branch 'copilot/fr-add-model-to-config-json' of https://github.com/microsoft/Olive into copilot/fr-add-model-to-config-json

f16bc10

shaahji approved these changes May 26, 2026

Comment thread .github/workflows/test-model-fast.yml
