Add `--test` HF CLI path for 2-layer random model configs, `olive run` and ModelBuilder support, Qwen how-to/layer-types fix, and merge conflict resolution by Copilot · Pull Request #2459 · microsoft/Olive

Copilot · 2026-05-11T08:18:13Z

Describe your changes

Adds a CLI test path for Hugging Face models so generated config.json can carry a lightweight random-model definition instead of always using pretrained weights. When --test is passed, Olive now preserves the source architecture, instantiates a random model with 2 hidden layers, and can persist that test model for reuse.

CLI/config support
- Added --test to HF-backed CLI commands using shared input-model options.
- --test now accepts an optional folder path where the generated test model is saved for reuse.
- Emitted input_model.test_model_config into generated run configs, and now also emits input_model.test_model_path when a save folder is provided or derived.
```
{
  "input_model": {
    "type": "HfModel",
    "model_path": "model-id",
    "test_model_config": { "hidden_layers": 2 },
    "test_model_path": "path/to/test_model"
  }
}
```
- When --test is used without an explicit folder, Olive uses <output_path>/test_model.
- If --test is used in a context where no output path is available, Olive now fails clearly instead of silently skipping persistence.
olive run support
- Extended olive run --test so it can apply the same lightweight HF test-model override to an existing Hugging Face input_model already present in a workflow config.
- When olive run --test is used without an explicit folder, it derives the saved test-model location from the effective workflow output path.
- olive run --test now fails clearly when the workflow config does not contain a Hugging Face input_model.
HF model loading
- Extended HF config loading to accept test_model_config.
- Derived a lightweight config from the original model config by overriding the architecture-specific hidden-layer field (num_hidden_layers, num_layers, n_layer, n_layers).
- For Qwen-style configs that carry per-layer metadata, Olive now also trims layer_types to match the reduced hidden-layer count so the saved reduced config remains valid when reloaded.
- Switched test-model loading to instantiate from config (from_config) so the model is random-initialized rather than loaded from pretrained weights.
- Updated the test-model path to fail fast if the selected model class cannot be instantiated from the reduced config, instead of falling back to another candidate class that could produce a misleading larger model.
- Refactored the from_config loading path to avoid nested try/except handling by only passing trust_remote_code when the model class signature supports it.
- Added persistence/reuse support for test models: if test_model_path already contains a saved HF model, Olive loads that model instead of recreating it; otherwise it creates the reduced model once and saves it there.
ModelBuilder support
- Updated the ModelBuilder pass so --test workflows export from the saved reduced Hugging Face test checkpoint instead of still using the original full checkpoint.
- When test_model_config is present, ModelBuilder now materializes or reuses test_model_path before export and passes that saved checkpoint to the builder.
- This fixes the smoke-test flow so it avoids the original full-model dtype path instead of only deferring the same failure.
IO config / dummy input propagation
- Threaded test_model_config through HF IO-config and dummy-input generation so the reduced-layer model shape metadata stays consistent with the generated test model.
Documentation
- Added a new how-to page showing how to convert a Qwen LLM with a quick --test smoke check first, then rerun the full conversion.
- Linked the new how-to page from the docs How Tos index.
- Updated the how-to flow to use olive optimize --dry_run followed by olive run --test, matching the new CLI support.
- Clarified the smoke-test commands so the generated ONNX artifacts are written to a dedicated output folder and are easy to find.
- Updated the example model from Phi to Qwen/Qwen3-0.6B and renamed the how-to page and index entry to match.
Merge conflict resolution
- Merged origin/main into this PR branch and resolved the conflict in test/passes/onnx/test_model_builder.py.
- Preserved both the upstream ModelBuilder fallback/multi-file output test coverage and this PR's saved test-model-path coverage.
- Updated the upstream mock-based ModelBuilder tests to define the new Hugging Face test-model attributes used by this PR (test_model_config and test_model_path).
Targeted coverage
- Added focused tests for:
  - CLI config generation with --test
  - input-model config serialization of test_model_config
  - input-model config serialization of test_model_path
  - validation when --test needs an explicit folder
  - olive run --test overriding an existing HF input_model from a workflow config
  - validation when olive run --test is used on a non-HF workflow config
  - HF random-model instantiation for multiple config naming conventions
  - fail-fast behavior when test-model instantiation cannot use the expected model class
  - conditional trust_remote_code handling for supported, omitted, and unsupported from_config signatures
  - saving and reusing a persisted HF test model
  - ModelBuilder exporting from the saved reduced test-model checkpoint when test_model_config is active
  - a CLI smoke-flow test that follows the documented olive optimize --dry_run then olive run --test commands with hf-internal-testing/tiny-random-LlamaForCausalLM and verifies an ONNX artifact is produced
  - a Qwen3 regression test that verifies reduced test-model configs keep layer_types aligned with the reduced hidden-layer count and can be reloaded successfully
  - conflict-resolved ModelBuilder tests covering saved test-model reuse, single-file annotation fallback, and multi-file output component naming

Examples:

olive optimize \
  -m Qwen/Qwen3-0.6B \
  --test out/test_model \
  --dry_run \
  -o out

olive run \
  --config out/config.json \
  --test out/test_model \
  --output_path out/qwen-smoke-run

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Release notes: Added a --test option for Hugging Face CLI workflows that can take a folder path, writes a lightweight 2-layer random-model config, saves and reuses the generated HF test model from that folder, fails fast if the reduced test model cannot be instantiated from the expected model class, only passes trust_remote_code when the target from_config supports it, and now keeps Qwen-style layer_types metadata aligned with reduced test-model layer counts so saved reduced configs reload cleanly. Also added olive run --test support for workflow configs with Hugging Face input models, updated ModelBuilder to export from the saved reduced test checkpoint for --test flows, and added a Qwen3 0.6B how-to page for running a quick smoke test before the full conversion with an explicit output path for the generated ONNX files.

(Optional) Issue link

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/d4221aaf-b4a3-4bed-85fe-30a12dcdfd6e Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/57fd060e-5678-4015-b277-c7d14ec52145 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot · 2026-05-11T15:32:13Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

huggingface.co
- Triggering command: /usr/bin/python python - (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/5bd5199d-42fa-4109-94a3-c7995abe72a2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot · 2026-05-11T17:00:02Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

huggingface.co
- Triggering command: /home/REDACTED/work/_temp/ghcca-node/node/bin/node /home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js (dns block)
- Triggering command: /usr/bin/python python - (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

xadupre · 2026-05-12T09:43:53Z

@copilot resolve the merge conflicts in this pull request

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/2f777346-5b6a-423f-89fd-f40de41d2b70 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot · 2026-05-12T09:58:30Z

@copilot resolve the merge conflicts in this pull request

Resolved in 5daba5d. I merged origin/main, kept both the upstream ModelBuilder test additions and this PR's saved-test-model coverage, and finalized it as a true two-parent merge commit.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

huggingface.co
- Triggering command: /usr/bin/python python -m pytest test/passes/onnx/test_model_builder.py (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Copilot

Pull request overview

This PR adds a --test mode for Hugging Face-backed Olive CLI workflows and olive run, enabling generation and reuse of a lightweight random-initialized 2-layer “test model” (including persistence via test_model_path) to support fast smoke checks before running full conversions. It also threads test_model_config through HF config/IO/dummy-input generation, updates ModelBuilder to export from the saved reduced checkpoint when present, and adds a Qwen-focused how-to plus targeted tests.

Changes:

Add CLI + config support for --test to emit input_model.test_model_config and input_model.test_model_path, including olive run --test override behavior.
Extend HF loading utilities to apply reduced-layer config overrides, instantiate from from_config, and optionally persist/reuse a saved test model directory.
Update ModelBuilder to use the saved test-model checkpoint when test_model_config is active; add tests and a new Qwen smoke-test how-to doc.

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
test/passes/onnx/test_model_builder.py	Adds coverage ensuring ModelBuilder uses a saved `test_model_path` when `test_model_config` is set.
test/common/test_hf.py	Adds unit tests for reduced-config application, trust_remote_code handling, persistence/reuse, and fail-fast behavior.
test/cli/test_cli.py	Adds CLI workflow tests for `olive run --test` overrides and optimize→run smoke flow.
test/cli/test_base.py	Adds tests for `get_input_model_config` emitting `test_model_config`/`test_model_path` and validation when output path isn’t available.
olive/passes/onnx/model_builder.py	Switches ModelBuilder input to `test_model_path` when `test_model_config` is present (materializing if needed).
olive/model/handler/mixin/hf.py	Threads `test_model_config` into HF config/IO/dummy-input helpers.
olive/model/handler/hf.py	Adds `test_model_path` resource + `test_model_config` JSON config plumbing and passes them into HF loader.
olive/common/hf/utils.py	Implements reduced-layer config application + random init via `from_config`, plus optional persistence/reuse.
olive/common/hf/model_io.py	Adds `test_model_config` parameter propagation into config-based IO/dummy-input generation.
olive/cli/run.py	Enables `olive run --test` to inject HF test-model settings into an existing HF `input_model` in a run config.
olive/cli/base.py	Adds shared helper and CLI option wiring for `--test` on HF input model options.
docs/source/how-to/index.md	Links the new Qwen `--test` smoke-check how-to from the How Tos index.
docs/source/how-to/cli/cli-convert-qwen-test.md	New how-to describing a Qwen smoke check using `olive optimize --dry_run` followed by `olive run --test`.

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Initial plan

ef42e47

Copilot AI assigned Copilot and xadupre May 11, 2026

Copilot AI linked an issue May 11, 2026 that may be closed by this pull request

[FR]: add a section in config.json which defines a random model following the same architecture with 2 hidden layers #2458

Open

7 tasks

Copilot started work on behalf of xadupre May 11, 2026 08:18 View session

Copilot AI and others added 3 commits May 11, 2026 08:33

feat: add CLI test-model config for HF inputs

00571f0

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: broaden HF test-model coverage

485dfbf

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

chore: polish test model config handling

a6fa34a

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/e2149604-9db1-438e-a31f-01a53886093d Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add section in config.json for random model with 2 hidden layers~~ Add --test HF CLI path for 2-layer random model configs May 11, 2026

Copilot AI requested a review from xadupre May 11, 2026 08:39

Copilot finished work on behalf of xadupre May 11, 2026 08:39

xadupre reviewed May 11, 2026

View reviewed changes

Comment thread olive/common/hf/utils.py Outdated

Copilot started work on behalf of xadupre May 11, 2026 09:13 View session

fix: fail fast for HF test model loading

273850c

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/d4221aaf-b4a3-4bed-85fe-30a12dcdfd6e Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~Add --test HF CLI path for 2-layer random model configs~~ Add --test HF CLI path for 2-layer random model configs with fail-fast loading May 11, 2026

Copilot finished work on behalf of xadupre May 11, 2026 09:22

Copilot AI requested a review from xadupre May 11, 2026 09:22

Copilot started work on behalf of xadupre May 11, 2026 09:28 View session

Copilot AI and others added 2 commits May 11, 2026 09:35

refactor: remove nested try from HF test loading

318fcbe

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/57fd060e-5678-4015-b277-c7d14ec52145 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: cover trust_remote_code helper behavior

40b0740

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/57fd060e-5678-4015-b277-c7d14ec52145 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot finished work on behalf of xadupre May 11, 2026 09:39

xadupre reviewed May 11, 2026

View reviewed changes

Comment thread olive/cli/base.py

Copilot started work on behalf of xadupre May 11, 2026 10:47 View session

Copilot AI and others added 3 commits May 11, 2026 10:56

feat: persist reusable HF test model path

386ff01

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

fix: tighten HF test model path handling

09fac8c

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

refactor: simplify test model path handling

09df0a7

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/cf4d7fdc-1de9-4098-adc3-3bedd436d7d0 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~Add --test HF CLI path for 2-layer random model configs with fail-fast loading~~ Add --test HF CLI path for 2-layer random model configs with reusable saved test models May 11, 2026

Copilot finished work on behalf of xadupre May 11, 2026 11:03

Copilot AI requested a review from xadupre May 11, 2026 11:03

github-advanced-security AI found potential problems May 11, 2026

View reviewed changes

Copilot AI and others added 7 commits May 11, 2026 15:17

test: polish documented smoke flow test

d0f868f

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: rename smoke flow cli test

a408b63

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: refine smoke flow workflow stubs

e272c2a

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: tidy smoke flow helper names

c901b63

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: clarify smoke flow mocks

36410cd

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: polish documented smoke flow test naming

e16cb82

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

test: lift smoke flow imports and mock defaults

7507604

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/c421adc3-0615-4d10-bf15-b21d632d70b2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot finished work on behalf of xadupre May 11, 2026 15:32

Copilot AI requested a review from xadupre May 11, 2026 15:32

xadupre reviewed May 11, 2026

View reviewed changes

Comment thread docs/source/how-to/cli/cli-convert-qwen-test.md

Copilot started work on behalf of xadupre May 11, 2026 16:43 View session

fix: keep qwen test layer types in sync

ac7840f

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/5bd5199d-42fa-4109-94a3-c7995abe72a2 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Copilot AI changed the title ~~Add --test HF CLI path for 2-layer random model configs, olive run and ModelBuilder support, and Qwen how-to~~ Add --test HF CLI path for 2-layer random model configs, olive run and ModelBuilder support, and Qwen how-to/layer-types fix May 11, 2026

Copilot finished work on behalf of xadupre May 11, 2026 17:00

Copilot AI requested a review from xadupre May 11, 2026 17:00

Copilot started work on behalf of xadupre May 12, 2026 09:44 View session

Copilot AI and others added 2 commits May 12, 2026 09:54

Merge origin/main and resolve model builder test conflict

f165dda

Agent-Logs-Url: https://github.com/microsoft/Olive/sessions/2f777346-5b6a-423f-89fd-f40de41d2b70 Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

Merge origin/main into copilot/fr-add-model-to-config-json

5daba5d

Co-authored-by: xadupre <22452781+xadupre@users.noreply.github.com>

xadupre marked this pull request as ready for review May 12, 2026 09:58

Copilot AI review requested due to automatic review settings May 12, 2026 09:58

Copilot finished work on behalf of xadupre May 12, 2026 09:59

Copilot started reviewing on behalf of xadupre May 12, 2026 10:04 View session

Copilot AI reviewed May 12, 2026

View reviewed changes

Comment thread olive/common/hf/utils.py Outdated

xadupre and others added 2 commits May 12, 2026 12:10

Potential fix for pull request finding

8941efb

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Merge branch 'main' into copilot/fr-add-model-to-config-json

be35ef4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `--test` HF CLI path for 2-layer random model configs, `olive run` and ModelBuilder support, Qwen how-to/layer-types fix, and merge conflict resolution#2459

Add `--test` HF CLI path for 2-layer random model configs, `olive run` and ModelBuilder support, Qwen how-to/layer-types fix, and merge conflict resolution#2459
Copilot wants to merge 37 commits into
mainfrom
copilot/fr-add-model-to-config-json

Copilot AI commented May 11, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented May 11, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Uh oh!

Copilot AI commented May 11, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

xadupre commented May 12, 2026

Uh oh!

Copilot AI commented May 12, 2026 •

edited

Loading

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Copilot AI commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented May 11, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Uh oh!

Copilot AI commented May 11, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

xadupre commented May 12, 2026

Uh oh!

Copilot AI commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Copilot AI commented May 11, 2026 •

edited

Loading

Copilot AI commented May 12, 2026 •

edited

Loading