fix(perf): resolve concrete EP for the analyzer instead of ep=None (#931) by xieofxie · Pull Request #941 · microsoft/winml-cli

xieofxie · 2026-06-23T06:06:50Z

Summary

Fixes #931.

In `winml perf` without `--ep`, the EP was never resolved before the build, so the static analyzer ran with `ep=None` and aggregated across all EPs (logging "analyze_onnx called with ep=None — results will aggregate all EPs"), even though inference runs on a single device's EP.

This resolves a concrete device + EP from the request and passes it down to the build:

- PerfBenchmark resolves device + EP internally (_resolve_device_ep) at the start of _load_model, so an unavailable/invalid combo fails fast before the export/optimize/quantize/compile pipeline runs (previously it only surfaced at session.compile()). BenchmarkConfig keeps only the raw request; the resolved values live on the instance and drive from_pretrained/from_onnx.
- _perf_modules derives a concrete EP from the resolved device when none is given. Explicit EPs are kept verbatim (downstream stages normalize aliases).
- The CLI perf() no longer pre-resolves: it just builds the config and dispatches.
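The resolve-early pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the `AVAILABLE_EPS` table, the `build_started` flag, the exact-string availability check, and the shape of `BenchmarkConfig` are all assumptions made for the example, and alias normalization is not modeled.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical availability table; the real lookup lives elsewhere in winml-cli.
AVAILABLE_EPS = {
    "cpu": ["OpenVINOExecutionProvider", "CPUExecutionProvider"],
    "npu": [],
}


@dataclass(frozen=True)
class BenchmarkConfig:
    """Holds only the raw request (assumed shape, for illustration)."""
    device: str = "cpu"
    ep: Optional[str] = None


class PerfBenchmark:
    def __init__(self, config: BenchmarkConfig) -> None:
        self.config = config
        # Resolved values live on the instance, not on the config.
        self._resolved_device: Optional[str] = None
        self._resolved_ep: Optional[str] = None
        self.build_started = False  # stand-in for the expensive pipeline

    def _resolve_device_ep(self) -> None:
        if self._resolved_ep is not None:
            return  # already resolved; calling again is a no-op
        device = self.config.device
        candidates = AVAILABLE_EPS.get(device, [])
        if self.config.ep is not None:
            # Explicit EP is kept verbatim; this sketch only checks availability.
            if self.config.ep not in candidates:
                raise ValueError(f"{self.config.ep} is not available on {device}")
            ep = self.config.ep
        else:
            if not candidates:
                raise ValueError(f"no EP available on {device}")
            ep = candidates[0]
        self._resolved_device, self._resolved_ep = device, ep

    def _load_model(self) -> dict:
        # Resolve first, so a bad combo fails before any build work starts.
        self._resolve_device_ep()
        self.build_started = True
        return {"device": self._resolved_device, "ep": self._resolved_ep}
```

In this sketch, a config with no `ep` on `cpu` resolves to the first entry of the hypothetical table, while `device="npu"` raises before `build_started` is ever set.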

WinMLAutoModel stays permissive: ep=None remains a valid library mode (aggregate across EPs). The fix makes perf, which targets one device, always pass an explicit EP, which is what the analyzer warning asked for.

Verification

End-to-end A/B on a real CPU build (--no-skip-build, no --ep):

|  | analyzer target | `ep=None` warning |
| --- | --- | --- |
| before | `None on cpu` | fires |
| after | `OpenVINOExecutionProvider on cpu` | gone |

Unit tests (tests/unit/commands/test_perf_cli.py, test_perf_module.py, 51 passed) cover: derived concrete EP reaches from_onnx; explicit EP passes through verbatim; an unavailable combo raises before the build is kicked off.
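The three covered cases could be exercised along these lines. This is a hedged sketch rather than the tests in the PR: `run_perf`, `derive_ep`, and the keyword arguments passed to `from_onnx` are hypothetical stand-ins, and the model class is replaced with a `MagicMock`.

```python
from typing import Optional
from unittest.mock import MagicMock

# Hypothetical device -> EP mapping, for illustration only.
_DEFAULT_EP = {"cpu": "CPUExecutionProvider"}


def derive_ep(device: str) -> str:
    """Return a concrete EP for the device, or raise if none is available."""
    try:
        return _DEFAULT_EP[device]
    except KeyError:
        raise ValueError(f"no EP available on {device}") from None


def run_perf(model_cls, onnx_path: str, device: str, ep: Optional[str] = None):
    # Resolve before touching the model class, so failures precede the build.
    concrete_ep = ep if ep is not None else derive_ep(device)
    return model_cls.from_onnx(onnx_path, device=device, ep=concrete_ep)


def test_derived_ep_reaches_from_onnx():
    model_cls = MagicMock()
    run_perf(model_cls, "model.onnx", device="cpu")
    model_cls.from_onnx.assert_called_once_with(
        "model.onnx", device="cpu", ep="CPUExecutionProvider"
    )


def test_explicit_ep_passes_verbatim():
    model_cls = MagicMock()
    run_perf(model_cls, "model.onnx", device="cpu", ep="openvino")
    assert model_cls.from_onnx.call_args.kwargs["ep"] == "openvino"


def test_unavailable_combo_raises_before_build():
    model_cls = MagicMock()
    try:
        run_perf(model_cls, "model.onnx", device="npu")
    except ValueError:
        pass
    else:
        raise AssertionError("expected ValueError for an unavailable device")
    model_cls.from_onnx.assert_not_called()
```

Mocking the model class keeps the tests fast and lets the last case assert that the build entry point was never reached.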

Follow-up

#939 tracks folding _perf_modules into PerfBenchmark so the two resolution sites become one.


DingmaomaoBJTU

Clean fix. The EP resolution logic is correct, idempotent, and placed at the right point (before the expensive build pipeline). Tests cover all the important cases: derived EP reaches from_onnx, explicit EP passes verbatim, unavailable combo fails fast. One minor nit: when resolve_eps returns [], _resolved_ep silently stays None — a comment making that invariant explicit would help future readers.
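One way the suggested comment might look is sketched below. The helper `first_ep_or_none` and the stubbed `resolve_eps` callable are hypothetical; only the behavior the review describes (an empty result leaves the resolved EP as None) is taken from the comment above, and treating that as intentional is an assumption.

```python
from typing import Callable, List, Optional


def first_ep_or_none(
    resolve_eps: Callable[[str], List[str]], device: str
) -> Optional[str]:
    """Pick the first resolved EP for a device (illustrative helper)."""
    eps = resolve_eps(device)
    # Invariant: when resolve_eps returns an empty list, the resolved EP
    # stays None. No concrete EP was selected, and nothing is raised here.
    return eps[0] if eps else None
```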

xieofxie requested a review from a team as a code owner June 23, 2026 06:06

DingmaomaoBJTU approved these changes Jun 23, 2026

xieofxie merged commit 6977294 into main Jun 23, 2026
9 checks passed

xieofxie deleted the hualxie/perf_analyze_ep branch June 23, 2026 06:48

xieofxie mentioned this pull request Jun 23, 2026

fix(build): forward --ep to config generation; gate compile builds on EP availability #947

Draft
