microsoft/winml-cli v0.2.0
microsoft/winml-cli
Captured source
source ↗published Jun 25, 2026seen 1dcaptured 1dhttp 200method plain
WinML CLI v0.2.0
Repository: microsoft/winml-cli
Tag: v0.2.0
Published: 2026-06-25T03:37:55Z
Prerelease: no
Release notes: This cycle unifies task detection across the CLI (modality- and architecture-aware) and expands the eval and perf surfaces — new depth-estimation and tensor-similarity evaluators, a full SA eval pipeline with an HTML report, winml perf --memory / --ep-options, and --format json on eval / analyze / perf. winml compile gains a multi-model shared EP context, winml build gains --precision, and timm image-classification is supported. See the behavior changes below.
⚠️ Behavior changes
winml perfno longer compiles by default — added--compile/--no-compile, defaulting to no-compile (#879).- Boolean CLI options are now
--flag/--no-flagpairs (#844). - Telemetry is enabled in the shipped wheel; consent reworded as "unlinked pseudonymized" (#810).
✨ Improvements
- Task detection — modality- and architecture-aware
detect_task, unified across commands viaresolve_task/TaskResolution(#807, #841, #878). winml perf—--memoryreports RAM/VRAM per phase (#861);--ep-optionspasses runtime EP options (#865, #889); output now shows the model path and precision (#875).winml compile— multi-model shared EP context with a selectable backend (#871).winml build— added--precision(#914).winml inspect— renders composite (pipeline-led) model structure (#903).winml analyze—--ep/--deviceauto resolves to a single best target (#919); faster re-runs plus a--debugrule locator (#906).winml eval— new SA eval pipeline with per-stage perf and an HTML report (#599); depth-estimation (#326, #437) and tensor-similarity (#805) evaluators; scripts track ONNX size and sanitize output (#755).- Cross-command —
--format jsononeval/analyze/perf(#855);--allow-unsupported-nodesonperf/build/eval/run(#821). - Quality of life — timm image-classification via library routing (#790);
~expanded in paths (#815); progress bar during EP warmup (#788); refreshed--list-devicecoloring (#812).
🐛 Fixes
- `winml perf` — declared
psutilas a runtime dependency, fixing a crash on clean install (#937); composite (dual-encoder) models supported (#866); HF and ONNX paths unified throughPerfBenchmark(#659);--monitorlive chart in--modulemode (#654, #920);richLive thread crashes (#832). - `winml analyze` — coverage-counting bugs (#922); analyzer API EP list matches the CLI (#803); Pad / Gemm rule conflicts (#906).
- Task / config validation — fill-mask heads detected as
text2text-generation(#851); vision feature-extraction model-task inconsistency (#786); model task validated in config (#723); full encoder-decoder composite built for no-task seq2seq (#850, #862); device/EP combination validated without a system check (#780). - `winml export` —
.datafiles written to the output dir, not the cwd (#853); timmimage_sizefrompretrained_cfg(#806). - `winml inspect` / `winml catalog` —
--taskvalidated at parse time (#546, #771);catalog -tshort flag aligned (#541, #772); VitisAI EP ordered last, catalog table width fixed (#763). - Feature extraction —
last_hidden_statenow populated in the output (#863). - `winml optimize` — untie batched constant
MatMulfor OpenVINO GPU (#817). - `winml eval` — fixed failures on AMD hosts (#783); cleanup runs on
SKIP_*/ exception paths (#890). - CLI output — quieted
optimumlogger noise (#904); unified verbosity, logger routed to stderr (#566, #793).
📦 Assets
winml_cli-0.2.0-py3-none-any.whlrules-v0.2.0.zip
Notability
notability 3.0/10Routine tool release, v0.2.0.