microsoft/Olive v0.9.3
Olive-ai 0.9.3
Repository: microsoft/Olive
Tag: v0.9.3
Published: 2025-09-22T22:31:57Z
Prerelease: no
Release notes:
New Features:
- Compatibility with Windows ML for ONNX model inference and evaluation (#2052, #2056, #2059, #2084).
- Gptq quantization supports lm_head quantization and more generic weight packing (#2137).
Improvements
- optimize CLI supports WebGPU execution provider (#2076) and NVTensorRtRTX execution provider (#2078).
- quantize CLI supports Gptq pass as an implementation (#2115).
- Onnx static quantization supports strided calibration data for lower memory usage (#2086).
- Extra options can be provided directly to the ModelBuilder pass (#2107).
- LMEvaluator has a new ORT backend with IOBinding, leading to a large speedup in runtime (#2133).
- OnnxFloatToFloat16 allows more granular control through op_include_list and node_include_list (#2134).
- AIMET quantization pass: Support for exclude op types (#2055), pre-quantized models (#2111), LLM augmented dataloaders (#2108), LPBQ (#2119), and Adaround (#2140).
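To illustrate why strided calibration can lower memory usage, here is a minimal sketch in plain Python. It assumes "strided" means the calibration set is consumed in fixed-size chunks, with each chunk's statistics folded into a running range. It is not Olive's or ONNX Runtime's implementation, and the names iter_strides and strided_min_max are hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple


def iter_strides(samples: Sequence[List[float]], stride: int) -> Iterator[Sequence[List[float]]]:
    """Yield consecutive chunks of at most `stride` calibration samples."""
    if stride <= 0:
        raise ValueError("stride must be positive")
    for start in range(0, len(samples), stride):
        yield samples[start:start + stride]


def strided_min_max(samples: Sequence[List[float]], stride: int) -> Tuple[float, float]:
    """Fold per-chunk min/max into a running range (hypothetical helper).

    With a real data loader each chunk would be read on demand, so only
    `stride` samples need to be resident at once; here the data is already
    in memory and the chunking is purely illustrative.
    """
    lo, hi = float("inf"), float("-inf")
    for chunk in iter_strides(samples, stride):
        for sample in chunk:
            lo = min(lo, min(sample))
            hi = max(hi, max(sample))
    return lo, hi


# Toy calibration set: five samples with two activation values each.
calibration = [[0.1, -0.4], [2.5, 0.0], [-1.2, 0.7], [0.3, 0.9], [1.1, -0.2]]
value_range = strided_min_max(calibration, stride=2)
print(value_range)
```

Because min and max are associative, the chunked result matches a single pass over the full set, so the stride only trades memory for the number of passes through the loader.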
Deprecation
As per the deprecation warning in the previous release, the following Azure ML related features have been removed:
- Azure ML system
- Azure ML resource types: model, datastore, job outputs.
- Remote workflow
- Azure ML artifact packaging
Other removed features include:
- IsolatedORT System (#2070)
- Quantization Aware Training (#2089)
- AppendPrePostProcessingOps pass (#2090)
- SNPE passes (#2098)
Recipes Migration
All recipes have been migrated to the olive-recipes repository.
Notability: 3.0/10. Minor version update for a model optimization tool.