ReleaseMicrosoftMicrosoftpublished Sep 22, 2025seen 5d

microsoft/Olive v0.9.3

microsoft/Olive

Open original ↗

Captured source

source ↗
published Sep 22, 2025seen 5dcaptured 11hhttp 200method plain

Olive-ai 0.9.3

Repository: microsoft/Olive

Tag: v0.9.3

Published: 2025-09-22T22:31:57Z

Prerelease: no

Release notes:

New Features:

  • Compatibility with Windows ML for ONNX model inference and evaluation (#2052, #2056, #2059, #2084).
  • Gptq quantization supports lm_head quantization and more generic weight packing (#2137).

Improvements

  • optimize CLI supports WebGPU execution provider (#2076) and NVTensorRtRTX execution provider (#2078).
  • quantize CLI supports Gptq pass as an implementation (#2115).
  • Onnx static quantization supports strided calibration data for lower memory usage (#2086).
  • Extra options can be provided directedly to the ModelBuilder pass (#2107).
  • LMEvaluator has a new ORT backend with IOBinding leading to large speedup in runtime (#2133).
  • OnnxFloatToFloat16 allows more granular control through op_include_list and node_include_list (#2134).
  • AIMET quantization pass: Support for exclude op types (#2055), pre-quantized models (#2111), LLM augmented dataloaders (#2108), LPBQ (#2119), and Adaround (#2140).

Deprecation

As per the deprecation warning in the previous release, the following Azure ML related features have been removed:

  • Azure ML system
  • Azure ML resource types: model, datastore, job outputs.
  • Remote workflow
  • Azure ML artifact packaging

Other removed features include:

  • IsolatedORT System (#2070)
  • Quantization Aware Training (#2089)
  • AppendPrePostProcessingOps pass (#2090)
  • SNPE passes (#2098)

Recipes Migration

All recipes have been migrated to olive-recipes repository.

Notability

notability 3.0/10

Minor version update for model optimization tool