ReleaseMicrosoftMicrosoftpublished Apr 27, 2026seen 5d

microsoft/onnxruntime v1.25.1

microsoft/onnxruntime

Open original ↗

Captured source

source ↗
published Apr 27, 2026seen 5dcaptured 8hhttp 200method plain

ONNX Runtime v1.25.1

Repository: microsoft/onnxruntime

Tag: v1.25.1

Published: 2026-04-27T21:46:20Z

Prerelease: no

Release notes: n.b. This changelog is LLM generated. Only the contributor listing has been verified.

ONNX Runtime Release 1.25.1

📢 Announcements & Breaking Changes

ONNX Op Updates

  • Enhanced ONNX operator support with new opset versions: Reshape (opset 25), Transpose (opset 24) (#27752)

---

✨ New Features

📊 New ONNX Ops & Model Support

  • LinearAttention and CausalConvState operators for Qwen3.5 model support (#27907)
  • RotaryEmbedding (RotEMB) and RMSNorm operators added (#27752)
  • Linear Attention signature support (#27842)

---

🌐 Web & JavaScript

WebGPU EP

  • Qwen3.5 model support on WebGPU execution provider (#27996)
  • QMoE 1-token decode path optimization — fused operations to reduce GPU dispatches for improved performance (#27998)

---

🐛 Bug Fixes

Core Runtime Fixes

  • Improved filesystem error messages during Linux device discovery for better debugging experience (#27289)
  • Fixed missing include for SetRawDataInTensorProto in NVIDIA TensorRT RTX tests (#28065)

---

🙏 Contributors

Thanks to our 7 contributors for this release: @guschmue, @sanaa-hamel-microsoft, @apsonawane, @eserscor, @ishwar-raut1, @qjia7, @theHamsta

Full Changelog: https://github.com/microsoft/onnxruntime/compare/v1.25.0...v1.25.1

Notability

notability 3.0/10

Routine patch release, low impact.