microsoft/onnxruntime v1.24.4
microsoft/onnxruntime
Captured source
source ↗published Mar 17, 2026seen 5dcaptured 10hhttp 200method plain
ONNX Runtime v1.24.4
Repository: microsoft/onnxruntime
Tag: v1.24.4
Published: 2026-03-17T23:08:09Z
Prerelease: no
Release notes: This is a patch release for ONNX Runtime 1.24, containing bug fixes and execution provider updates.
Bug Fixes
- Core: Added PCI bus fallback for Linux GPU device discovery in containerized environments (e.g., AKS/Kubernetes) where
nvidia-drmis not loaded but GPU PCI devices are still exposed via sysfs. (#27591) - Plugin EP: Fixed null pointer dereference when iterating output spans in
GetOutputIndex. (#27644) - Plugin EP: Fixed bug that incorrectly assigned duplicate MetaDef IDs to fused nodes in different GraphViews (e.g., then/else branches of an If node), causing session creation to fail with a conflicting kernel error. (#27666)
Execution Provider Updates
- QNN EP: Enabled offline x64 compilation with memhandle IO type by deferring rpcmem library loading to inference time. (#27479)
- QNN EP: Reverted QNN SDK logging verbosity changes that caused segmentation faults on backend destruction. (#27650)
Build and Infrastructure
- Python: Updated
python_requiresfrom>=3.10to>=3.11to reflect dropped Python 3.10 support. (#27354) - Build: Replaced
__builtin_ia32_tpausewith the compiler-portable_tpauseintrinsic to fix cross-compiler portability issues between GCC and LLVM. (#27607)
Full Changelog: v1.24.3...v1.24.4
Contributors
@derdeljan-msft, @adrianlizarraga, @apwojcik, @baijumeswani, @edgchen1, @mocknen, @tianleiwu, @XXXXRT666
Notability
notability 3.0/10Routine patch release, no major traction