NVIDIA/MatX v0.9.4
NVIDIA/MatX
Captured source
source ↗published Oct 27, 2025seen 5dcaptured 8hhttp 200method plain
v0.9.4
Repository: NVIDIA/MatX
Tag: v0.9.4
Published: 2025-10-27T17:38:49Z
Prerelease: no
Release notes: Note: MatX is approaching a 1.0 release with several major updates. 1.0 will contain CUDA JIT capabilities that allow better kernel fusion and overall improvements in kernel runtimes. Along with the JIT capabilities, most files have changes that allow for efficient improvements in the kernels. MatX 1.0 will require C++20 support in both the CUDA and host compilers. CUDA 11.8 support will no longer be supported.
Notable Changes:
- apply() and apply_idx() operators for writing lambda-based custom operators
Full Changelog
- Add profiling unit tests and fix timer safety by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1060
- Fixed-size reductions by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1061
- Fix gcc warning by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1062
- Added enum documentation for all operators by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1063
- Support ND operators and transforms to/from python by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1064
- Add prerun_done_ flag to prevent duplicate PreRun executions in transform operators by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1065
- Fix some iterator issues that come up with CCCL ToT by @miscco in https://github.com/NVIDIA/MatX/pull/1066
- Properly use an
if constexprto guard segemented CUB algorithms by @miscco in https://github.com/NVIDIA/MatX/pull/1067 - Fix cuTENSORNet/cuDSS library path and update to new cuTensorNet API by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1069
- Added apply() operator by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1072
- Update stdd docs by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1076
- Update release container to CUDA 13.0.1 by @tmartin-gh in https://github.com/NVIDIA/MatX/pull/1068
- Add apply_idx operator for index-aware computations by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1077
- Fix missing include of `` by @miscco in https://github.com/NVIDIA/MatX/pull/1078
Full Changelog: https://github.com/NVIDIA/MatX/compare/v0.9.3...v0.9.4
Notability
notability 3.0/10Routine library release, no major traction