ReleaseNVIDIANVIDIApublished Oct 27, 2025seen 5d

NVIDIA/MatX v0.9.4

NVIDIA/MatX

Open original ↗

Captured source

source ↗
published Oct 27, 2025seen 5dcaptured 8hhttp 200method plain

v0.9.4

Repository: NVIDIA/MatX

Tag: v0.9.4

Published: 2025-10-27T17:38:49Z

Prerelease: no

Release notes: Note: MatX is approaching a 1.0 release with several major updates. 1.0 will contain CUDA JIT capabilities that allow better kernel fusion and overall improvements in kernel runtimes. Along with the JIT capabilities, most files have changes that allow for efficient improvements in the kernels. MatX 1.0 will require C++20 support in both the CUDA and host compilers. CUDA 11.8 support will no longer be supported.

Notable Changes:

  • apply() and apply_idx() operators for writing lambda-based custom operators

Full Changelog

  • Add profiling unit tests and fix timer safety by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1060
  • Fixed-size reductions by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1061
  • Fix gcc warning by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1062
  • Added enum documentation for all operators by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1063
  • Support ND operators and transforms to/from python by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1064
  • Add prerun_done_ flag to prevent duplicate PreRun executions in transform operators by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1065
  • Fix some iterator issues that come up with CCCL ToT by @miscco in https://github.com/NVIDIA/MatX/pull/1066
  • Properly use an if constexpr to guard segemented CUB algorithms by @miscco in https://github.com/NVIDIA/MatX/pull/1067
  • Fix cuTENSORNet/cuDSS library path and update to new cuTensorNet API by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1069
  • Added apply() operator by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1072
  • Update stdd docs by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1076
  • Update release container to CUDA 13.0.1 by @tmartin-gh in https://github.com/NVIDIA/MatX/pull/1068
  • Add apply_idx operator for index-aware computations by @cliffburdick in https://github.com/NVIDIA/MatX/pull/1077
  • Fix missing include of `` by @miscco in https://github.com/NVIDIA/MatX/pull/1078

Full Changelog: https://github.com/NVIDIA/MatX/compare/v0.9.3...v0.9.4

Notability

notability 3.0/10

Routine library release, no major traction