NVIDIA/cccl python-0.6.0
NVIDIA/cccl
Captured source
source ↗published Apr 9, 2026seen 5dcaptured 9hhttp 200method plain
CCCL Python Libraries v0.6.0
Repository: NVIDIA/cccl
Tag: python-0.6.0
Published: 2026-04-09T13:27:23Z
Prerelease: no
Release notes: These are the release notes for the cuda-cccl Python package version 0.6.0, dated April 9th, 2026. The previous release was v0.5.1.
cuda-cccl is in "experimental" status, meaning that its API and feature set can change quite rapidly.
Installation
Please refer to the install instructions here
API breaking changes
cuda.cooprefactored to use maker factory functions (#7713)
Features
ShuffleIterator— New iterator type added to cuda.compute (#7721)max_segment_size guarantee— Exposed in the public API (#8284)- LTO-IR support — Can now directly pass LTO-IR for custom operators (#7625)
- Numba-optional install — Added a path to install cuda.compute without Numba as a dependency (#7633)
Performance
- Faster TransformIterator construction (#7660)
Bug Fixes
- Fix faulty pointer arithmetic in CUB dispatch (#7940)
- Fix merge sort returning negative temp storage bytes (#7916)
- Fix histogram build object caching when using privatized smem strategy (#7657)
Notability
notability 3.0/10Routine library release