NVIDIA/TileGym v1.2.0
NVIDIA/TileGym
Captured source
source ↗published Apr 22, 2026seen 5dcaptured 8hhttp 200method plain
v1.2.0
Repository: NVIDIA/TileGym
Tag: v1.2.0
Published: 2026-04-22T05:43:38Z
Prerelease: no
Release notes:
What's Changed
- Add PyPI install instructions to README by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/96
- Integrate Qwen3.5 with TileGym cuTile Kernels — 2.68x Speedup & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/92
- cleanup: Remove dead mask variable and make bounds checking explicit in GELU kernel by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/99
- docs: Update ROADMAP.md statuses & Add more unsloth kernels by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/100
- Update TileGym Julia kernels to cuTile 0.2 by @maleadt in https://github.com/NVIDIA/TileGym/pull/102
- Update translated READMEs to match latest English README & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/103
- perf: gemma_attention CuTile — use approx tanh (rounding_mode=APPROX) for soft cap & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/105
- Integrate TileGym Kernels for allenai/Olmo-3-1025-7B & [skill] Add cutile auto research by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/106
- fix(unsloth): fix 6 correctness and performance issues in CuTile RoPE kernels & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/108
- [skill] fix test func for perf improvement skills & perf(sm80): tune cuTile kernels for A100 with README updated by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/110
- fix(ci): render all benchmark columns in summary, not just allowlisted backends by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/109
- Bump version from 1.1.0 to 1.2.0 by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/112
New Contributors
- @maleadt made their first contribution in https://github.com/NVIDIA/TileGym/pull/102
Full Changelog: https://github.com/NVIDIA/TileGym/compare/v1.1.0...v1.2.0
Notability
notability 4.0/10Routine version update to existing repo