Release: NVIDIA, published Apr 22, 2026

NVIDIA/TileGym v1.2.0

v1.2.0

Repository: NVIDIA/TileGym

Tag: v1.2.0

Published: 2026-04-22T05:43:38Z

Prerelease: no

Release notes:

What's Changed

  • Add PyPI install instructions to README by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/96
  • Integrate Qwen3.5 with TileGym cuTile Kernels — 2.68x Speedup & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/92
  • cleanup: Remove dead mask variable and make bounds checking explicit in GELU kernel by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/99
  • docs: Update ROADMAP.md statuses & Add more unsloth kernels by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/100
  • Update TileGym Julia kernels to cuTile 0.2 by @maleadt in https://github.com/NVIDIA/TileGym/pull/102
  • Update translated READMEs to match latest English README & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/103
  • perf: gemma_attention CuTile — use approx tanh (rounding_mode=APPROX) for soft cap & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/105
  • Integrate TileGym Kernels for allenai/Olmo-3-1025-7B & [skill] Add cutile auto research by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/106
  • fix(unsloth): fix 6 correctness and performance issues in CuTile RoPE kernels & other updates by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/108
  • [skill] fix test func for perf improvement skills & perf(sm80): tune cuTile kernels for A100 with README updated by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/110
  • fix(ci): render all benchmark columns in summary, not just allowlisted backends by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/109
  • Bump version from 1.1.0 to 1.2.0 by @hannahli-nv in https://github.com/NVIDIA/TileGym/pull/112
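
The gemma_attention entry above mentions a tanh-based soft cap. As a rough illustration of what such a soft cap computes, here is a minimal plain-Python sketch. The function name `soft_cap` and the cap value are hypothetical and are not taken from TileGym; the actual kernel is written in cuTile and, per the release note, uses an approximate tanh (rounding_mode=APPROX), whereas this sketch uses the exact `math.tanh`.

```python
import math


def soft_cap(logit: float, cap: float) -> float:
    """Smoothly squash a logit toward the range [-cap, cap] using tanh.

    Hypothetical helper for illustration only; not TileGym's API.
    """
    return cap * math.tanh(logit / cap)


if __name__ == "__main__":
    # Small inputs pass through almost unchanged; large ones saturate near the cap.
    for x in (-200.0, -10.0, 0.0, 10.0, 200.0):
        print(x, soft_cap(x, cap=50.0))
```

Replacing the exact tanh with a faster approximate one trades a small amount of numerical accuracy for speed, which is the kind of trade-off the perf entry describes.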

New Contributors

  • @maleadt made their first contribution in https://github.com/NVIDIA/TileGym/pull/102

Full Changelog: https://github.com/NVIDIA/TileGym/compare/v1.1.0...v1.2.0

Notability

notability 4.0/10

Routine version update to existing repo