ReleaseNVIDIANVIDIApublished May 9, 2026seen 5d

NVIDIA/nvflow v1.1.0

NVIDIA/nvflow

Open original ↗

Captured source

source ↗
published May 9, 2026seen 5dcaptured 13hhttp 200method plain

Release v1.1.0

Repository: NVIDIA/nvflow

Tag: v1.1.0

Published: 2026-05-09T02:02:06Z

Prerelease: no

Release notes:

Highlights

Container

| Software Component | Version | |---|---| | NeMo-RL | v0.6.0 | | NeMo-Skills | 0229040 (commit) | | vLLM (eval/SDG) | 0.18.1 | | vLLM (GRPO) | 0.17.1 | | sglang | v0.5.10.post1 |

GRPO Multi-Environment Training

Two-environment GRPO pipeline with split configs to prevent cross-environment leaks:

  • equivalence_llm_judge — FSDP v2 backend, 16 GPUs
  • finance_sec_search — Megatron backend with YaRN (131K context), 64 GPUs

Qwen3-30B-A3B Production Pipeline

Full GRPO config for Qwen3-30B-A3B MoE with curriculum ordering, dynamic sampling, and context parallelism.

Rollout Scaling

Scale-independent rollout pipeline with multi-node vLLM, logical chunking, and fault-tolerant multi-seed execution via dependent_jobs.

Eval Pipeline Hardening

  • DTensor v2 safetensors checkpoint conversion with .hf_metadata auto-recreation
  • Separate per-environment eval output directories
  • Standalone eval support (cross-session Slurm dependency handling)

Documentation

  • Quick-start rewrite with per-environment stage-by-stage execution
  • Dual backend guide (FSDP for demo, Megatron for production)

Notability

notability 3.0/10

Routine point release from NVIDIA