ReleaseNovita AINovita AIpublished Jun 5, 2026seen 5d

novitalabs/pegaflow v0.22.6

novitalabs/pegaflow

Open original ↗

Captured source

source ↗
published Jun 5, 2026seen 5dcaptured 10hhttp 200method plain

v0.22.6

Repository: novitalabs/pegaflow

Tag: v0.22.6

Published: 2026-06-05T10:11:02Z

Prerelease: no

Release notes:

0.22.5 -> 0.22.6 对照

0.22.6 是 P/D 分离的第一个 stable 版本:这版把 P/D RDMA push 路径的性能稳定性、正确性保护、布局兼容和发布包完整性一起补齐。0.22.5 是上一版 release baseline;0.22.6 是推荐用于 P/D 分离的正式版本。

| 维度 | 0.22.5 | 0.22.6 | | --- | --- | --- | | P/D 分离 | 基础发布状态,P/D 路径仍在收敛 | 第一版 stable:P/D RDMA push、scheduler/worker 协议、prefill/decode 流程、proxy 和 native RDMA binding 都经过稳定化 | | 性能 | RDMA push 和布局映射还缺少本轮优化 | 支持 MLA cache layout;优化 layout mapping;补齐 connector metrics;提升 RDMA push 发送、完成通知、等待路径和链路利用率观测 | | 正确性 | 对版本不匹配、零命中 query probe、RDMA-only query path 等边界保护不足 | vLLM/server 版本不匹配 fail early;零命中 query probe 不再错误释放 lease;RDMA-only query path 加 cfg-gate;vLLM 启动失败更早暴露 | | 发包 | CUDA 13 wheel 使用 --no-default-features --features cuda-13,会关掉默认 rdma | CUDA 13 CI/release wheel 显式使用 --features cuda-13,rdma;cu12 和 cu13 包都带 RDMA | | 其他修复 | SSD cache 单路径 | SSD cache 支持多路径;清理未使用 Rust 依赖;测试门禁和 e2e cargo feature 对齐 |

0.22.5 -> 0.22.6 Comparison

0.22.6 is the first stable P/D disaggregation release. It closes the loop on P/D RDMA push performance, correctness guards, cache-layout compatibility, and package completeness. 0.22.5 is the previous release baseline; 0.22.6 is the recommended release for stable P/D disaggregation.

| Area | 0.22.5 | 0.22.6 | | --- | --- | --- | | P/D disaggregation | Baseline release state, with the P/D path still converging | First stable release: P/D RDMA push, scheduler/worker protocol, prefill/decode flow, proxy, and native RDMA binding have been stabilized | | Performance | RDMA push and layout mapping had not received this round of optimization | Adds MLA cache layout support; optimizes layout mapping; adds connector metrics; improves RDMA push sending, completion signaling, wait behavior, and link-utilization visibility | | Correctness | Edge cases such as version mismatch, zero-hit query probes, and RDMA-only query paths were under-protected | Fails early on vLLM/server version mismatch; avoids incorrect lease release for zero-hit query probes; cfg-gates RDMA-only query behavior; fails faster when vLLM startup dies | | Packaging | CUDA 13 wheels used --no-default-features --features cuda-13, which disabled default rdma | CUDA 13 CI/release wheels now explicitly use --features cuda-13,rdma; both cu12 and cu13 packages include RDMA | | Other fixes | Single SSD cache path | Multiple SSD cache paths; unused Rust dependencies removed; test gates and e2e cargo features aligned |

Artifacts

  • GitHub Release includes 10 wheels: Python 3.10-3.14 for both pegaflow-llm and pegaflow-llm-cu13.
  • PyPI published pegaflow-llm==0.22.6 and pegaflow-llm-cu13==0.22.6.

Notability

notability 3.0/10

Routine version bump release