ReleaseNVIDIANVIDIApublished May 1, 2026seen 5d

NVIDIA/cosmos-curator v1.4.0

NVIDIA/cosmos-curator

Open original ↗

Captured source

source ↗
published May 1, 2026seen 5dcaptured 13hhttp 200method plain

Release v1.4.0

Repository: NVIDIA/cosmos-curator

Tag: v1.4.0

Published: 2026-05-01T19:01:46Z

Prerelease: no

Release notes:

Added

  • SAM3-based video object tracking, per-event VLM captioning, serialized SAM3 outputs, an example

event pipeline, and a demo tool.

  • Sensor library support for GPS, IMU, camera intrinsics, and camera extrinsics data.
  • MP4 header validation utilities for video-index checks.
  • Qwen3.5-27B support for image captioning.
  • External OpenAI/Gemini endpoint support for image semantic filtering and classification stages.
  • Async OpenAI/Gemini request handling with batch_size-controlled concurrency for image/video

captioning and external filter/classifier stages.

  • exclusive_end_ns support in make_ts_grid for half-open clip spans.

Fixed

  • Prevent Qwen from falling back to native-resolution inputs during resize.
  • Isolate vLLM async per-window payload handling.
  • Preserve model-variant-specific image filter errors.

Changed

  • Upgrade the cosmos-xenna Python package and submodule to v0.4.0.
  • Add a dedicated sam3 pixi environment for Segment Anything 3 dependencies.
  • Include runtime prompt and config data files in built wheels.

Documentation

  • Reorganize curator documentation into design, guide, and reference sections.
  • Add the interactive Slurm guide.
  • Add GPS and IMU sensor-library design documentation.
  • Update image pipeline documentation, including Qwen3.5 coverage.

Notability

notability 4.0/10

Routine repo release by NVIDIA.