RepoNVIDIANVIDIApublished Jan 6, 2026seen 5d

NVIDIA/srt-slurm

Python

Open original ↗

Captured source

source ↗
published Jan 6, 2026seen 5dcaptured 9hhttp 200method plain

NVIDIA/srt-slurm

Description: NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across specific AI use cases across hardware and software combinations.

Language: Python

License: NOASSERTION

Stars: 31

Forks: 53

Open issues: 58

Created: 2026-01-06T01:40:25Z

Pushed: 2026-06-10T20:10:13Z

Default branch: main

Fork: no

Archived: no

README:

srtctl

Command-line tool for distributed LLM inference benchmarks on SLURM clusters using TensorRT LLM, SGLang and vLLM. Replace complex shell scripts and 50+ CLI flags with declarative YAML configuration.

Quick Start

# Clone and install
git clone https://github.com/your-org/srtctl.git
cd srtctl
pip install -e .

# One-time setup (downloads NATS/ETCD, creates srtslurm.yaml)
make setup ARCH=aarch64 # or ARCH=x86_64

Documentation

Full documentation: https://srtctl.gitbook.io/srtctl-docs/

  • [Installation](docs/installation.md) - Setup and configuration
  • [Monitoring](docs/monitoring.md) - Job logs and debugging
  • [Parameter Sweeps](docs/sweeps.md) - Grid searches
  • [Profiling](docs/profiling.md) - Torch/nsys profiling
  • [Analyzing Results](docs/analyzing.md) - Dashboard and visualization

Commands

# Submit job(s)
srtctl apply -f config.yaml

# Submit with custom setup script
srtctl apply -f config.yaml --setup-script custom-setup.sh

# Submit with tags for filtering
srtctl apply -f config.yaml --tags experiment,baseline

# Dry-run (validate without submitting)
srtctl dry-run -f config.yaml

# Launch analysis dashboard
uv run streamlit run analysis/dashboard/app.py

Excerpt shown — open the source for the full document.

Notability

notability 3.0/10

Routine repo with low traction

NVIDIA has a repo signal matching evals and quality, infrastructure.