NVIDIA/srt-slurm
Python
Captured source
source ↗NVIDIA/srt-slurm
Description: NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform across specific AI use cases across hardware and software combinations.
Language: Python
License: NOASSERTION
Stars: 31
Forks: 53
Open issues: 58
Created: 2026-01-06T01:40:25Z
Pushed: 2026-06-10T20:10:13Z
Default branch: main
Fork: no
Archived: no
README:
srtctl
Command-line tool for distributed LLM inference benchmarks on SLURM clusters using TensorRT LLM, SGLang and vLLM. Replace complex shell scripts and 50+ CLI flags with declarative YAML configuration.
Quick Start
# Clone and install git clone https://github.com/your-org/srtctl.git cd srtctl pip install -e . # One-time setup (downloads NATS/ETCD, creates srtslurm.yaml) make setup ARCH=aarch64 # or ARCH=x86_64
Documentation
Full documentation: https://srtctl.gitbook.io/srtctl-docs/
- [Installation](docs/installation.md) - Setup and configuration
- [Monitoring](docs/monitoring.md) - Job logs and debugging
- [Parameter Sweeps](docs/sweeps.md) - Grid searches
- [Profiling](docs/profiling.md) - Torch/nsys profiling
- [Analyzing Results](docs/analyzing.md) - Dashboard and visualization
Commands
# Submit job(s) srtctl apply -f config.yaml # Submit with custom setup script srtctl apply -f config.yaml --setup-script custom-setup.sh # Submit with tags for filtering srtctl apply -f config.yaml --tags experiment,baseline # Dry-run (validate without submitting) srtctl dry-run -f config.yaml # Launch analysis dashboard uv run streamlit run analysis/dashboard/app.py
Excerpt shown — open the source for the full document.
Notability
notability 3.0/10Routine repo with low traction
NVIDIA has a repo signal matching evals and quality, infrastructure.