What does this fork signal mean?

Nous Research forked NousResearch/lighteval (forked from huggingface/lighteval). This fork signal points to upstream code the lab may be inspecting, patching, or building on. High-signal details: repo NousResearch/lighteval · parent huggingface/lighteval · Routine fork with minimal traction. onlylabs links this event to 1 captured evidence page and 6 related fork signals.

Nous Research Fork: NousResearch/lighteval

Captured source

source ↗

GitHub/github.com/NousResearch/lighteval

NousResearch/lighteval repository metadata

Source ↗

published Jun 27, 2025seen 5dcaptured 9hhttp 200method plain

NousResearch/lighteval

Description: Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Language: Python

License: MIT

Stars: 9

Forks: 2

Open issues: 0

Created: 2025-06-27T21:43:44Z

Pushed: 2025-11-21T22:57:48Z

Default branch: nous

Fork: yes

Parent repository: huggingface/lighteval

Archived: no

README:

Your go-to toolkit for lightning-fast, flexible LLM evaluation, from Hugging Face's Leaderboard and Evals Team.

---

Documentation: HF's doc

---

Unlock the Power of LLM Evaluation with Lighteval 🚀

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends—whether it's transformers, tgi, vllm, or nanotron—with ease. Dive deep into your model’s performance by saving and exploring detailed, sample-by-sample results to debug and see how your models stack-up.

Customization at your fingertips: letting you either browse all our existing tasks and metrics or effortlessly create your own custom task and custom metric, tailored to your needs.

Seamlessly experiment, benchmark, and store your results on the Hugging Face Hub, S3, or locally.

🔑 Key Features

Speed: Use vllm as backend for fast evals.
Completeness: Use the accelerate backend to launch any models hosted on Hugging Face.
Seamless Storage: Save results in S3 or Hugging Face Datasets.
Python API: Simple integration with the Python API.
Custom Tasks: Easily add custom tasks.
Versatility: Tons of metrics and tasks ready to go.

⚡️ Installation

Note that lighteval is currently completely untested on Windows, and we don't support it yet. (Should be fully functional on Mac/Linux)

pip install lighteval

Lighteval allows for many extras when installing, see here for a complete list.

If you want to push results to the Hugging Face Hub, add your access token as an environment variable:

huggingface-cli login

🚀 Quickstart

Lighteval offers the following entry points for model evaluation:

lighteval accelerate : evaluate models on CPU or one or more GPUs using [🤗

Accelerate](https://github.com/huggingface/accelerate)

lighteval nanotron: evaluate models in distributed settings using [⚡️

Nanotron](https://github.com/huggingface/nanotron)

lighteval vllm: evaluate models on one or more GPUs using [🚀

VLLM](https://github.com/vllm-project/vllm)

lighteval endpoint
inference-endpoint: evaluate models on one or more GPUs using [🔗

Inference Endpoint](https://huggingface.co/inference-endpoints/dedicated)

tgi: evaluate models on one or more GPUs using 🔗 Text Generation Inference
openai: evaluate models on one or more GPUs using 🔗 OpenAI API

Here’s a quick command to evaluate using the Accelerate backend:

lighteval accelerate \
"model_name=gpt2" \
"leaderboard|truthfulqa:mc|0|0"

🙏 Acknowledgements

Lighteval started as an extension of the fantastic Eleuther AI Harness (which powers the Open LLM Leaderboard) and draws inspiration from the amazing HELM framework.

While evolving Lighteval into its own standalone tool, we are grateful to the Harness and HELM teams for their pioneering work on LLM evaluations.

🌟 Contributions Welcome 💙💚💛💜🧡

Got ideas? Found a bug? Want to add a task or metric? Contributions are warmly welcomed!

If you're adding a new feature, please open an issue first.

If you open a PR, don't forget to run the styling!

pip install -e .[dev]
pre-commit install
pre-commit run --all-files

📜 Citation

@misc{lighteval,
author = {Habib, Nathan and Fourrier, Clémentine and Kydlíček, Hynek and Wolf, Thomas and Tunstall, Lewis},
title = {LightEval: A lightweight framework for LLM evaluation},
year = {2023},
version = {0.8.0},
url = {https://github.com/huggingface/lighteval}
}

Notability

notability 2.0/10

Routine fork with minimal traction