What does this model signal mean?

Nous Research published NousResearch/DeepHermes-ToolCalling-Specialist-Atropos. This model signal is evidence of what shipped on model infrastructure and how the release is positioned. High-signal details: license llama3 · 27 HF downloads · A fine-tuned Hermes model specialized for tool calling.. onlylabs links this event to 1 captured evidence page and 6 related model signals.

Nous Research Model: NousResearch/DeepHermes-ToolCalling-Specialist-Atropos

Captured source

source ↗

Hugging Face/huggingface.co/NousResearch/DeepHermes-ToolCalling-Specialist-Atropos

NousResearch/DeepHermes-ToolCalling-Specialist-Atropos model card

Source ↗

published Apr 11, 2025seen Jun 6captured Jun 11http 200method plaintask reinforcement-learninglicense llama3library transformersparams 8Bdownloads 27likes 20

DeepHermes Tool Calling Specialist - Atropos RL

Model Overview

The DeepHermes Tool Calling Specialist - Atropos RL model is an experimental artifact fine-tuned by Nous Research using our innovative open-source reinforcement learning framework—Atropos. This variant specifically improves the tool calling performance of the DeepHermes 3 Llama-3.1 8B model during its reasoning mode.

Note: This model is intended as an experimental artifact and is not designed for broad, general-purpose use.

Atropos Open Source Framework

Atropos is Nous Research’s open-source Reinforcement Learning environment stack, designed to enhance various aspects of LLM functionalities through structured RL methodologies. We encourage contributions and exploration:

🔗 Atropos GitHub Repository

Benchmark Results

Evaluations on the Berkeley Function Calling benchmark demonstrate significant improvements in tool calling accuracy during reasoning mode, compared to its base model:

| Benchmark | Base Accuracy | Atropos RL Accuracy | Improvement | | --------- | ------------- | ------------------- | ----------- | | Parallel | 0.10 | 0.46 | 4.6x | | Simple | 0.21 | 0.5175 | 2.5x |

These enhancements are due to RL fine-tuning specifically targeted at improving reasoning-based tool calling capabilities.

Eval set accuracy results:

!image/png

Key Features

Improved Tool Calling in Reasoning Mode: Reinforcement learning significantly boosts tool usage during complex reasoning tasks.
Open-Source RL Framework: Utilizes the fully open-source Atropos RL Environments.
Active Open Source Community: Contributions welcomed on the Atropos GitHub.
Upcoming SOTA RL Trainer: A state-of-the-art open-source reinforcement learning trainer by Nous Research is coming soon.

Usage

This model supports multiple inference modes including:

Reasoning (Deep Thinking Mode)
Standard Chat/Instruction Mode
Structured JSON Outputs
Function Calling

Detailed documentation and example inference code are available:

*Note: You must first place DeepHermes' reasoning system prompt, and then append your function calling system prompt after for it to do reasoning and tool calling simultaneously.*

🔗 Hermes Function Calling GitHub

How to Cite

@misc{
title={DeepHermes Tool Calling Specialist - Atropos RL},
author={Teknium and Dakota Mahan and Roger Jin and Chen Guang and Jai Suphavadeeprasit and Jeffrey Quesnelle},
year={2025},
url={https://huggingface.co/NousResearch/DeepHermes-Tool-Calling-Specialist-Atropos-RL}
}

Community and Support

For questions, issues, or findings, please open issues or discussions in the respective GitHub repositories:

Nous Research encourages active community engagement and open-source contributions to continuously improve model performance and capabilities.

Notability

notability 3.0/10

Low traction specialist model release