NousResearch/Automodel
forked from NVIDIA-NeMo/Automodel
Captured source
source ↗published May 27, 2026seen 5dcaptured 14hhttp 200method plain
NousResearch/Automodel
Description: 🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
License: Apache-2.0
Stars: 10
Forks: 1
Open issues: 0
Created: 2026-05-27T12:18:10Z
Pushed: 2026-06-04T18:40:48Z
Default branch: main
Fork: yes
Parent repository: NVIDIA-NeMo/Automodel
Archived: no
README:
📣 News and Discussions
- [05/19/2026]**Ling 2.0** We now support finetuning the inclusionAI Ling 2.0 MoE family (
inclusionAI/Ling-mini-2.0,inclusionAI/Ling-flash-2.0, andinclusionAI/Ling-1T), thanks to @Hayden727. Check out our recipes. - [05/17/2026]**ERNIE 4.5** and **MiMo-V2-Flash** We now support finetuning
baidu/ERNIE-4.5-0.3B-PT,baidu/ERNIE-4.5-21B-A3B-PT, andXiaomiMiMo/MiMo-V2-Flash. Check out our ERNIE dense recipe, ERNIE MoE recipe, and MiMo recipe. - [04/29/2026]**Mistral Medium 3.5** We now support finetuning Mistral AI's 128B FP8-native VLM Mistral Medium 3.5. Check out our recipe and guide.
- [04/28/2026]**Nemotron-3-Nano-Omni** We now support finetuning
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16, NVIDIA's 30B-A3B omnimodal MoE (text · image · audio) with NemotronH hybrid Mamba+Attention backbone. Check out our SFT recipe, LoRA recipe, and guide. - [04/28/2026]**Hy3-preview** We now support finetuning
tencent/Hy3-preview, thanks to @Khazic. Check out our recipe. - [04/25/2026]**DeepSeek V4 Flash** We now support finetuning
deepseek-ai/DeepSeek-V4-Flash, thanks to @Khazic. Check out our recipe and guide. - [04/22/2026]**Qwen3.6-27B** We now support finetuning
Qwen/Qwen3.6-27B. Check out our recipe. - [04/20/2026]**Qwen-Image** We now support finetuning
Qwen/Qwen-Image, thanks to @harshareddy832. Check out our recipe. - [04/16/2026]**Qwen3.6 MoE** We now support finetuning
Qwen/Qwen3.6-35B-A3B. Check out our recipe. - [04/16/2026]**LLaVA-OneVision-1.5** We now support finetuning
lmms-lab/LLaVA-OneVision-1.5-4B-Instruct, thanks to @vgauraha62. Check out our recipe. - [04/12/2026]**MiniMax-M2.7** We now support finetuning
MiniMaxAI/MiniMax-M2.7. Check out our recipe. - [04/07/2026]**GLM-5.1** We now support finetuning
zai-org/GLM-5.1. GLM-5.1 is Zhipu AI's latest open-source MoE model featuring MLA + DeepSeek Sparse Attention. Check out our recipe and discussion. - [04/02/2026]**Gemma 4** We support fine-tuning for Gemma4 (2B, 4B, 31B, 26BA4B)! Check out our recipes.
- [03/30/2026]NeMo AutoModel ships with agent-friendly skills in skills/ to help you with common development tasks (e.g., running a recipe, model onboarding, development) across the repo. We welcome PRs that improve existing skills or add new ones.
- [03/16/2026]**Mistral Small 4** We support fine-tuning for Mistral4 119B! Check out our recipe.
- [03/11/2026]**Nemotron Super v3** We support fine-tuning for
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16. Check out our recipe. - [03/11/2026]**GLM-5** We now support finetuning
zai-org/GLM-5. Check out our…
Excerpt shown — open the source for the full document.
Notability
notability 1.0/10Routine fork with low stars.