{"schema_version":"onlylabs.public_analysis_evidence.v1","title":"DeepSeek analysis evidence pack","description":"Public onlylabs evidence pack for cited agent analysis: captured pages, ranked public signals, and stored web-search provenance used by the background analysis workflow.","url":"https://onlylabs.fyi/analysis/deepseek","json_url":"https://onlylabs.fyi/analysis/deepseek/evidence.json","generated_at":"2026-06-11T18:05:39.256Z","org":{"slug":"deepseek","name":"DeepSeek","category":"frontier-lab","category_label":"Frontier lab","dossier_url":"https://onlylabs.fyi/labs/deepseek"},"analysis":{"url":"https://onlylabs.fyi/analysis/deepseek","json_url":"https://onlylabs.fyi/analysis/deepseek/analysis.json","generated_at":"2026-06-08T15:59:08.656+00:00"},"workflow":{"version":"synthesize-analyses","provider":null,"model":null,"agent":null,"public_pack_mode":"local-pages-and-events","live_web_fetches":false,"note":"Public evidence exports do not trigger live Exa calls; stored Exa provenance is included when analysis metadata contains it."},"stats":{"pages":28,"events":72,"web":0,"evidence":88,"signal_desks":{"hiring":0,"forks":0,"releases":33,"talking":0,"repos":27},"data_radar_lanes":{"data":1,"evals":0,"infrastructure":4,"safety":0,"product":1},"data_radar_matches":5,"stored_analysis_evidence":null,"stored_analysis_web":null,"stored_analysis_signal_desks":null,"stored_analysis_data_radar_lanes":null,"stored_analysis_data_radar_matches":null},"stored_web_provenance":null,"evidence":[{"ref":"P1","kind":"page","title":"deepseek-ai/DeepSeek-MoE repository metadata","date":"2026-06-11T04:00:08.56497+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-MoE","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-MoE\n\nDescription: DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 1936\n\nForks: 307\n\nOpen issues: 22\n\nCreated: 2024-01-02T03:32:22Z\n\nPushed: 2024-01-16T12:18:10Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek LLM\" />\n</div>\n<hr>\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20LLM-536af5?color=536af5&logoColor=white\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"#3-model-downloads\">Model Download</a> |\n<a href=\"#2-evaluation-results\">Evaluation Results</a> |\n<a href=\"#4-quick-start\">Quick Start</a> |\n<a href=\"#5-license\">License</a> |\n<a href=\"#6-citation\">Citation</a>\n</p>\n\n<p align=\"center\">\n<a href=\"htt"},{"ref":"P2","kind":"page","title":"deepseek-ai/awesome-deepseek-coder repository metadata","date":"2026-06-11T04:00:06.318143+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/awesome-deepseek-coder","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/awesome-deepseek-coder\n\nDescription: A curated list of open-source projects related to DeepSeek Coder\n\nStars: 791\n\nForks: 211\n\nOpen issues: 0\n\nCreated: 2023-11-06T04:51:04Z\n\nPushed: 2025-11-11T06:45:15Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<p align=\"center\">\n<img width=\"1000px\" alt=\"Awesome DeepSeek Coder\" src=\"images/Awesome_DeepSeek_Coder.png\">\n</p>\n\n📚 [English] | [中文](https://github.com/deepseek-ai/awesome-deepseek-coder/blob/main/README_CN.md)\n<br> \n\n# awesome-deepseek-coder ![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)\n\nA curated list of open-source projects related to DeepSeek Coder.\n\n## Chat with DeepSeek Coder\nChat Website: [chat.deepseek.com](https://chat.deepseek.com/)\n\n## Official Resources\n### Released Models\nAll models are available on Hugging Face: [huggingface.co/deepseek-ai](https://huggingface.co/deepseek-ai)\n| Model Size | Base | Instruct |\n|------------|------|----------|\n| **1.3B** | [deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) | [deepseek-coder-1.3b-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-instruct) |\n| **5.7B** | [deepseek-coder-5.7bmqa-base](https://huggingface.co/deepseek-ai/deepseek-coder-5.7bmqa-base) | [deepseek-coder-5.7bmqa-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-5.7bmqa-instruct)(coming soon) |\n| **6.7B** | [deepseek-coder-6.7B-base](https://huggingface.co/deepseek-ai/deepseek-coder-6.7B-base) | [deepseek-coder-6.7B-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-6.7B-instruct) |\n| **33B** | [deepseek-coder-33B-base](https://huggingface.co/deepseek-ai/deepseek-coder-33B-base) | [deepseek-coder-33B-instruct](https://huggingface.co/deepseek-ai/deepseek-coder-33B-instruct) |\n\n## Community Resources\n\n### Models built upon DeepSeek Coder\n\n| Model Size | Models | \n|------------|------|\n| **1.3B** | [OpenCodeInterpreter-DS-1.3B](https://huggingface.co/m-a-p/OpenCodeInterpreter-DS-1.3B) |\n| **6.7B** | [Magicoder-DS-6.7B](https://huggingface.co/ise-uiuc/Magicoder-DS-6.7B) <br> [Magicoder-S-DS-6.7B](https://huggingface.co/ise-uiuc/Magicoder-S-DS-6"},{"ref":"P3","kind":"page","title":"deepseek-ai/DreamCraft3D repository metadata","date":"2026-06-11T04:00:05.434417+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DreamCraft3D","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DreamCraft3D\n\nDescription: [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 3006\n\nForks: 357\n\nOpen issues: 35\n\nCreated: 2023-10-23T07:40:20Z\n\nPushed: 2025-04-22T11:09:39Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DreamCraft3D\n\n<a href=\"https://trendshift.io/repositories/4231\" target=\"_blank\"><img src=\"https://trendshift.io/api/badge/repositories/4231\" alt=\"deepseek-ai%2FDreamCraft3D | Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"/></a>\n\n[**Paper**](https://arxiv.org/abs/2310.16818) | [**Project Page**](https://mrtornado24.github.io/DreamCraft3D/) | [**Youtube video**](https://www.youtube.com/watch?v=0FazXENkQms) | [**Replicate demo**](https://replicate.com/jd7h/dreamcraft3d)\n\nOfficial implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior\n\n[Jingxiang Sun](https://mrtornado24.github.io/), [Bo Zhang](https://bo-zhang.me/), [Ruizhi Shao](https://dsaurus.github.io/saurus/), [Lizhen Wang](https://lizhenwangt.github.io/), [Wen Liu](https://github.com/StevenLiuWen), [Zhenda Xie](https://zdaxie.github.io/), [Yebin Liu](https://liuyebin.com/)\n\nAbstract: *We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D reference image to guide the stages of geometry sculpting and texture boosting. A central focus of this work is to address the consistency issue that existing\nworks encounter. To sculpt geometries that render coherently, we perform score\ndistillation sampling via a view-dependent diffusion model. This 3D prior, alongside several training strategies, prioritizes the geometry consistency but compromises the texture fidelity. We further propose **Bootstrapped Score Distillation** to\nspecifically boost the texture. We train a personalized diffusion model, Dreambooth, on the augmented renderings of the scene, imbuing it with 3D knowledge\nof the scene being optimized. The score distillation from this 3D-aware diffusion prior provides view-consistent guidance for the scene."},{"ref":"P4","kind":"page","title":"deepseek-ai/DeepSeek-Coder repository metadata","date":"2026-06-11T04:00:05.245037+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-Coder","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-Coder\n\nDescription: DeepSeek Coder: Let the Code Write Itself\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 23659\n\nForks: 2845\n\nOpen issues: 163\n\nCreated: 2023-10-20T06:38:01Z\n\nPushed: 2025-11-11T06:44:56Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<p align=\"center\">\n<img width=\"1000px\" alt=\"DeepSeek Coder\" src=\"pictures/logo.png\">\n</p>\n<p align=\"center\"><a href=\"https://www.deepseek.com/\">[<img src=\"pictures/home.png\" width=\"20px\"> Homepage]</a> | <a href=\"https://chat.deepseek.com/\">[🤖 Chat with DeepSeek Coder]</a> | <a href=\"https://huggingface.co/deepseek-ai\">[🤗 Models Download]</a> | <a href=\"https://discord.gg/Tc7c45Zzu5\">[Discord]</a> | <a href=\"https://github.com/guoday/assert/blob/main/QR.png?raw=true\">[WeChat (微信)]</a></p>\n<p align=\"center\">\n<a href=\"https://huggingface.co/papers/2401.14196\"><b>Paper Link</b>👁️</a>\n</p>\n<hr>\n\n### 1. Introduction of DeepSeek Coder\n\nDeepSeek Coder is composed of a series of code language models, each trained from scratch on 2T tokens, with a composition of 87% code and 13% natural language in both English and Chinese. We provide various sizes of the code model, ranging from 1B to 33B versions. Each model is pre-trained on project-level code corpus by employing a window size of 16K and an extra fill-in-the-blank task, to support project-level code completion and infilling. For coding capabilities, DeepSeek Coder achieves state-of-the-art performance among open-source code models on multiple programming languages and various benchmarks.\n\n<p align=\"center\">\n<img src=\"pictures/result.png\" alt=\"result\" width=\"70%\">\n</p>\n\n- **Massive Training Data**: Trained from scratch on 2T tokens, including 87% code and 13% linguistic data in both English and Chinese languages.\n\n- **Highly Flexible & Scalable**: Offered in model sizes of 1B, 5.7B, 6.7B and 33B, enabling users to choose the setup most suitable for their requirements.\n\n- **Superior Model Performance**: State-of-the-art performance among publicly available code models on HumanEval, MultiPL-E, MBPP, DS-1000, and APPS benchmarks.\n\n- **Advanced Code Completion Capabilities**: A window size of 16K and a fill-in-the-blank task, supporting project-le"},{"ref":"P5","kind":"page","title":"deepseek-ai/DeepSeek-VL repository metadata","date":"2026-06-11T04:00:04.499929+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-VL","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-VL\n\nDescription: DeepSeek-VL: Towards Real-World Vision-Language Understanding\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 4126\n\nForks: 592\n\nOpen issues: 48\n\nCreated: 2024-03-07T08:32:57Z\n\nPushed: 2024-04-24T05:01:06Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek LLM\" />\n</div>\n<hr>\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n<a href=\"https://huggingface.co/spaces/deepseek-ai/DeepSeek-VL-7B\" target=\"_blank\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20VL-536af5?color=536af5&logoColor=white\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"#3-model-downloads\"><b>📥 Model Download</b></a> |\n<a href=\"#4-quick-start\"><b>⚡ Quick Start</b></a> |\n<a href=\"#5-license\"><b>📜 License</b></a> |\n<a href=\"#6-citation\"><b>📖 Citation</b></a> <br>\n<a href=\"https://arxiv.org/abs/2403.05525\"><b>📄 P"},{"ref":"P6","kind":"page","title":"deepseek-ai/DeepSeek-V2 repository metadata","date":"2026-06-11T04:00:04.449305+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-V2","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-V2\n\nDescription: DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model\n\nLicense: MIT\n\nStars: 5011\n\nForks: 541\n\nOpen issues: 88\n\nCreated: 2024-04-22T06:55:47Z\n\nPushed: 2024-09-25T10:23:55Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V2\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Homepage\" src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V2-536af5?color=536af5&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n</div>\n\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoCol"},{"ref":"P7","kind":"page","title":"deepseek-ai/DeepSeek-LLM repository metadata","date":"2026-06-11T04:00:04.201158+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-LLM","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-LLM\n\nDescription: DeepSeek LLM: Let there be answers\n\nLanguage: Makefile\n\nLicense: MIT\n\nStars: 7033\n\nForks: 1092\n\nOpen issues: 55\n\nCreated: 2023-11-29T11:07:49Z\n\nPushed: 2024-02-04T12:22:16Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek LLM\" />\n</div>\n<hr>\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20LLM-536af5?color=536af5&logoColor=white\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"#2-model-downloads\">Model Download</a> |\n<a href=\"#5-quick-start\">Quick Start</a> |\n<a href=\"#3-evaluation-results\">Evaluation Results</a> |\n<a href=\"#8-license\">License</a> |\n<a href=\"#9-citation\">Citation</a>\n</p>\n\n<p align=\"center\">\n<a href=\"https://arxiv.org/abs/2401.02954\"><b>Paper Link</b>👁️<"},{"ref":"P8","kind":"page","title":"deepseek-ai/awesome-deepseek-integration repository metadata","date":"2026-06-11T04:00:03.72998+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/awesome-deepseek-integration","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/awesome-deepseek-integration\n\nDescription: Integrate the DeepSeek API into popular software\n\nLicense: CC0-1.0\n\nStars: 37819\n\nForks: 4159\n\nOpen issues: 93\n\nCreated: 2024-01-11T02:43:17Z\n\nPushed: 2026-02-23T16:27:47Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<div align=\"center\">\n\n<p align=\"center\">\n<img width=\"1000px\" alt=\"Awesome DeepSeek Integrations\" src=\"docs/Awesome DeepSeek Integrations.png\">\n</p>\n\n# Awesome DeepSeek Integrations ![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)\n\nIntegrate the DeepSeek API into popular softwares. Access [DeepSeek Open Platform](https://platform.deepseek.com/) to get an API key.\n\nEnglish / [简体中文](https://github.com/deepseek-ai/awesome-deepseek-integration/blob/main/README_cn.md) / [繁體中文](https://github.com/deepseek-ai/awesome-deepseek-integration/blob/main/README_zh_tw.md) / [日本語](https://github.com/deepseek-ai/awesome-deepseek-integration/blob/main/README_ja.md) / [Español](https://github.com/deepseek-ai/awesome-deepseek-integration/blob/main/README_es.md)\n\n<a href=\"https://trendshift.io/repositories/12798\" target=\"_blank\"><img src=\"https://trendshift.io/api/badge/repositories/12798\" alt=\"deepseek-ai%2Fawesome-deepseek-integration | Trendshift\" style=\"width: 250px; height: 55px;\" width=\"250\" height=\"55\"/></a>\n</div>\n\n## Table of Contents\n\n- [Awesome DeepSeek Integrations ](#awesome-deepseek-integrations-)\n- [Table of Contents](#table-of-contents)\n- [Project List](#project-list)\n- [Applications](#applications)\n- [AI Agent frameworks](#ai-agent-frameworks)\n- [Data AI Applications frameworks](#data-ai-applications-frameworks)\n- [RAG frameworks](#rag-frameworks)\n- [FHE (Fully Homomorphic Encryption) frameworks](#fhe-fully-homomorphic-encryption-frameworks)\n- [Solana frameworks](#solana-frameworks)\n- [Synthetic data curation](#synthetic-data-curation)\n- [IM Application Plugins](#im-application-plugins)\n- [Office Addin](#office-addin)\n- [Browser Extensions](#browser-extensions)\n- [VS Code Extensions](#vs-code-extensions)\n- [Visual Studio Extensions](#visual-studio-extensions)\n- [neovim Extensions](#neovim-extensions)\n- [JetBrains Extensions]("},{"ref":"P9","kind":"page","title":"deepseek-ai/DeepSeek-Math repository metadata","date":"2026-06-11T04:00:03.477856+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-Math","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-Math\n\nDescription: DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 3319\n\nForks: 582\n\nOpen issues: 43\n\nCreated: 2024-02-05T07:25:51Z\n\nPushed: 2024-04-15T07:55:37Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek LLM\" />\n</div>\n<hr>\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20LLM-536af5?color=536af5&logoColor=white\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n<a href=\"https://replicate.com/cjwbw/deepseek-math-7b-base\" target=\"_parent\"><img src=\"https://replicate.com/cjwbw/deepseek-math-7b-base/badge\" alt=\"Replicate\"/></a> \n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"#4-model-downloads\">Model Download</a> |\n<a href=\"#2-evaluation-results\">Evaluation Resu"},{"ref":"P10","kind":"page","title":"deepseek-ai/DeepSeek-R1 repository metadata","date":"2026-06-11T04:00:02.688241+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-R1","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-R1\n\nLicense: MIT\n\nStars: 92010\n\nForks: 11712\n\nOpen issues: 49\n\nCreated: 2025-01-20T11:57:28Z\n\nPushed: 2025-06-27T08:35:54Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DeepSeek-R1\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-R1\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\"><img alt=\"Homepage\"\nsrc=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\"/></a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\"><img alt=\"Chat\"\nsrc=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20R1-536af5?color=536af5&logoColor=white\"/></a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\"><img alt=\"Hugging Face\"\nsrc=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\"/></a>\n<br>\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\"><img alt=\"Discord\"\nsrc=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\"/></a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\" target=\"_blank\"><img alt=\"WeChat\"\nsrc=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\"/></a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\"><img alt=\"Twitter Follow\"\nsrc=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\"/></a>\n<br>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-R1/blob/main/LICENSE\"><img alt=\"License\"\nsrc=\"https://img.shields.io/badge/License-MIT-f5de53?&color=f5de53\"/></a>\n<br>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf\"><b>Paper Link</b>👁️</a>\n</div>\n\n## 1. Introduction\n\nWe introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. \nDeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary"},{"ref":"P11","kind":"page","title":"deepseek-ai/ESFT repository metadata","date":"2026-06-11T04:00:02.53924+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/ESFT","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/ESFT\n\nDescription: Expert Specialized Fine-Tuning\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 737\n\nForks: 263\n\nOpen issues: 7\n\nCreated: 2024-07-04T09:48:48Z\n\nPushed: 2025-05-22T07:47:15Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Expert-Specialized Fine-Tuning\n\nOfficial Repo for paper [Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models](https://arxiv.org/abs/2407.01906) by \n[Zihan Wang](https://zihanwang314.github.io), [Deli Chen](https://victorchen96.github.io/chendeli.io/), [Damai Dai](https://scholar.google.com.hk/citations?user=8b-ysf0NWVoC&hl=zh-CN), [Runxin Xu](https://runxinxu.github.io/aboutme/), \n[Zhuoshu Li](http://www.idi.zju.edu.cn/member/3053.html) and\nY. Wu. \n\n**ESFT** aims to efficiently customize Large Language Models (LLMs) with Mixture-of-Experts (MoE) architecture by adjusting only task-relevant parts, improving efficiency and performance while using fewer resources and storage. \n\n## 📰 News\n\n📅 **2024.9.20:** Glad to announce that ESFT has been accepted to the **EMNLP 2024 Main Conference**! \n\n📅 **2024.8.11:** We now release the **ESFT training code**! ✨ You can now try it with your own models and dataset!\n\n## 🚀 Quick Start \n### Installation and Setup\n```bash\ngit clone https://github.com/deepseek-ai/ESFT.git\ncd esft\n```\n\n### Install required dependencies\n```bash\npip install transformers torch safetensors accelerate\n```\n\n### Download necessary adapters\n```bash\nbash scripts/download_adapters.sh\n```\n\n## 🔧Key Scripts\n1. **eval_multigpu.py**\nThis script evaluates the performance of the model on various datasets. See **scripts/eval.sh** for detailed configs and explanations.\n\n**Usage:**\n```bash\npython eval_multigpu.py \\\n--eval_dataset=translation \\\n--base_model_path=deepseek-ai/ESFT-vanilla-lite \\\n--adapter_dir=all_models/adapters/token/translation \\\n--output_path=results/completions/token/translation.jsonl \\\n--openai_api_key=YOUR_OPENAI_API_KEY\n```\n\n2. **get_expert_scores.py**\nThis script calculates the scores for each expert based on the evaluation datasets.\n**Usage:**\n```bash\nexport PYTHONPATH=$PYTHONPATH:$(pwd)\npython scripts/expert/get_expert_scores.py \\\n--"},{"ref":"P12","kind":"page","title":"deepseek-ai/DeepSeek-Prover-V1.5 repository metadata","date":"2026-06-11T04:00:02.447661+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-Prover-V1.5","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-Prover-V1.5\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 577\n\nForks: 239\n\nOpen issues: 9\n\nCreated: 2024-08-15T15:10:07Z\n\nPushed: 2024-08-16T03:33:41Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V2\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Homepage\" src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V2-536af5?color=536af5&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n</div>\n\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n"},{"ref":"P13","kind":"page","title":"deepseek-ai/DeepSeek-Coder-V2 repository metadata","date":"2026-06-11T04:00:02.36963+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-Coder-V2","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-Coder-V2\n\nDescription: DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence\n\nLicense: MIT\n\nStars: 6833\n\nForks: 1087\n\nOpen issues: 72\n\nCreated: 2024-06-14T03:39:37Z\n\nPushed: 2025-11-11T06:44:45Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V2\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Homepage\" src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V2-536af5?color=536af5&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n</div>\n\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo="},{"ref":"P14","kind":"page","title":"deepseek-ai/Janus repository metadata","date":"2026-06-11T04:00:02.204989+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/Janus","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/Janus\n\nDescription: Janus-Series: Unified Multimodal Understanding and Generation Models\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 17749\n\nForks: 2231\n\nOpen issues: 185\n\nCreated: 2024-10-18T03:48:16Z\n\nPushed: 2025-02-01T07:58:29Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek LLM\" />\n</div>\n<hr>\n\n<div align=\"center\">\n<h1>🚀 Janus-Series: Unified Multimodal Understanding and Generation Models</h1>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<!-- <a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a> -->\n<!-- <a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a> -->\n<!-- <a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a> -->\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"#2-model-download\"><b>📥 Model Download</b></a> |\n<a href=\"#3-quick-start\"><b>⚡ Quick Start</b></a> |\n<a href=\"#4-license\"><b>📜 License</b></a> |\n<a href=\"#5-citation\"><b>📖 Citation</b></a> <br>\n<!-- 📄 Paper Link (<a href=\"https://arxiv.org/abs/2410.13848\"><b>Janus</b></a>, <a href=\"https://arxiv.or"},{"ref":"P15","kind":"page","title":"deepseek-ai/DeepGEMM repository metadata","date":"2026-06-11T04:00:01.579333+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepGEMM","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepGEMM\n\nDescription: DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling\n\nLanguage: Cuda\n\nLicense: MIT\n\nStars: 7364\n\nForks: 1040\n\nOpen issues: 78\n\nCreated: 2025-02-13T09:09:21Z\n\nPushed: 2026-06-04T06:01:18Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DeepGEMM\n\nDeepGEMM is a unified, high-performance tensor core kernel library that brings together the key computation primitives of modern large language models — GEMMs (FP8, FP4, BF16), fused MoE with overlapped communication (Mega MoE), MQA scoring for the lightning indexer, HyperConnection (HC), and more — into a single, cohesive CUDA codebase. All kernels are compiled at runtime via a lightweight Just-In-Time (JIT) module, requiring no CUDA compilation during installation.\n\nDeepGEMM leverages some concepts from [CUTLASS](https://github.com/nvidia/cutlass) and [CuTe](https://github.com/NVIDIA/cutlass/tree/main/include/cute), but avoids heavy reliance on their templates or algebras. The library is designed for simplicity, with only a limited number of core kernel functions, making it a clean and accessible resource for learning NVIDIA GPU kernel optimization techniques.\n\nDespite its lightweight design, DeepGEMM's performance matches or exceeds expert-tuned libraries across various matrix shapes.\n\n## News\n\n- 2026.04.16: Mega MoE, FP8xFP4 GEMM, FP4 Indexer, PDL, faster JIT compilation and more.\n- Please see [#304](https://github.com/deepseek-ai/DeepGEMM/pull/304) for more details.\n- For Mega MoE benchmarks, refer to [#316](https://github.com/deepseek-ai/DeepGEMM/pull/316).\n- 2025.09.28: DeepGEMM now supports scoring kernels (weighted ReLU MQA logits) for the lightning indexer for DeepSeek v3.2.\n- Please see [#200](https://github.com/deepseek-ai/DeepGEMM/pull/200) for more details.\n- 2025.07.20: DeepGEMM now supports both SM90/SM100, and has a full refactor with a low-CPU-overhead JIT CPP module.\n- NVRTC and post-compilation SASS optimization are all disabled.\n- NVRTC will be supported later.\n- As NVCC 12.9 will automatically do the FFMA interleaving, all post optimizations will be no longer supported.\n- Please see [#112](https://github.com/deepseek-ai/DeepGEMM/pull/1"},{"ref":"P16","kind":"page","title":"deepseek-ai/DeepSeek-V3 repository metadata","date":"2026-06-11T04:00:01.536933+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-V3","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-V3\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 103739\n\nForks: 16734\n\nOpen issues: 241\n\nCreated: 2024-12-26T09:52:40Z\n\nPushed: 2025-08-28T03:24:37Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V3\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\"><img alt=\"Homepage\"\nsrc=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\"/></a>\n<a href=\"https://chat.deepseek.com/\"><img alt=\"Chat\"\nsrc=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V3-536af5?color=536af5&logoColor=white\"/></a>\n<a href=\"https://huggingface.co/deepseek-ai\"><img alt=\"Hugging Face\"\nsrc=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\"/></a>\n<br>\n<a href=\"https://discord.gg/Tc7c45Zzu5\"><img alt=\"Discord\"\nsrc=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\"/></a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\"><img alt=\"Wechat\"\nsrc=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\"/></a>\n<a href=\"https://twitter.com/deepseek_ai\"><img alt=\"Twitter Follow\"\nsrc=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\"/></a>\n<br>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE-CODE\"><img alt=\"Code License\"\nsrc=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\"/></a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE-MODEL\"><img alt=\"Model License\"\nsrc=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\"/></a>\n<br>\n<a href=\"https://arxiv.org/pdf/2412.19437\"><b>Paper Link</b>👁️</a>\n</div>\n\n## Table of Contents\n\n1. [Introduction](#1-introduction)\n2. [Model Summary](#2-model-summary)\n3. [Model Downloads](#3-model-downloads)\n4. [Evaluation Resul"},{"ref":"P17","kind":"page","title":"deepseek-ai/DeepSeek-VL2 repository metadata","date":"2026-06-11T04:00:01.460649+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-VL2","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-VL2\n\nDescription: DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 5298\n\nForks: 1810\n\nOpen issues: 121\n\nCreated: 2024-12-13T04:55:41Z\n\nPushed: 2025-02-26T05:03:42Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"images/logo.svg\" width=\"60%\" alt=\"DeepSeek AI\" />\n</div>\n<hr>\n<div align=\"center\">\n\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"images/badge.svg\" />\n</a>\n<a href=\"https://huggingface.co/spaces/deepseek-ai/deepseek-vl2-small\" target=\"_blank\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20VL-536af5?color=536af5&logoColor=white\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"images/qr.jpeg\" target=\"_blank\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"LICENSE-CODE\">\n<img alt=\"Code License\" src=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\">\n</a>\n<a href=\"LICENSE-MODEL\">\n<img alt=\"Model License\" src=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\">\n</a>\n</div>\n\n<p align=\"center\">\n<a href=\"https://github.com/deepseek-ai/DeepSeek-VL2/tree/main?tab=readme-ov-file#3-model-download\"><b>📥 Model Download</b></a> |\n<a href=\"https://github.com/deepseek-ai/DeepSeek-VL2/tree/main?tab=readme-ov-file#4-quic"},{"ref":"P18","kind":"page","title":"deepseek-ai/DeepEP repository metadata","date":"2026-06-11T04:00:01.154441+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepEP","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepEP\n\nDescription: DeepEP: an efficient expert-parallel communication library\n\nLanguage: Cuda\n\nLicense: MIT\n\nStars: 9711\n\nForks: 1282\n\nOpen issues: 266\n\nCreated: 2025-02-17T01:33:04Z\n\nPushed: 2026-06-01T02:33:48Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DeepEP\n\nDeepEP (DeepEveryParallel) is a high-performance communication library for modern machine learning training and inference. The library currently focuses on expert parallelism (EP) — providing high-throughput and low-latency all-to-all GPU kernels (MoE dispatch and combine) with low-precision support including FP8 — while also offering experimental primitives for pipeline parallelism (PP), context parallelism (CP), and remote memory access (Engram), all designed for zero or minimal SM occupation. All kernels are compiled at runtime via a lightweight Just-In-Time (JIT) module, requiring no CUDA compilation during installation.\n\nDespite its lightweight design, DeepEP's performance matches or exceeds hardware bandwidth limits across various configurations.\n\n## News\n\n- **V2 release**: A complete refactoring of Expert Parallelism — achieving extreme performance with several times fewer SM resources compared to V1, while supporting significantly larger scale-up and scale-out domains. V2 has also switched from the NVSHMEM backend to the more lightweight **NCCL Gin backend**.\n\n### New features\n\n- **Fully JIT** (Just-In-Time compilation)\n- **NCCL Gin backend**\n- Header-only & lightweight\n- Able to reuse existing NCCL communicators\n- **EPv2**\n- High-throughput and low-latency APIs unified into a single `ElasticBuffer` interface, with a new GEMM layout\n- Larger scale-up & scale-out domain support (up to EP2048)\n- Analytical SM & QP count calculation — no more auto-tuning needed\n- Both hybrid & direct modes remain supported\n- For V3-like legacy training, SM usage reduced from 24 to 4 - 6 while maintaining equivalent or better performance\n- **0 SM Engram** (with RDMA)\n- **0 SM PP** (with RDMA)\n- **0 SM CP** (with Copy Engine)\n\n### Notes\n\n- Buffer size consumption is larger than V1\n- 0 SM RDMA low-latency EP is no longer supported\n- Engram, PP, and CP are experimental features\n\n### Still "},{"ref":"P19","kind":"page","title":"deepseek-ai/open-infra-index repository metadata","date":"2026-06-11T04:00:00.761142+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/open-infra-index","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/open-infra-index\n\nDescription: Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation\n\nLicense: CC0-1.0\n\nStars: 8003\n\nForks: 287\n\nOpen issues: 1\n\nCreated: 2025-02-21T02:29:19Z\n\nPushed: 2025-05-15T02:00:43Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-Open-Infra\" />\n</div>\n<hr>\n\n# Hello, DeepSeek Open Infra!\n\n## 202505 Industry Track Paper (ISCA25) \n### Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures\n[**📄 Arxiv Paper Link**](https://arxiv.org/abs/2505.09343)\n\n## 202504 [The Path to Open-Sourcing the DeepSeek Inference Engine](OpenSourcing_DeepSeek_Inference_Engine/README.md)\n\n## 202502 Open-Source Week\nWe're a tiny team @deepseek-ai pushing our limits in AGI exploration.\n\nStarting **this week** , Feb 24, 2025 we'll open-source 5 repos – one daily drop – not because we've made grand claims, \nbut simply as developers sharing our small-but-sincere progress with full transparency.\n\nThese are humble building blocks of our online service: documented, deployed, and battle-tested in production. \nNo vaporware, just sincere code that moved our tiny yet ambitious dream forward.\n\nWhy? Because every line shared becomes collective momentum that accelerates the journey.\nDaily unlocks begin soon. No ivory towers - just pure garage-energy and community-driven innovation 🔧\n\nStay tuned – let's geek out in the open together.\n\n### Day 1 - [FlashMLA](https://github.com/deepseek-ai/FlashMLA)\n\n**Efficient MLA Decoding Kernel for Hopper GPUs** \nOptimized for variable-length sequences, battle-tested in production \n\n🔗 [**FlashMLA GitHub Repo**](https://github.com/deepseek-ai/FlashMLA) \n✅ BF16 support \n✅ Paged KV cache (block size 64) \n⚡ Performance: 3000 GB/s memory-bound | BF16 580 TFLOPS compute-bound on H800\n\n### Day 2 - [DeepEP](https://github.com/deepseek-ai/DeepEP)\n\nExcited to introduce **DeepEP** - t"},{"ref":"P20","kind":"page","title":"deepseek-ai/profile-data repository metadata","date":"2026-06-11T04:00:00.609086+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/profile-data","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/profile-data\n\nDescription: Analyze computation-communication overlap in V3/R1.\n\nStars: 1159\n\nForks: 147\n\nOpen issues: 13\n\nCreated: 2025-02-26T07:26:06Z\n\nPushed: 2025-03-21T02:23:51Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Profiling Data in DeepSeek Infra\n\nHere, we publicly share profiling data from our training and inference framework to help the community better understand the communication-computation overlap strategies and low-level implementation details. The profiling data was captured using the PyTorch Profiler. After downloading, you can visualize it directly by navigating to chrome://tracing in the Chrome browser (or edge://tracing in the Edge browser). Notice that we simulate an absolutely balanced MoE routing strategy for profiling.\n\n## Training\n\n[[profile_data]](train.json)\n\n![train](assets/train.jpg)\n\nThe training profile data demonstrates our overlapping strategy for a pair of individual forward and backward chunks in [DualPipe](https://github.com/deepseek-ai/dualpipe). Each chunk contains 4 MoE (Mixture of Experts) layers.\nThe parallel configuration aligns with DeepSeek-V3 pretraining settings: EP64, TP1 with 4K sequence length. And the PP communication is not included during profiling for simplicity.\n\n## Inference\n\n### Prefilling\n\n[[profile_data]](prefill.json)\n\n![prefill](assets/prefill.jpg)\n\nFor prefilling, the profile employs EP32 and TP1 (in line with DeepSeek V3/R1 ’s actual online deployment), with a prompt length set to 4K and a batch size of 16K tokens per GPU. In our prefilling stage, we utilize two micro-batches to overlap computation and all-to-all communication, while ensuring that the attention computation load is balanced across the two micro-batches — meaning that the same prompt may be split between them.\n\n### Decoding\n\n[[profile_data]](decode.json)\n\n![decode](assets/decode.jpg)\n\nFor decoding, the profile employs EP128, TP1, and a prompt length of 4K (closely matching the actual online deployment configuration), with a batch size of 128 requests per GPU. Similar to prefilling, decoding also leverages two micro-batches for overlapping computation and all-to-all communication. However, unlike in prefill"},{"ref":"P21","kind":"page","title":"deepseek-ai/FlashMLA repository metadata","date":"2026-06-11T04:00:00.593416+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/FlashMLA","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/FlashMLA\n\nDescription: FlashMLA: Efficient Multi-head Latent Attention Kernels\n\nLanguage: C++\n\nLicense: MIT\n\nStars: 12698\n\nForks: 1057\n\nOpen issues: 105\n\nCreated: 2025-02-21T06:31:27Z\n\nPushed: 2026-04-30T09:04:01Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# FlashMLA\n\n## Introduction\n\nFlashMLA is DeepSeek's library of optimized attention kernels, powering the [DeepSeek-V3](https://github.com/deepseek-ai/DeepSeek-V3) and [DeepSeek-V3.2-Exp](https://github.com/deepseek-ai/DeepSeek-V3.2-Exp) models. This repository contains the following implementations:\n\n**Sparse Attention Kernels**\n\n*These kernels power DeepSeek Sparse Attention (DSA), as introduced in [this paper](https://github.com/deepseek-ai/DeepSeek-V3.2-Exp).*\n\n- Token-level sparse attention for the prefill stage\n- Token-level sparse attention for the decoding stage, with FP8 KV cache\n\n**Dense Attention Kernels**\n\n- Dense attention for the prefill stage\n- Dense attention for the decoding stage\n\n## News\n\n- **2025.09.29 Release of Sparse Attention Kernels**: With the launch of [DeepSeek-V3.2](https://github.com/deepseek-ai/DeepSeek-V3.2-Exp), we are releasing the corresponding token-level sparse attention kernels. These kernels power the model's DeepSeek Sparse Attention (DSA) and achieve up to 640 TFlops during prefilling and 410 TFlops during decoding. We also release a deep-dive blog for our new FP8 sparse decoding kernel. Check it out [here](docs/20250929-hopper-fp8-sparse-deep-dive.md).\n- **2025.08.01 Kernels for MHA on SM100**: Thanks to [NVIDIA's PR](https://github.com/deepseek-ai/FlashMLA/pull/76) for MHA forward / backward kernels on SM100!\n- **2025.04.22 Deep-Dive Blog**: We'd love to share the technical details behind the new FlashMLA kernel! Check out our deep-dive write-up [here](docs/20250422-new-kernel-deep-dive.md).\n- **2025.04.22 Performance Update**: We're excited to announce the new release of Flash MLA, which delivers 5% ~ 15% performance improvement for compute-bound workloads, achieving up to 660 TFlops on NVIDIA H800 SXM5 GPUs. The interface of the new version is fully compatible with the old one. Simply upgrade to the new version for an immediate performance b"},{"ref":"P22","kind":"page","title":"deepseek-ai/smallpond repository metadata","date":"2026-06-11T04:00:00.592386+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/smallpond","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/smallpond\n\nDescription: A lightweight data processing framework built on DuckDB and 3FS.\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 4960\n\nForks: 444\n\nOpen issues: 32\n\nCreated: 2025-02-24T09:28:17Z\n\nPushed: 2025-03-05T18:23:54Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# smallpond\n\n[![CI](https://github.com/deepseek-ai/smallpond/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/deepseek-ai/smallpond/actions/workflows/ci.yml)\n[![PyPI](https://img.shields.io/pypi/v/smallpond)](https://pypi.org/project/smallpond/)\n[![Docs](https://img.shields.io/badge/docs-latest-brightgreen.svg)](https://deepseek-ai.github.io/smallpond/)\n[![License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)\n\nA lightweight data processing framework built on [DuckDB] and [3FS].\n\n## Features\n\n- 🚀 High-performance data processing powered by DuckDB\n- 🌍 Scalable to handle PB-scale datasets\n- 🛠️ Easy operations with no long-running services\n\n## Installation\n\nPython 3.8 to 3.12 is supported.\n\n```bash\npip install smallpond\n```\n\n## Quick Start\n\n```bash\n# Download example data\nwget https://duckdb.org/data/prices.parquet\n```\n\n```python\nimport smallpond\n\n# Initialize session\nsp = smallpond.init()\n\n# Load data\ndf = sp.read_parquet(\"prices.parquet\")\n\n# Process data\ndf = df.repartition(3, hash_by=\"ticker\")\ndf = sp.partial_sql(\"SELECT ticker, min(price), max(price) FROM {0} GROUP BY ticker\", df)\n\n# Save results\ndf.write_parquet(\"output/\")\n# Show results\nprint(df.to_pandas())\n```\n\n## Documentation\n\nFor detailed guides and API reference:\n- [Getting Started](docs/source/getstarted.rst)\n- [API Reference](docs/source/api.rst)\n\n## Performance\n\nWe evaluated smallpond using the [GraySort benchmark] ([script]) on a cluster comprising 50 compute nodes and 25 storage nodes running [3FS]. The benchmark sorted 110.5TiB of data in 30 minutes and 14 seconds, achieving an average throughput of 3.66TiB/min.\n\nDetails can be found in [3FS - Gray Sort].\n\n[DuckDB]: https://duckdb.org/\n[3FS]: https://github.com/deepseek-ai/3FS\n[GraySort benchmark]: https://sortbenchmark.org/\n[script]: benchmarks/gray_sort_benchmark.py\n[3FS - Gray Sort]: https://github.com/deepseek-ai/3FS?ta"},{"ref":"P23","kind":"page","title":"deepseek-ai/EPLB repository metadata","date":"2026-06-11T04:00:00.404094+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/EPLB","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/EPLB\n\nDescription: Expert Parallelism Load Balancer\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 1388\n\nForks: 202\n\nOpen issues: 10\n\nCreated: 2025-02-26T10:41:00Z\n\nPushed: 2025-03-24T09:17:10Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Expert Parallelism Load Balancer (EPLB)\n\nWhen using expert parallelism (EP), different experts are assigned to different GPUs. Because the load of different \nexperts may vary depending on the current workload, it is important to keep the load of different GPUs balanced. \nAs described in the DeepSeek-V3 paper, we adopt a **redundant experts** strategy that duplicates heavy-loaded experts. \nThen, we heuristically pack the duplicated experts to GPUs to ensure load balancing across different GPUs. Moreover, \nthanks to the **group-limited expert routing** used in DeepSeek-V3, we also attempt to place the experts of the same \ngroup to the same node to reduce inter-node data traffic, whenever possible.\n\nTo facilitate reproduction and deployment, we open-source our deployed EP load balancing algorithm in `eplb.py`. \nThe algorithm computes a balanced expert replication and placement plan based on the estimated expert loads. Note \nthat the exact method to predict the loads of experts is out of this repo's scope. A common method is to use \nmoving average of historical statistics. \n\n## The Algorithm\n\nThe load balancing algorithm comes with two policies used for different cases.\n\n### Hierarchical Load Balancing\n\nWhen the number of server nodes divides the number of expert groups, we use the hierarchical load balancing policy to\nharness the group-limited expert routing. We first pack the expert groups to nodes evenly, ensuring the loads of \ndifferent nodes are balanced. Then, we replicate the experts within each node. Finally, we pack the replicated experts \nto individual GPUs to ensure different GPUs are load-balanced. The hierarchical load balancing policy can be used in \nprefilling stage with a smaller expert-parallel size.\n\n### Global Load Balancing\n\nIn other cases, we use the global load balancing policy that replicates the experts globally regardless of expert \ngroups, and pack the replicated experts to individual GP"},{"ref":"P24","kind":"page","title":"deepseek-ai/DualPipe repository metadata","date":"2026-06-11T04:00:00.001642+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DualPipe","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DualPipe\n\nDescription: A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 2965\n\nForks: 326\n\nOpen issues: 5\n\nCreated: 2025-02-26T13:29:57Z\n\nPushed: 2026-01-14T06:34:59Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DualPipe\n\nDualPipe is an innovative bidirectional pipeline parallelism algorithm introduced in the [DeepSeek-V3 Technical Report](https://arxiv.org/pdf/2412.19437). It achieves full overlap of forward and backward computation-communication phases, also reducing pipeline bubbles. For detailed information on computation-communication overlap, please refer to the [profile data](https://github.com/deepseek-ai/profile-data).\n\n### Schedules\n\n![dualpipe](images/dualpipe.png)\n\nExample DualPipe scheduling for 8 PP ranks and 20 micro-batches in two directions.\nThe micro-batches in the reverse direction are symmetric to those in the forward direction, so\nwe omit their batch ID for illustration simplicity. Two cells enclosed by a shared black border\nhave mutually overlapped computation and communication\n\n## DualPipeV\n\nDualPipeV is a concise V-shape schedule derived from DualPipe using a \"cut-in-half\" procedure, introduced by Sea AI Lab as \"Cut-in-half\" in their [blog post](https://hackmd.io/@ufotalent/r1lVXsa9Jg). Thanks to them for this efficient schedule!\n\n### Schedules\n\n![dualpipev](images/dualpipev.png)\n\nExample DualPipeV scheduling for 4 PP ranks (8 PP stages) and 10 micro-batches.\n\n## Pipeline Bubbles and Memory Usage Comparison (based on the same number of PP stages)\n\n| Method | Bubble | Parameter Per Device | Activation Per Device | #Devices |\n|-------------|---------------------------------|----------------------|-----------------------|----------|\n| 1F1B | (*PP*-1)(𝐹+𝐵) | 1× | *PP* | *PP* |\n| ZB1P | (*PP*-1)(𝐹+𝐵-2𝑊) | 1× | *PP* | *PP* |\n| DualPipe | (*PP*/2-1)(𝐹&𝐵+𝐵-3𝑊) | 2× | *PP*+1 | *PP* |\n| DualPipeV | (*PP*/2-1)(𝐹&𝐵+𝐵-3𝑊) | 2× | *PP*+1 | *PP*/2 |\n\n*PP* denotes the number of pp stages (even).\n𝐹 denotes the execution time of a forward chunk, 𝐵 denotes the execution time of a\nfull backward chunk, 𝑊 denotes the e"},{"ref":"P25","kind":"page","title":"deepseek-ai/3FS repository metadata","date":"2026-06-11T03:59:59.934633+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/3FS","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/3FS\n\nDescription: A high-performance distributed file system designed to address the challenges of AI training and inference workloads. \n\nLanguage: C++\n\nLicense: MIT\n\nStars: 9955\n\nForks: 1056\n\nOpen issues: 153\n\nCreated: 2025-02-27T13:36:53Z\n\nPushed: 2026-05-07T09:31:54Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Fire-Flyer File System\n\n[![Build](https://github.com/deepseek-ai/3fs/actions/workflows/build.yml/badge.svg)](https://github.com/deepseek-ai/3fs/actions/workflows/build.yml)\n[![License](https://img.shields.io/badge/LICENSE-MIT-blue.svg)](LICENSE)\n\nThe Fire-Flyer File System (3FS) is a high-performance distributed file system designed to address the challenges of AI training and inference workloads. It leverages modern SSDs and RDMA networks to provide a shared storage layer that simplifies development of distributed applications. Key features and benefits of 3FS include:\n\n- Performance and Usability\n- **Disaggregated Architecture** Combines the throughput of thousands of SSDs and the network bandwidth of hundreds of storage nodes, enabling applications to access storage resource in a locality-oblivious manner.\n- **Strong Consistency** Implements Chain Replication with Apportioned Queries (CRAQ) for strong consistency, making application code simple and easy to reason about.\n- **File Interfaces** Develops stateless metadata services backed by a transactional key-value store (e.g., FoundationDB). The file interface is well known and used everywhere. There is no need to learn a new storage API.\n\n- Diverse Workloads\n- **Data Preparation** Organizes outputs of data analytics pipelines into hierarchical directory structures and manages a large volume of intermediate outputs efficiently.\n- **Dataloaders** Eliminates the need for prefetching or shuffling datasets by enabling random access to training samples across compute nodes.\n- **Checkpointing** Supports high-throughput parallel checkpointing for large-scale training.\n- **KVCache for Inference** Provides a cost-effective alternative to DRAM-based caching, offering high throughput and significantly larger capacity.\n\n## Documentation\n\n* [Design Notes](docs/design_notes.md)\n* [Setup G"},{"ref":"P26","kind":"page","title":"deepseek-ai/DeepSeek-Prover-V2 repository metadata","date":"2026-06-11T03:59:59.835689+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-Prover-V2","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-Prover-V2\n\nLicense: NOASSERTION\n\nStars: 1273\n\nForks: 102\n\nOpen issues: 13\n\nCreated: 2025-04-30T13:42:57Z\n\nPushed: 2025-07-18T08:11:37Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V3\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\"><img alt=\"Homepage\"\nsrc=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\"/></a>\n<a href=\"https://chat.deepseek.com/\"><img alt=\"Chat\"\nsrc=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V3-536af5?color=536af5&logoColor=white\"/></a>\n<a href=\"https://huggingface.co/deepseek-ai\"><img alt=\"Hugging Face\"\nsrc=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\"/></a>\n<br>\n<a href=\"https://discord.gg/Tc7c45Zzu5\"><img alt=\"Discord\"\nsrc=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\"/></a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\"><img alt=\"Wechat\"\nsrc=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\"/></a>\n<a href=\"https://twitter.com/deepseek_ai\"><img alt=\"Twitter Follow\"\nsrc=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\"/></a>\n<br>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE-CODE\"><img alt=\"Code License\"\nsrc=\"https://img.shields.io/badge/Code_License-MIT-f5de53?&color=f5de53\"/></a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V3/blob/main/LICENSE-MODEL\"><img alt=\"Model License\"\nsrc=\"https://img.shields.io/badge/Model_License-Model_Agreement-f5de53?&color=f5de53\"/></a>\n<br>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-Prover-V2/blob/master/DeepSeek_Prover_V2.pdf\"><b>Paper Link</b>👁️</a>\n</div>\n<p align=\"center\">\n<a href=\"#2-model-summary\">Model Summary</a> |\n<a href=\"#3-proverbench\">ProverBench</a> |\n<a "},{"ref":"P27","kind":"page","title":"deepseek-ai/DeepSeek-V3.2-Exp repository metadata","date":"2026-06-11T03:59:59.82968+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-V3.2-Exp","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-V3.2-Exp\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 1601\n\nForks: 176\n\nOpen issues: 31\n\nCreated: 2025-09-29T03:25:13Z\n\nPushed: 2025-11-18T02:57:11Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# DeepSeek-V3.2-Exp\n\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/logo.svg?raw=true\" width=\"60%\" alt=\"DeepSeek-V3\" />\n</div>\n<hr>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Homepage\" src=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/badge.svg?raw=true\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://chat.deepseek.com/\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Chat\" src=\"https://img.shields.io/badge/🤖%20Chat-DeepSeek%20V3-536af5?color=536af5&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://huggingface.co/deepseek-ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n</div>\n<div align=\"center\" style=\"line-height: 1;\">\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://github.com/deepseek-ai/DeepSeek-V2/blob/main/figures/qr.jpeg?raw=true\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Wechat\" src=\"https://img.shields.io/badge/WeChat-DeepSeek%20AI-brightgreen?logo=wechat&logoColor=white\" style=\"display: inline-block; vertical-align: middle;\"/>\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\" style=\"margin: 2px;\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" style=\"display: inline-block; vertical-alig"},{"ref":"P28","kind":"page","title":"deepseek-ai/DeepSeek-OCR repository metadata","date":"2026-06-11T03:59:59.708668+00:00","date_source":null,"source_url":"https://github.com/deepseek-ai/DeepSeek-OCR","signal_url":null,"signal_json_url":null,"text":"# deepseek-ai/DeepSeek-OCR\n\nDescription: Contexts Optical Compression\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 23271\n\nForks: 2152\n\nOpen issues: 286\n\nCreated: 2025-10-17T06:14:27Z\n\nPushed: 2026-01-27T03:45:14Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<!-- markdownlint-disable first-line-h1 -->\n<!-- markdownlint-disable html -->\n<!-- markdownlint-disable no-duplicate-header -->\n\n<div align=\"center\">\n<img src=\"assets/logo.svg\" width=\"60%\" alt=\"DeepSeek AI\" />\n</div>\n\n<hr>\n<div align=\"center\">\n<a href=\"https://www.deepseek.com/\" target=\"_blank\">\n<img alt=\"Homepage\" src=\"assets/badge.svg\" />\n</a>\n<a href=\"https://huggingface.co/deepseek-ai/DeepSeek-OCR\" target=\"_blank\">\n<img alt=\"Hugging Face\" src=\"https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-DeepSeek%20AI-ffc107?color=ffc107&logoColor=white\" />\n</a>\n\n</div>\n\n<div align=\"center\">\n\n<a href=\"https://discord.gg/Tc7c45Zzu5\" target=\"_blank\">\n<img alt=\"Discord\" src=\"https://img.shields.io/badge/Discord-DeepSeek%20AI-7289da?logo=discord&logoColor=white&color=7289da\" />\n</a>\n<a href=\"https://twitter.com/deepseek_ai\" target=\"_blank\">\n<img alt=\"Twitter Follow\" src=\"https://img.shields.io/badge/Twitter-deepseek_ai-white?logo=x&logoColor=white\" />\n</a>\n\n</div>\n\n<p align=\"center\">\n<a href=\"https://huggingface.co/deepseek-ai/DeepSeek-OCR\"><b>📥 Model Download</b></a> |\n<a href=\"https://github.com/deepseek-ai/DeepSeek-OCR/blob/main/DeepSeek_OCR_paper.pdf\"><b>📄 Paper Link</b></a> |\n<a href=\"https://arxiv.org/abs/2510.18234\"><b>📄 Arxiv Paper Link</b></a> |\n</p>\n\n<h2>\n<p align=\"center\">\n<a href=\"\">DeepSeek-OCR: Contexts Optical Compression</a>\n</p>\n</h2>\n\n<p align=\"center\">\n<img src=\"assets/fig1.png\" style=\"width: 1000px\" align=center>\n</p>\n<p align=\"center\">\n<a href=\"\">Explore the boundaries of visual-text compression.</a> \n</p>\n\n## Release\n- [2026/01/27]🚀🚀🚀🚀🚀🚀 We present [DeepSeek-OCR2](https://github.com/deepseek-ai/DeepSeek-OCR-2)\n- [2025/10/23]🚀🚀🚀 DeepSeek-OCR is now officially supported in upstream [vLLM](https://docs.vllm.ai/projects/recipes/en/latest/DeepSeek/DeepSeek-OCR.html#installing-vllm). Thanks to the [vLLM](https://github.com/vllm-project/vllm) team for their help.\n- [2025/10/20]🚀"},{"ref":"E1","kind":"event","title":"deepseek-ai/DeepSeek-R1","date":"2025-01-20T03:46:07+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1","signal_url":"https://onlylabs.fyi/signals/0f9a1ff3-aa44-4860-bde1-001f162840d9","signal_json_url":"https://onlylabs.fyi/signals/0f9a1ff3-aa44-4860-bde1-001f162840d9/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1 · signal_desk=releases · occurred_at=2025-01-20T03:46:07+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1 · hf_downloads=5354651 · hf_likes=13382 · hf_params=684531386000 · pipeline=text-generation · license=mit"},{"ref":"E2","kind":"event","title":"deepseek-ai/DeepSeek-OCR","date":"2025-10-17T06:14:27+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-OCR","signal_url":"https://onlylabs.fyi/signals/db7c9ccb-8c79-458d-8a77-d5d20b2447f5","signal_json_url":"https://onlylabs.fyi/signals/db7c9ccb-8c79-458d-8a77-d5d20b2447f5/signal.json","text":"repo_new · deepseek-ai/DeepSeek-OCR · signal_desk=repos · occurred_at=2025-10-17T06:14:27+00:00 · url=https://github.com/deepseek-ai/DeepSeek-OCR · stars=23272 · hn=1003 points/244 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-OCR\",\"description\":\"Contexts Optical Compression\",\"language\":\"Python\"}"},{"ref":"E3","kind":"event","title":"deepseek-ai/DeepSeek-V3","date":"2024-12-25T12:52:23+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3","signal_url":"https://onlylabs.fyi/signals/09de0978-f669-4b89-bd32-38491c9350a3","signal_json_url":"https://onlylabs.fyi/signals/09de0978-f669-4b89-bd32-38491c9350a3/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3 · signal_desk=releases · occurred_at=2024-12-25T12:52:23+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3 · hf_downloads=1000881 · hf_likes=4086 · hf_params=684531386000 · pipeline=text-generation"},{"ref":"E4","kind":"event","title":"deepseek-ai/Janus-Pro-7B","date":"2025-01-26T12:05:50+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/Janus-Pro-7B","signal_url":"https://onlylabs.fyi/signals/2c0082b3-c4b8-4246-97f3-87de3b6ac91e","signal_json_url":"https://onlylabs.fyi/signals/2c0082b3-c4b8-4246-97f3-87de3b6ac91e/signal.json","text":"model_released · deepseek-ai/Janus-Pro-7B · signal_desk=releases · occurred_at=2025-01-26T12:05:50+00:00 · url=https://huggingface.co/deepseek-ai/Janus-Pro-7B · hf_downloads=10889 · hf_likes=3616 · pipeline=any-to-any · license=mit"},{"ref":"E5","kind":"event","title":"deepseek-ai/DeepSeek-OCR","date":"2025-10-17T06:22:05+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-OCR","signal_url":"https://onlylabs.fyi/signals/13bd7946-2594-4173-aa3a-8da00df6c14e","signal_json_url":"https://onlylabs.fyi/signals/13bd7946-2594-4173-aa3a-8da00df6c14e/signal.json","text":"model_released · deepseek-ai/DeepSeek-OCR · signal_desk=releases · occurred_at=2025-10-17T06:22:05+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-OCR · hf_downloads=2352781 · hf_likes=3277 · hf_params=3336106240 · pipeline=image-text-to-text · license=mit"},{"ref":"E6","kind":"event","title":"deepseek-ai/DeepSeek-V3-0324","date":"2025-03-24T09:28:22+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3-0324","signal_url":"https://onlylabs.fyi/signals/0c96e564-7937-4f9d-98be-842243f1c6b4","signal_json_url":"https://onlylabs.fyi/signals/0c96e564-7937-4f9d-98be-842243f1c6b4/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3-0324 · signal_desk=releases · occurred_at=2025-03-24T09:28:22+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3-0324 · hf_downloads=829899 · hf_likes=3127 · hf_params=684531386000 · pipeline=text-generation · license=mit"},{"ref":"E7","kind":"event","title":"deepseek-ai/DeepSeek-R1-0528","date":"2025-05-28T09:46:42+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-0528","signal_url":"https://onlylabs.fyi/signals/ced531f8-6671-48f1-8351-61b0ab74f839","signal_json_url":"https://onlylabs.fyi/signals/ced531f8-6671-48f1-8351-61b0ab74f839/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-0528 · signal_desk=releases · occurred_at=2025-05-28T09:46:42+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-0528 · hf_downloads=6486312 · hf_likes=2451 · hf_params=684531386000 · pipeline=text-generation · license=mit"},{"ref":"E8","kind":"event","title":"deepseek-ai/DeepSeek-V3-Base","date":"2024-12-25T12:52:06+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3-Base","signal_url":"https://onlylabs.fyi/signals/08240b36-465e-440b-aa6e-a58ab3787826","signal_json_url":"https://onlylabs.fyi/signals/08240b36-465e-440b-aa6e-a58ab3787826/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3-Base · signal_desk=releases · occurred_at=2024-12-25T12:52:06+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3-Base · hf_downloads=5943 · hf_likes=1696 · hf_params=684531386000"},{"ref":"E9","kind":"event","title":"deepseek-ai/DeepSeek-V3.2","date":"2025-12-01T02:34:49+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3.2","signal_url":"https://onlylabs.fyi/signals/ba339d3c-d329-4096-8664-058e2a88d646","signal_json_url":"https://onlylabs.fyi/signals/ba339d3c-d329-4096-8664-058e2a88d646/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3.2 · signal_desk=releases · occurred_at=2025-12-01T02:34:49+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3.2 · hf_downloads=3585856 · hf_likes=1446 · hf_params=685396921376 · pipeline=text-generation · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E10","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Qwen-32B","date":"2025-01-20T09:19:00+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B","signal_url":"https://onlylabs.fyi/signals/05c727c7-1f56-4a32-bde8-434ee4bfaf6e","signal_json_url":"https://onlylabs.fyi/signals/05c727c7-1f56-4a32-bde8-434ee4bfaf6e/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Qwen-32B · signal_desk=releases · occurred_at=2025-01-20T09:19:00+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B · hf_downloads=488395 · hf_likes=1564 · hf_params=32763876352 · pipeline=text-generation · license=mit"},{"ref":"E11","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B","date":"2025-01-20T09:04:18+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B","signal_url":"https://onlylabs.fyi/signals/a51d40df-b9ae-4304-bc21-f665eeba1e62","signal_json_url":"https://onlylabs.fyi/signals/a51d40df-b9ae-4304-bc21-f665eeba1e62/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B · signal_desk=releases · occurred_at=2025-01-20T09:04:18+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B · hf_downloads=699316 · hf_likes=1521 · hf_params=1777088000 · pipeline=text-generation · license=mit"},{"ref":"E12","kind":"event","title":"deepseek-ai/DeepSeek-V3.2-Exp","date":"2025-09-29T03:25:13+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-V3.2-Exp","signal_url":"https://onlylabs.fyi/signals/4901a6e8-565c-42d7-a9db-c35f83aab1ed","signal_json_url":"https://onlylabs.fyi/signals/4901a6e8-565c-42d7-a9db-c35f83aab1ed/signal.json","text":"repo_new · deepseek-ai/DeepSeek-V3.2-Exp · signal_desk=repos · occurred_at=2025-09-29T03:25:13+00:00 · url=https://github.com/deepseek-ai/DeepSeek-V3.2-Exp · stars=1601 · hn=309 points/50 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-V3.2-Exp\",\"language\":\"Python\"}"},{"ref":"E13","kind":"event","title":"deepseek-ai/DeepSeek-V3","date":"2024-12-26T09:52:40+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-V3","signal_url":"https://onlylabs.fyi/signals/76a4e307-9f93-41d7-a966-040b3b321cab","signal_json_url":"https://onlylabs.fyi/signals/76a4e307-9f93-41d7-a966-040b3b321cab/signal.json","text":"repo_new · deepseek-ai/DeepSeek-V3 · signal_desk=repos · occurred_at=2024-12-26T09:52:40+00:00 · url=https://github.com/deepseek-ai/DeepSeek-V3 · stars=103740 · hn=125 points/40 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-V3\",\"language\":\"Python\"}"},{"ref":"E14","kind":"event","title":"deepseek-ai/DeepSeek-R1-0528-Qwen3-8B","date":"2025-05-29T11:07:47+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B","signal_url":"https://onlylabs.fyi/signals/37bc516f-d124-4892-bea9-092462cf914c","signal_json_url":"https://onlylabs.fyi/signals/37bc516f-d124-4892-bea9-092462cf914c/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · signal_desk=releases · occurred_at=2025-05-29T11:07:47+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-0528-Qwen3-8B · hf_downloads=310634 · hf_likes=1071 · hf_params=8190735360 · pipeline=text-generation · license=mit"},{"ref":"E15","kind":"event","title":"deepseek-ai/DeepSeek-OCR-2","date":"2026-01-27T02:56:54+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-OCR-2","signal_url":"https://onlylabs.fyi/signals/d97ef29f-cfc6-45f5-ac35-b7f8ee288d1c","signal_json_url":"https://onlylabs.fyi/signals/d97ef29f-cfc6-45f5-ac35-b7f8ee288d1c/signal.json","text":"model_released · deepseek-ai/DeepSeek-OCR-2 · signal_desk=releases · occurred_at=2026-01-27T02:56:54+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-OCR-2 · hf_downloads=1826461 · hf_likes=981 · hf_params=3389119360 · pipeline=image-text-to-text · license=apache-2.0"},{"ref":"E16","kind":"event","title":"deepseek-ai/DeepSeek-V3.1-Base","date":"2025-08-19T10:01:00+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base","signal_url":"https://onlylabs.fyi/signals/f7b11198-66b9-469c-b8eb-0488544b0afa","signal_json_url":"https://onlylabs.fyi/signals/f7b11198-66b9-469c-b8eb-0488544b0afa/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3.1-Base · signal_desk=releases · occurred_at=2025-08-19T10:01:00+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Base · hf_downloads=8892 · hf_likes=1010 · hf_params=684531386000 · pipeline=text-generation · license=mit"},{"ref":"E17","kind":"event","title":"deepseek-ai/DeepSeek-V3.2-Exp","date":"2025-09-29T06:07:26+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp","signal_url":"https://onlylabs.fyi/signals/b26a6f5a-daff-4ce9-b70f-7805fa4c1ca8","signal_json_url":"https://onlylabs.fyi/signals/b26a6f5a-daff-4ce9-b70f-7805fa4c1ca8/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3.2-Exp · signal_desk=releases · occurred_at=2025-09-29T06:07:26+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp · hf_downloads=162267 · hf_likes=992 · hf_params=685396921376 · pipeline=text-generation · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E18","kind":"event","title":"deepseek-ai/DeepSeek-R1-Zero","date":"2025-01-20T03:35:48+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Zero","signal_url":"https://onlylabs.fyi/signals/8b3324e1-d660-4a9e-a5fb-01395bdc6de3","signal_json_url":"https://onlylabs.fyi/signals/8b3324e1-d660-4a9e-a5fb-01395bdc6de3/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Zero · signal_desk=releases · occurred_at=2025-01-20T03:35:48+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Zero · hf_downloads=7825 · hf_likes=958 · hf_params=684531386000 · pipeline=text-generation · license=mit"},{"ref":"E19","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Llama-8B","date":"2025-01-20T09:09:42+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B","signal_url":"https://onlylabs.fyi/signals/ef547cb4-1722-4e1f-a4e9-a21b6d0844b4","signal_json_url":"https://onlylabs.fyi/signals/ef547cb4-1722-4e1f-a4e9-a21b6d0844b4/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Llama-8B · signal_desk=releases · occurred_at=2025-01-20T09:09:42+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B · hf_downloads=437492 · hf_likes=863 · hf_params=8030261248 · pipeline=text-generation · license=mit"},{"ref":"E20","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Qwen-7B","date":"2025-01-20T09:16:14+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B","signal_url":"https://onlylabs.fyi/signals/23e22e7c-4f8f-4efa-8357-155add940712","signal_json_url":"https://onlylabs.fyi/signals/23e22e7c-4f8f-4efa-8357-155add940712/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Qwen-7B · signal_desk=releases · occurred_at=2025-01-20T09:16:14+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B · hf_downloads=483057 · hf_likes=841 · hf_params=7615616512 · pipeline=text-generation · license=mit"},{"ref":"E21","kind":"event","title":"deepseek-ai/DeepSeek-Prover-V2-671B","date":"2025-04-30T06:14:35+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B","signal_url":"https://onlylabs.fyi/signals/06af1777-c412-4081-aa75-06d141b3a264","signal_json_url":"https://onlylabs.fyi/signals/06af1777-c412-4081-aa75-06d141b3a264/signal.json","text":"model_released · deepseek-ai/DeepSeek-Prover-V2-671B · signal_desk=releases · occurred_at=2025-04-30T06:14:35+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-671B · hf_downloads=633 · hf_likes=830 · hf_params=684531386000 · pipeline=text-generation"},{"ref":"E22","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Llama-70B","date":"2025-01-20T09:13:33+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B","signal_url":"https://onlylabs.fyi/signals/be6db875-18ee-4f22-bcd2-ef51cbdb2e49","signal_json_url":"https://onlylabs.fyi/signals/be6db875-18ee-4f22-bcd2-ef51cbdb2e49/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Llama-70B · signal_desk=releases · occurred_at=2025-01-20T09:13:33+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B · hf_downloads=115254 · hf_likes=777 · hf_params=70553706496 · pipeline=text-generation · license=mit"},{"ref":"E23","kind":"event","title":"deepseek-ai/awesome-deepseek-integration","date":"2024-01-11T02:43:17+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/awesome-deepseek-integration","signal_url":"https://onlylabs.fyi/signals/cb4d3845-2c60-42c1-bd71-85b3cddf859d","signal_json_url":"https://onlylabs.fyi/signals/cb4d3845-2c60-42c1-bd71-85b3cddf859d/signal.json","text":"repo_new · deepseek-ai/awesome-deepseek-integration · signal_desk=repos · occurred_at=2024-01-11T02:43:17+00:00 · url=https://github.com/deepseek-ai/awesome-deepseek-integration · stars=37819 · hn=116 points/24 comments · raw={\"repo\":\"deepseek-ai/awesome-deepseek-integration\",\"description\":\"Integrate the DeepSeek API into popular software\"}"},{"ref":"E24","kind":"event","title":"deepseek-ai/DeepSeek-V3.2-Speciale","date":"2025-11-28T03:00:02+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale","signal_url":"https://onlylabs.fyi/signals/532117f6-71d9-4ab2-9e3d-494bb7606466","signal_json_url":"https://onlylabs.fyi/signals/532117f6-71d9-4ab2-9e3d-494bb7606466/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3.2-Speciale · signal_desk=releases · occurred_at=2025-11-28T03:00:02+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale · hf_downloads=9752 · hf_likes=708 · hf_params=685396921376 · pipeline=text-generation · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E25","kind":"event","title":"deepseek-ai/DeepSeek-R1-Distill-Qwen-14B","date":"2025-01-20T09:18:27+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B","signal_url":"https://onlylabs.fyi/signals/9d501bbb-9451-4562-97de-11973a84b2c3","signal_json_url":"https://onlylabs.fyi/signals/9d501bbb-9451-4562-97de-11973a84b2c3/signal.json","text":"model_released · deepseek-ai/DeepSeek-R1-Distill-Qwen-14B · signal_desk=releases · occurred_at=2025-01-20T09:18:27+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B · hf_downloads=491451 · hf_likes=654 · hf_params=14770033664 · license=mit"},{"ref":"E26","kind":"event","title":"deepseek-ai/DeepSeek-Math","date":"2024-02-05T07:25:51+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-Math","signal_url":"https://onlylabs.fyi/signals/ebdf9e46-3db4-424d-9d62-78d1553d0eda","signal_json_url":"https://onlylabs.fyi/signals/ebdf9e46-3db4-424d-9d62-78d1553d0eda/signal.json","text":"repo_new · deepseek-ai/DeepSeek-Math · signal_desk=repos · occurred_at=2024-02-05T07:25:51+00:00 · url=https://github.com/deepseek-ai/DeepSeek-Math · stars=3319 · hn=114 points/39 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-Math\",\"description\":\"DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models\",\"language\":\"Python\"}"},{"ref":"E27","kind":"event","title":"deepseek-ai/Janus-Pro-1B","date":"2025-01-26T12:07:49+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/Janus-Pro-1B","signal_url":"https://onlylabs.fyi/signals/7e8c1bab-3fd5-4967-812c-f3bcaae3b457","signal_json_url":"https://onlylabs.fyi/signals/7e8c1bab-3fd5-4967-812c-f3bcaae3b457/signal.json","text":"model_released · deepseek-ai/Janus-Pro-1B · signal_desk=releases · occurred_at=2025-01-26T12:07:49+00:00 · url=https://huggingface.co/deepseek-ai/Janus-Pro-1B · hf_downloads=14498 · hf_likes=477 · pipeline=any-to-any · license=mit"},{"ref":"E28","kind":"event","title":"deepseek-ai/DeepSeek-R1","date":"2025-01-20T11:57:28+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-R1","signal_url":"https://onlylabs.fyi/signals/f4c1e6fc-b661-443e-8a6e-7f98418cd993","signal_json_url":"https://onlylabs.fyi/signals/f4c1e6fc-b661-443e-8a6e-7f98418cd993/signal.json","text":"repo_new · deepseek-ai/DeepSeek-R1 · signal_desk=repos · occurred_at=2025-01-20T11:57:28+00:00 · url=https://github.com/deepseek-ai/DeepSeek-R1 · stars=92010 · raw={\"repo\":\"deepseek-ai/DeepSeek-R1\"}"},{"ref":"E29","kind":"event","title":"deepseek-ai/deepseek-vl2","date":"2024-12-13T09:06:44+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/deepseek-vl2","signal_url":"https://onlylabs.fyi/signals/82a5ecf0-ed27-4039-befb-177de6d773f1","signal_json_url":"https://onlylabs.fyi/signals/82a5ecf0-ed27-4039-befb-177de6d773f1/signal.json","text":"model_released · deepseek-ai/deepseek-vl2 · signal_desk=releases · occurred_at=2024-12-13T09:06:44+00:00 · url=https://huggingface.co/deepseek-ai/deepseek-vl2 · hf_downloads=2680 · hf_likes=385 · hf_params=27480134248 · pipeline=image-text-to-text · license=other"},{"ref":"E30","kind":"event","title":"deepseek-ai/DeepSeek-V4-Pro-Base","date":"2026-04-22T06:04:35+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-Base","signal_url":"https://onlylabs.fyi/signals/906adf75-ac56-484b-acee-a3c5a3156607","signal_json_url":"https://onlylabs.fyi/signals/906adf75-ac56-484b-acee-a3c5a3156607/signal.json","text":"model_released · deepseek-ai/DeepSeek-V4-Pro-Base · signal_desk=releases · occurred_at=2026-04-22T06:04:35+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-Base · hf_downloads=20063 · hf_likes=291 · hf_params=1600790440862"},{"ref":"E31","kind":"event","title":"deepseek-ai/deepseek-vl2-tiny","date":"2024-12-13T08:49:22+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/deepseek-vl2-tiny","signal_url":"https://onlylabs.fyi/signals/c6a29a51-2394-47b6-bfa5-98c6d9dbe6a6","signal_json_url":"https://onlylabs.fyi/signals/c6a29a51-2394-47b6-bfa5-98c6d9dbe6a6/signal.json","text":"model_released · deepseek-ai/deepseek-vl2-tiny · signal_desk=releases · occurred_at=2024-12-13T08:49:22+00:00 · url=https://huggingface.co/deepseek-ai/deepseek-vl2-tiny · hf_downloads=754397 · hf_likes=248 · hf_params=3370501440 · pipeline=image-text-to-text · license=other"},{"ref":"E32","kind":"event","title":"deepseek-ai/Janus","date":"2024-10-18T03:48:16+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/Janus","signal_url":"https://onlylabs.fyi/signals/77c51133-68c1-4dfb-9533-66229903c51b","signal_json_url":"https://onlylabs.fyi/signals/77c51133-68c1-4dfb-9533-66229903c51b/signal.json","text":"repo_new · deepseek-ai/Janus · signal_desk=repos · occurred_at=2024-10-18T03:48:16+00:00 · url=https://github.com/deepseek-ai/Janus · stars=17749 · hn=36 points/3 comments · raw={\"repo\":\"deepseek-ai/Janus\",\"description\":\"Janus-Series: Unified Multimodal Understanding and Generation Models\",\"language\":\"Python\"}"},{"ref":"E33","kind":"event","title":"deepseek-ai/DeepSeek-V4-Flash-Base","date":"2026-04-22T06:03:55+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-Base","signal_url":"https://onlylabs.fyi/signals/8c53a2f1-05e0-43c1-8fc8-d579e4e76046","signal_json_url":"https://onlylabs.fyi/signals/8c53a2f1-05e0-43c1-8fc8-d579e4e76046/signal.json","text":"model_released · deepseek-ai/DeepSeek-V4-Flash-Base · signal_desk=releases · occurred_at=2026-04-22T06:03:55+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-Base · hf_downloads=67567 · hf_likes=242 · hf_params=292021347282"},{"ref":"E34","kind":"event","title":"deepseek-ai/DeepSeek-V2.5-1210","date":"2024-12-10T05:51:13+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V2.5-1210","signal_url":"https://onlylabs.fyi/signals/75e1ebab-44c6-49c9-8729-04be63066201","signal_json_url":"https://onlylabs.fyi/signals/75e1ebab-44c6-49c9-8729-04be63066201/signal.json","text":"model_released · deepseek-ai/DeepSeek-V2.5-1210 · signal_desk=releases · occurred_at=2024-12-10T05:51:13+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V2.5-1210 · hf_downloads=614 · hf_likes=257 · hf_params=235741434880 · pipeline=text-generation · license=other"},{"ref":"E35","kind":"event","title":"deepseek-ai/LPLB","date":"2025-11-19T07:02:55+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/LPLB","signal_url":"https://onlylabs.fyi/signals/982f6427-b213-4835-afa2-4740dbb9f7e2","signal_json_url":"https://onlylabs.fyi/signals/982f6427-b213-4835-afa2-4740dbb9f7e2/signal.json","text":"repo_new · deepseek-ai/LPLB · signal_desk=repos · occurred_at=2025-11-19T07:02:55+00:00 · url=https://github.com/deepseek-ai/LPLB · stars=504 · hn=43 points/0 comments · raw={\"repo\":\"deepseek-ai/LPLB\",\"description\":\"An early research stage expert-parallel load balancer for MoE models based on linear programming.\",\"language\":\"Python\"}"},{"ref":"E36","kind":"event","title":"deepseek-ai/DeepSeek-VL2","date":"2024-12-13T04:55:41+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-VL2","signal_url":"https://onlylabs.fyi/signals/a7d39249-40b7-415a-ab45-44a0e758aab4","signal_json_url":"https://onlylabs.fyi/signals/a7d39249-40b7-415a-ab45-44a0e758aab4/signal.json","text":"repo_new · deepseek-ai/DeepSeek-VL2 · signal_desk=repos · occurred_at=2024-12-13T04:55:41+00:00 · url=https://github.com/deepseek-ai/DeepSeek-VL2 · stars=5298 · hn=36 points/7 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-VL2\",\"description\":\"DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding\",\"language\":\"Python\"}"},{"ref":"E37","kind":"event","title":"deepseek-ai/deepseek-vl2-small","date":"2024-12-13T09:01:03+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/deepseek-vl2-small","signal_url":"https://onlylabs.fyi/signals/a8163c1c-6aea-4ae7-9515-b374e82cc5b8","signal_json_url":"https://onlylabs.fyi/signals/a8163c1c-6aea-4ae7-9515-b374e82cc5b8/signal.json","text":"model_released · deepseek-ai/deepseek-vl2-small · signal_desk=releases · occurred_at=2024-12-13T09:01:03+00:00 · url=https://huggingface.co/deepseek-ai/deepseek-vl2-small · hf_downloads=9287 · hf_likes=179 · hf_params=16148349504 · pipeline=image-text-to-text · license=other"},{"ref":"E38","kind":"event","title":"deepseek-ai/DeepSeek-Coder","date":"2023-10-20T06:38:01+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-Coder","signal_url":"https://onlylabs.fyi/signals/b9659eb2-f721-4a8e-b850-c0f8288d8918","signal_json_url":"https://onlylabs.fyi/signals/b9659eb2-f721-4a8e-b850-c0f8288d8918/signal.json","text":"repo_new · deepseek-ai/DeepSeek-Coder · signal_desk=repos · occurred_at=2023-10-20T06:38:01+00:00 · url=https://github.com/deepseek-ai/DeepSeek-Coder · stars=23660 · hn=6 points/1 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-Coder\",\"description\":\"DeepSeek Coder: Let the Code Write Itself\",\"language\":\"Python\"}"},{"ref":"E39","kind":"event","title":"deepseek-ai/DeepSeek-Prover-V2-7B","date":"2025-04-30T06:32:28+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-7B","signal_url":"https://onlylabs.fyi/signals/394c3858-25b3-4be1-bb53-157694c52d76","signal_json_url":"https://onlylabs.fyi/signals/394c3858-25b3-4be1-bb53-157694c52d76/signal.json","text":"model_released · deepseek-ai/DeepSeek-Prover-V2-7B · signal_desk=releases · occurred_at=2025-04-30T06:32:28+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-Prover-V2-7B · hf_downloads=14672 · hf_likes=146 · hf_params=6910365696"},{"ref":"E40","kind":"event","title":"deepseek-ai/DeepSeek-V2","date":"2024-04-22T06:55:47+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-V2","signal_url":"https://onlylabs.fyi/signals/a81767a4-0a0f-4023-9894-f9dcb59aad52","signal_json_url":"https://onlylabs.fyi/signals/a81767a4-0a0f-4023-9894-f9dcb59aad52/signal.json","text":"repo_new · deepseek-ai/DeepSeek-V2 · signal_desk=repos · occurred_at=2024-04-22T06:55:47+00:00 · url=https://github.com/deepseek-ai/DeepSeek-V2 · stars=5011 · hn=14 points/3 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-V2\",\"description\":\"DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model\"}"},{"ref":"E41","kind":"event","title":"deepseek-ai/DeepSeek-Coder-V2","date":"2024-06-14T03:39:37+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-Coder-V2","signal_url":"https://onlylabs.fyi/signals/909fc632-e326-4fcc-ae6c-956f11c8fa75","signal_json_url":"https://onlylabs.fyi/signals/909fc632-e326-4fcc-ae6c-956f11c8fa75/signal.json","text":"repo_new · deepseek-ai/DeepSeek-Coder-V2 · signal_desk=repos · occurred_at=2024-06-14T03:39:37+00:00 · url=https://github.com/deepseek-ai/DeepSeek-Coder-V2 · stars=6833 · hn=9 points/1 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-Coder-V2\",\"description\":\"DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence\"}"},{"ref":"E42","kind":"event","title":"deepseek-ai/DeepSeek-LLM","date":"2023-11-29T11:07:49+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-LLM","signal_url":"https://onlylabs.fyi/signals/0467336a-afd7-4b55-a68f-71e5b591bb5c","signal_json_url":"https://onlylabs.fyi/signals/0467336a-afd7-4b55-a68f-71e5b591bb5c/signal.json","text":"repo_new · deepseek-ai/DeepSeek-LLM · signal_desk=repos · occurred_at=2023-11-29T11:07:49+00:00 · url=https://github.com/deepseek-ai/DeepSeek-LLM · stars=7034 · hn=7 points/1 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-LLM\",\"description\":\"DeepSeek LLM: Let there be answers\",\"language\":\"Makefile\"}"},{"ref":"E43","kind":"event","title":"deepseek-ai/DeepSeek-V3.2-Exp-Base","date":"2025-09-29T02:43:23+00:00","date_source":"source","source_url":"https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp-Base","signal_url":"https://onlylabs.fyi/signals/fbc2ecbf-d135-4c02-b649-64fee9c020b2","signal_json_url":"https://onlylabs.fyi/signals/fbc2ecbf-d135-4c02-b649-64fee9c020b2/signal.json","text":"model_released · deepseek-ai/DeepSeek-V3.2-Exp-Base · signal_desk=releases · occurred_at=2025-09-29T02:43:23+00:00 · url=https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp-Base · hf_downloads=348 · hf_likes=67 · hf_params=685396921376 · pipeline=text-generation · license=mit"},{"ref":"E44","kind":"event","title":"deepseek-ai/FlashMLA","date":"2025-02-21T06:31:27+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/FlashMLA","signal_url":"https://onlylabs.fyi/signals/a59ea9f1-6490-4933-b44a-c1d0b7198709","signal_json_url":"https://onlylabs.fyi/signals/a59ea9f1-6490-4933-b44a-c1d0b7198709/signal.json","text":"repo_new · deepseek-ai/FlashMLA · signal_desk=repos · occurred_at=2025-02-21T06:31:27+00:00 · url=https://github.com/deepseek-ai/FlashMLA · stars=12698 · raw={\"repo\":\"deepseek-ai/FlashMLA\",\"description\":\"FlashMLA: Efficient Multi-head Latent Attention Kernels\",\"language\":\"C++\"}"},{"ref":"E45","kind":"event","title":"deepseek-ai/DeepSeek-Math-V2","date":"2025-11-27T06:12:26+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-Math-V2","signal_url":"https://onlylabs.fyi/signals/ea19f925-5930-481c-ae17-16bbbbf865bc","signal_json_url":"https://onlylabs.fyi/signals/ea19f925-5930-481c-ae17-16bbbbf865bc/signal.json","text":"repo_new · deepseek-ai/DeepSeek-Math-V2 · signal_desk=repos · occurred_at=2025-11-27T06:12:26+00:00 · url=https://github.com/deepseek-ai/DeepSeek-Math-V2 · stars=1590 · hn=9 points/0 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-Math-V2\",\"language\":\"Python\"}"},{"ref":"E46","kind":"event","title":"deepseek-ai/3FS","date":"2025-02-27T13:36:53+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/3FS","signal_url":"https://onlylabs.fyi/signals/ec0e7f81-e2e1-436d-8563-e8c2e2df5e6a","signal_json_url":"https://onlylabs.fyi/signals/ec0e7f81-e2e1-436d-8563-e8c2e2df5e6a/signal.json","text":"repo_new · deepseek-ai/3FS · signal_desk=repos · occurred_at=2025-02-27T13:36:53+00:00 · url=https://github.com/deepseek-ai/3FS · stars=9956 · data_radar_lanes=Infrastructure · data_radar_terms=inference, training · data_radar_reason=DeepSeek has a repo signal matching infrastructure. · raw={\"repo\":\"deepseek-ai/3FS\",\"description\":\" A high-performance distributed file system designed to address the challenges of AI training and inference workloads. \",\"language\":\"C++\"}"},{"ref":"E47","kind":"event","title":"deepseek-ai/DeepSeek-OCR-2","date":"2026-01-27T03:05:42+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-OCR-2","signal_url":"https://onlylabs.fyi/signals/9559d00e-0b1d-47ca-9ca0-39fd9e151508","signal_json_url":"https://onlylabs.fyi/signals/9559d00e-0b1d-47ca-9ca0-39fd9e151508/signal.json","text":"repo_new · deepseek-ai/DeepSeek-OCR-2 · signal_desk=repos · occurred_at=2026-01-27T03:05:42+00:00 · url=https://github.com/deepseek-ai/DeepSeek-OCR-2 · stars=2948 · hn=7 points/0 comments · raw={\"repo\":\"deepseek-ai/DeepSeek-OCR-2\",\"description\":\"Visual Causal Flow\",\"language\":\"Python\"}"},{"ref":"E48","kind":"event","title":"deepseek-ai/DeepEP","date":"2025-02-17T01:33:04+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepEP","signal_url":"https://onlylabs.fyi/signals/26e4b823-e672-4c0d-9f52-3fb9441bd571","signal_json_url":"https://onlylabs.fyi/signals/26e4b823-e672-4c0d-9f52-3fb9441bd571/signal.json","text":"repo_new · deepseek-ai/DeepEP · signal_desk=repos · occurred_at=2025-02-17T01:33:04+00:00 · url=https://github.com/deepseek-ai/DeepEP · stars=9714 · raw={\"repo\":\"deepseek-ai/DeepEP\",\"description\":\"DeepEP: an efficient expert-parallel communication library\",\"language\":\"Cuda\"}"},{"ref":"E49","kind":"event","title":"deepseek-ai/DeepGEMM nv_dev_950d31a","date":"2026-06-02T02:21:17+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_950d31a","signal_url":"https://onlylabs.fyi/signals/327262a9-01cf-4a25-bf10-058239428a1c","signal_json_url":"https://onlylabs.fyi/signals/327262a9-01cf-4a25-bf10-058239428a1c/signal.json","text":"release · deepseek-ai/DeepGEMM nv_dev_950d31a · signal_desk=releases · occurred_at=2026-06-02T02:21:17+00:00 · url=https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_950d31a · raw={\"repo\":\"deepseek-ai/DeepGEMM\"}"},{"ref":"E50","kind":"event","title":"deepseek-ai/open-infra-index","date":"2025-02-21T02:29:19+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/open-infra-index","signal_url":"https://onlylabs.fyi/signals/e6d1af4c-1a74-4b82-90e2-1c23bdb1cf6d","signal_json_url":"https://onlylabs.fyi/signals/e6d1af4c-1a74-4b82-90e2-1c23bdb1cf6d/signal.json","text":"repo_new · deepseek-ai/open-infra-index · signal_desk=repos · occurred_at=2025-02-21T02:29:19+00:00 · url=https://github.com/deepseek-ai/open-infra-index · stars=8003 · data_radar_lanes=Infrastructure, Product and customer · data_radar_terms=infra, infrastructure, product · data_radar_reason=DeepSeek has a repo signal matching infrastructure, product and customer. · raw={\"repo\":\"deepseek-ai/open-infra-index\",\"description\":\"Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation\"}"},{"ref":"E51","kind":"event","title":"deepseek-ai/awesome-deepseek-agent","date":"2026-04-27T06:37:38+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/awesome-deepseek-agent","signal_url":"https://onlylabs.fyi/signals/31ab18b8-e694-4bc1-93b8-2a42088e7a91","signal_json_url":"https://onlylabs.fyi/signals/31ab18b8-e694-4bc1-93b8-2a42088e7a91/signal.json","text":"repo_new · deepseek-ai/awesome-deepseek-agent · signal_desk=repos · occurred_at=2026-04-27T06:37:38+00:00 · url=https://github.com/deepseek-ai/awesome-deepseek-agent · stars=3483 · raw={\"repo\":\"deepseek-ai/awesome-deepseek-agent\"}"},{"ref":"E52","kind":"event","title":"deepseek-ai/TileKernels","date":"2026-04-22T05:38:31+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/TileKernels","signal_url":"https://onlylabs.fyi/signals/0d5a2ac4-b1e6-45e0-a491-410adc1c5100","signal_json_url":"https://onlylabs.fyi/signals/0d5a2ac4-b1e6-45e0-a491-410adc1c5100/signal.json","text":"repo_new · deepseek-ai/TileKernels · signal_desk=repos · occurred_at=2026-04-22T05:38:31+00:00 · url=https://github.com/deepseek-ai/TileKernels · stars=1581 · hn=2 points/0 comments · raw={\"repo\":\"deepseek-ai/TileKernels\",\"description\":\"A kernel library written in tilelang\",\"language\":\"Python\"}"},{"ref":"E53","kind":"event","title":"deepseek-ai/Engram","date":"2026-01-12T05:26:50+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/Engram","signal_url":"https://onlylabs.fyi/signals/142c0e97-8b96-4959-843a-8f24b923afba","signal_json_url":"https://onlylabs.fyi/signals/142c0e97-8b96-4959-843a-8f24b923afba/signal.json","text":"repo_new · deepseek-ai/Engram · signal_desk=repos · occurred_at=2026-01-12T05:26:50+00:00 · url=https://github.com/deepseek-ai/Engram · stars=4447 · hn=3 points/0 comments · raw={\"repo\":\"deepseek-ai/Engram\",\"description\":\"Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models\",\"language\":\"Python\"}"},{"ref":"E54","kind":"event","title":"deepseek-ai/DeepGEMM","date":"2025-02-13T09:09:21+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepGEMM","signal_url":"https://onlylabs.fyi/signals/0ba50d62-1566-4352-b0e3-a4dca7d1708f","signal_json_url":"https://onlylabs.fyi/signals/0ba50d62-1566-4352-b0e3-a4dca7d1708f/signal.json","text":"repo_new · deepseek-ai/DeepGEMM · signal_desk=repos · occurred_at=2025-02-13T09:09:21+00:00 · url=https://github.com/deepseek-ai/DeepGEMM · stars=7364 · data_radar_lanes=Infrastructure · data_radar_terms=scaling · data_radar_reason=DeepSeek has a repo signal matching infrastructure. · raw={\"repo\":\"deepseek-ai/DeepGEMM\",\"description\":\"DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling\",\"language\":\"Cuda\"}"},{"ref":"E55","kind":"event","title":"deepseek-ai/DeepGEMM nv_dev_67fc648","date":"2026-05-13T02:47:37+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_67fc648","signal_url":"https://onlylabs.fyi/signals/7f61f5c7-d4e8-4bc5-99f6-c01a6c7c59e5","signal_json_url":"https://onlylabs.fyi/signals/7f61f5c7-d4e8-4bc5-99f6-c01a6c7c59e5/signal.json","text":"release · deepseek-ai/DeepGEMM nv_dev_67fc648 · signal_desk=releases · occurred_at=2026-05-13T02:47:37+00:00 · url=https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_67fc648 · raw={\"repo\":\"deepseek-ai/DeepGEMM\"}"},{"ref":"E56","kind":"event","title":"deepseek-ai/smallpond","date":"2025-02-24T09:28:17+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/smallpond","signal_url":"https://onlylabs.fyi/signals/8a32dd4e-768c-4ff9-880a-b4e704ebbf57","signal_json_url":"https://onlylabs.fyi/signals/8a32dd4e-768c-4ff9-880a-b4e704ebbf57/signal.json","text":"repo_new · deepseek-ai/smallpond · signal_desk=repos · occurred_at=2025-02-24T09:28:17+00:00 · url=https://github.com/deepseek-ai/smallpond · stars=4960 · data_radar_lanes=Data demand · data_radar_terms=data · data_radar_reason=DeepSeek has a repo signal matching data demand. · raw={\"repo\":\"deepseek-ai/smallpond\",\"description\":\"A lightweight data processing framework built on DuckDB and 3FS.\",\"language\":\"Python\"}"},{"ref":"E57","kind":"event","title":"deepseek-ai/DeepSeek-VL","date":"2024-03-07T08:32:57+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepSeek-VL","signal_url":"https://onlylabs.fyi/signals/3193593d-28c4-4e37-8b76-24e4296846c3","signal_json_url":"https://onlylabs.fyi/signals/3193593d-28c4-4e37-8b76-24e4296846c3/signal.json","text":"repo_new · deepseek-ai/DeepSeek-VL · signal_desk=repos · occurred_at=2024-03-07T08:32:57+00:00 · url=https://github.com/deepseek-ai/DeepSeek-VL · stars=4126 · raw={\"repo\":\"deepseek-ai/DeepSeek-VL\",\"description\":\"DeepSeek-VL: Towards Real-World Vision-Language Understanding\",\"language\":\"Python\"}"},{"ref":"E58","kind":"event","title":"deepseek-ai/DeepGEMM nv_dev_c491439","date":"2026-04-22T08:26:51+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_c491439","signal_url":"https://onlylabs.fyi/signals/e59b5c06-c6dd-4f69-b437-266d41560fe2","signal_json_url":"https://onlylabs.fyi/signals/e59b5c06-c6dd-4f69-b437-266d41560fe2/signal.json","text":"release · deepseek-ai/DeepGEMM nv_dev_c491439 · signal_desk=releases · occurred_at=2026-04-22T08:26:51+00:00 · url=https://github.com/deepseek-ai/DeepGEMM/releases/tag/nv_dev_c491439 · raw={\"repo\":\"deepseek-ai/DeepGEMM\"}"},{"ref":"E59","kind":"event","title":"deepseek-ai/DreamCraft3D","date":"2023-10-23T07:40:20+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DreamCraft3D","signal_url":"https://onlylabs.fyi/signals/188bf97f-9581-4d03-9927-f5148bd76f56","signal_json_url":"https://onlylabs.fyi/signals/188bf97f-9581-4d03-9927-f5148bd76f56/signal.json","text":"repo_new · deepseek-ai/DreamCraft3D · signal_desk=repos · occurred_at=2023-10-23T07:40:20+00:00 · url=https://github.com/deepseek-ai/DreamCraft3D · stars=3006 · raw={\"repo\":\"deepseek-ai/DreamCraft3D\",\"description\":\"[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior\",\"language\":\"Python\"}"},{"ref":"E60","kind":"event","title":"deepseek-ai/DualPipe","date":"2025-02-26T13:29:57+00:00","date_source":"source","source_url":"https://github.com/deepseek-ai/DualPipe","signal_url":"https://onlylabs.fyi/signals/9e571e05-35bc-41ff-a9da-c5f796566988","signal_json_url":"https://onlylabs.fyi/signals/9e571e05-35bc-41ff-a9da-c5f796566988/signal.json","text":"repo_new · deepseek-ai/DualPipe · signal_desk=repos · occurred_at=2025-02-26T13:29:57+00:00 · url=https://github.com/deepseek-ai/DualPipe · stars=2965 · data_radar_lanes=Infrastructure · data_radar_terms=training · data_radar_reason=DeepSeek has a repo signal matching infrastructure. · raw={\"repo\":\"deepseek-ai/DualPipe\",\"description\":\"A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.\",\"language\":\"Python\"}"}]}