{"schema_version":"onlylabs.public_analysis_evidence.v1","title":"Zhipu AI (GLM) analysis evidence pack","description":"Public onlylabs evidence pack for cited agent analysis: captured pages, ranked public signals, and stored web-search provenance used by the background analysis workflow.","url":"https://onlylabs.fyi/analysis/zhipu","json_url":"https://onlylabs.fyi/analysis/zhipu/evidence.json","generated_at":"2026-06-11T18:08:19.694Z","org":{"slug":"zhipu","name":"Zhipu AI (GLM)","category":"frontier-lab","category_label":"Frontier lab","dossier_url":"https://onlylabs.fyi/labs/zhipu"},"analysis":{"url":"https://onlylabs.fyi/analysis/zhipu","json_url":"https://onlylabs.fyi/analysis/zhipu/analysis.json","generated_at":"2026-06-08T15:59:10.224+00:00"},"workflow":{"version":"synthesize-analyses","provider":null,"model":null,"agent":null,"public_pack_mode":"local-pages-and-events","live_web_fetches":false,"note":"Public evidence exports do not trigger live Exa calls; stored Exa provenance is included when analysis metadata contains it."},"stats":{"pages":28,"events":96,"web":0,"evidence":88,"signal_desks":{"hiring":0,"forks":0,"releases":30,"talking":0,"repos":30},"data_radar_lanes":{"data":0,"evals":1,"infrastructure":0,"safety":0,"product":0},"data_radar_matches":1,"stored_analysis_evidence":null,"stored_analysis_web":null,"stored_analysis_signal_desks":null,"stored_analysis_data_radar_lanes":null,"stored_analysis_data_radar_matches":null},"stored_web_provenance":null,"evidence":[{"ref":"P1","kind":"page","title":"zai-org/CogView2 repository metadata","date":"2026-06-11T01:46:01.81971+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogView2","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogView2\n\nDescription: official code repo for paper \"CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers\"\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 955\n\nForks: 86\n\nOpen issues: 25\n\nCreated: 2022-04-25T10:15:47Z\n\nPushed: 2022-08-03T17:38:41Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<p align=\"center\">\n<img src=\"assets/logo2.png\"/>\n</p>\n<p align=\"center\">\n<b>Generate vivid Images for Chinese / English text</b>\n</p>\n\nCogView2 is a hierarchical transformer (6B-9B-9B parameters) for text-to-image generation in general domain. This implementation is based on the [SwissArmyTransformer](https://github.com/THUDM/SwissArmyTransformer) library (v0.2).\n\n* **Read** our paper [CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers](https://arxiv.org/abs/2204.14217) on ArXiv for a formal introduction. The *LoPAR* accelarate the generation and *CogLM* enables the model for bidirectional completion.\n* **Run** our pretrained models from text-to-image generation or text-guided completion! Please use A100 GPU.\n* **Cite** our paper if you find our work is helpful~ \n```\n@article{ding2022cogview2,\ntitle={CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers},\nauthor={Ding, Ming and Zheng, Wendi and Hong, Wenyi and Tang, Jie},\njournal={arXiv preprint arXiv:2204.14217},\nyear={2022}\n}\n```\n\n## Web Demo\n\n- Thank the Huggingface team for integrating CogView2 into [Huggingface Spaces 🤗](https://huggingface.co/spaces) using [Gradio](https://github.com/gradio-app/gradio). Try out the Web Demo: [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/THUDM/CogView2)\n\n- Thank the Replicate team to deploy a web demo! Try at [![Replicate](https://replicate.com/thudm/cogview2/badge)](https://replicate.com/thudm/cogview2) .\n\n## Getting Started\n### Setup\n* Hardware: Linux servers with Nvidia A100s are recommended, but it is also okay to run the pretrained models with smaller `--max-inference-batch-size` or training smaller models on less powerful GPUs.\n* Environment: install dependencies via `pip install -r requireme"},{"ref":"P2","kind":"page","title":"zai-org/CogView repository metadata","date":"2026-06-11T01:46:01.576822+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogView","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogView\n\nDescription: Text-to-Image generation. The repo for NeurIPS 2021 paper \"CogView: Mastering Text-to-Image Generation via Transformers\".\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 1797\n\nForks: 176\n\nOpen issues: 19\n\nCreated: 2021-05-25T14:48:31Z\n\nPushed: 2023-09-25T04:07:19Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<p align=\"center\">\n<img src=\"assets/logo.png\"/>\n</p>\n<p align=\"center\">\n<b>Generate vivid Images for <i>Any</i> (Chinese) text</b>\n</p>\n\n![teaser](assets/cogviewcase.png)\n\n**News!** The paper of ImageReward is accepted by NeurIPS 2023!\n\n**News!** The codes of ImageReward ([paper link](https://arxiv.org/abs/2304.05977.pdf)) have been released at https://github.com/THUDM/ImageReward! ImageReward is the first general-purpose text-to-image human preference RM.\n\n**News!** The codes of CogView2 ([paper link](https://arxiv.org/pdf/2105.13290.pdf)) have been released at https://github.com/THUDM/CogView2! \n\n**News!** The [demo](https://agc.platform.baai.ac.cn/CogView/index.html) for a better and faster CogView2 (formal version, March 2022) is available! The lastest model also supports English input, but to translate them into Chinese often could be better.\n\n**News!** The [demo](https://agc.platform.baai.ac.cn/CogView/index.html) for a better and faster CogView2 (new version) is available!\n\n**News!** The paper of CogView is accepted by NeurIPS 2021! \n\nCogView is a pretrained (4B-param) transformer for text-to-image generation in general domain.\n\n* **Read** our paper [CogView: Mastering Text-to-Image Generation via Transformers](https://arxiv.org/pdf/2105.13290.pdf) on ArXiv for a formal introduction. The *PB-relax* and *Sandwich-LN* can also help you train large and deep transformers stably (e.g. eliminating NaN losses).\n* **Visit** our demo at [Github Page](https://thudm.github.io/CogView/index.html) or [Wudao](https://wudao.aminer.cn/CogView/)! (Without post-selection or super-resolution, currently only supports simplified Chinese input, but one can translate text from other languages into Chinese for input. Note: *Wudao* provides faster access for users from China mainland.)\n* **Download** our pretrained models from [Tsinghu"},{"ref":"P3","kind":"page","title":"zai-org/GLM-130B repository metadata","date":"2026-06-11T01:46:01.470489+00:00","date_source":null,"source_url":"https://github.com/zai-org/GLM-130B","signal_url":null,"signal_json_url":null,"text":"# zai-org/GLM-130B\n\nDescription: GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 7651\n\nForks: 603\n\nOpen issues: 124\n\nCreated: 2022-08-03T20:21:58Z\n\nPushed: 2023-07-25T09:01:49Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<img src=\"resources/7D6433A42D189E2E6FBC62BE066BCE91.png\">\n\n<p align=\"center\">\n🌐 <a href=\"http://keg.cs.tsinghua.edu.cn/glm-130b/posts/glm-130b/\" target=\"_blank\">Blog</a> • ⏬ <a href=\"https://docs.google.com/forms/d/e/1FAIpQLSehr5Dh_i3TwACmFFi8QEgIVNYGmSPwV0GueIcsUev0NEfUug/viewform\" target=\"_blank\">Download Model</a> • 🪧 <a href=\"https://huggingface.co/spaces/THUDM/GLM-130B\" target=\"_blank\">Demo</a> • ✉️ <a href=\"mailto:glm-130b@googlegroups.com\">Email</a> • 📃 <a href=\"https://arxiv.org/abs/2210.02414\" target=\"_blank\">Paper [ICLR 2023]</a><br>\n</p>\n\n<p align=\"center\">\n💬 <a href=\"https://groups.google.com/g/glm-130b-forum\" target=\"_blank\">Google Group</a> (Updates) or <a href=\"https://github.com/THUDM/GLM-130B/blob/main/resources/WECHAT.md\" target=\"_blank\">Wechat Group</a> or <a href=\"https://join.slack.com/t/glm-130b/shared_invite/zt-1f2ih11xy-EAuDComTAr~XVB3MywE9Cg\" target=\"_blank\">Slack channel</a> (Discussions)\n</p>\n\n# GLM-130B: An Open Bilingual Pre-Trained Model\n\nGLM-130B is an open bilingual (English & Chinese) bidirectional dense model with 130 billion parameters, pre-trained using the algorithm of [General Language Model (GLM)](https://aclanthology.org/2022.acl-long.26). It is designed to support inference tasks with the 130B parameters on **a single A100 (40G * 8)** or **V100 (32G * 8) server**. With INT4 quantization, the hardware requirements can further be reduced to **a single server with 4 * RTX 3090 (24G)** with **almost no performance degradation**. As of July 3rd, 2022, GLM-130B has been trained on over 400 billion text tokens (200B each for Chinese and English) and it has the following unique features:\n\n- **Bilingual:** supports both English and Chinese. \n- **Performance (EN):** better than GPT-3 175B (+4.0%), OPT-175B (+5.5%), and BLOOM-176B (+13.0%) on LAMBADA and slightly better than GPT-3 175B (+0.9%) on MMLU.\n- **Performance (CN):** significantly bett"},{"ref":"P4","kind":"page","title":"zai-org/CogVideo repository metadata","date":"2026-06-11T01:46:01.468029+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogVideo","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogVideo\n\nDescription: text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 12774\n\nForks: 1305\n\nOpen issues: 113\n\nCreated: 2022-05-29T06:46:18Z\n\nPushed: 2025-11-04T11:19:04Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# CogVideo & CogVideoX\n\n[中文阅读](./README_zh.md)\n\n[日本語で読む](./README_ja.md)\n\n<div align=\"center\">\n<img src=resources/logo.svg width=\"50%\"/>\n</div>\n<p align=\"center\">\nExperience the CogVideoX-5B model online at <a href=\"https://huggingface.co/spaces/THUDM/CogVideoX-5B\" target=\"_blank\"> 🤗 Huggingface Space</a> or <a href=\"https://modelscope.cn/studios/ZhipuAI/CogVideoX-5b-demo\" target=\"_blank\"> 🤖 ModelScope Space</a>\n</p>\n<p align=\"center\">\n📚 View the <a href=\"https://arxiv.org/abs/2408.06072\" target=\"_blank\">paper</a> and <a href=\"https://zhipu-ai.feishu.cn/wiki/DHCjw1TrJiTyeukfc9RceoSRnCh\" target=\"_blank\">user guide</a>\n</p>\n<p align=\"center\">\n👋 Join our <a href=\"resources/WECHAT.md\" target=\"_blank\">WeChat</a> and <a href=\"https://discord.gg/dCGfUsagrD\" target=\"_blank\">Discord</a>\n</p>\n<p align=\"center\">\n📍 Visit <a href=\"https://chatglm.cn/video?lang=en?fr=osm_cogvideo\">QingYing</a> and <a href=\"https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9\">API Platform</a> to experience larger-scale commercial video generation models.\n</p>\n\n## Project Updates\n\n- 🔥🔥 **News**: ```2025/03/24```: We have launched [CogKit](https://github.com/THUDM/CogKit), a fine-tuning and inference framework for the **CogView4** and **CogVideoX** series. This toolkit allows you to fully explore and utilize our multimodal generation models.\n- 🔥 **News**: ```2025/02/28```: DDIM Inverse is now supported in `CogVideoX-5B` and `CogVideoX1.5-5B`. Check [here](inference/ddim_inversion.py).\n- 🔥 **News**: ```2025/01/08```: We have updated the code for `Lora` fine-tuning based on the `diffusers` version model, which uses less GPU memory. For more details, please see [here](finetune/README.md).\n- 🔥 **News**: ```2024/11/15```: We released the `CogVideoX1.5` model in the diffusers version. Only minor parameter adjustments are needed to continue using previous code.\n- 🔥 *"},{"ref":"P5","kind":"page","title":"zai-org/CodeGeeX repository metadata","date":"2026-06-11T01:46:00.873812+00:00","date_source":null,"source_url":"https://github.com/zai-org/CodeGeeX","signal_url":null,"signal_json_url":null,"text":"# zai-org/CodeGeeX\n\nDescription: CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 8790\n\nForks: 690\n\nOpen issues: 186\n\nCreated: 2022-09-17T14:06:29Z\n\nPushed: 2024-08-13T05:59:38Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<img src=\"resources/logo/codegeex_logo.png\">\n\n<p align=\"center\">\n🏠 <a href=\"https://codegeex.cn\" target=\"_blank\">Homepage</a> | 📖 <a href=\"https://models.aminer.cn/codegeex/blog/\" target=\"_blank\">Blog</a> | 🪧 <a href=\"https://models.aminer.cn/codegeex/playground\" target=\"_blank\">DEMO</a> | 🤖 <a href=\"https://codegeex.cn/download/request\" target=\"_blank\">Download Model</a> | 📄 <a href=\"https://arxiv.org/abs/2303.17568\" target=\"_blank\">Paper</a> | 🌐 <a href=\"README_zh.md\" target=\"_blank\">中文</a>\n</p>\n<p align=\"center\">\n🛠 <a href=\"https://marketplace.visualstudio.com/items?itemName=aminer.codegeex\" target=\"_blank\">VS Code</a>, <a href=\"https://plugins.jetbrains.com/plugin/20587-codegeex\" target=\"_blank\">Jetbrains</a>, <a href=\"https://plugins.jetbrains.com/plugin/20587-codegeex\" target=\"_blank\">Cloud Studio</a> supported | 👋 Join our <a href=\"https://discord.gg/8gjHdkmAN6\" target=\"_blank\">Discord</a>, <a href=\"https://join.slack.com/t/codegeexworkspace/shared_invite/zt-1s118ffrp-mpKKhQD0tKBmzNZVCyEZLw\" target=\"_blank\">Slack</a>, <a href=\"https://t.me/+IipIayJ32B1jOTg1\" target=\"_blank\">Telegram</a>, <a href=\"resources/zh/wechat.md\"target=\"_blank\">WeChat</a>\n</p>\n\n🌟 The newest [CodeGeeX4](https://github.com/THUDM/CodeGeeX4) has been released. | 最新一代 [CodeGeeX4](https://github.com/THUDM/CodeGeeX4) 模型已经正式开源。\n\n- [CodeGeeX: A Multilingual Code Generation Model](#codegeex-a-multilingual-code-generation-model)\n- [News](#news)\n- [Getting Started](#getting-started)\n- [Installation](#installation)\n- [Model Weights](#model-weights)\n- [Inference on GPUs](#inference-on-gpus)\n- [VS Code and Jetbrains Extension Guidance](#vs-code-and-jetbrains-extension-guidance)\n- [CodeGeeX: Architecture, Code Corpus, and Implementation](#codegeex-architecture-code-corpus-and-implementation)\n- [HumanEval-X: A new benchmark for Multilingual Program Synthesis](#humaneval-x-a-new-benchmark-for-multilin"},{"ref":"P6","kind":"page","title":"zai-org/ImageReward repository metadata","date":"2026-06-11T01:46:00.793098+00:00","date_source":null,"source_url":"https://github.com/zai-org/ImageReward","signal_url":null,"signal_json_url":null,"text":"# zai-org/ImageReward\n\nDescription: [NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 1681\n\nForks: 92\n\nOpen issues: 60\n\nCreated: 2023-04-01T09:04:17Z\n\nPushed: 2025-10-29T13:40:51Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# ImageReward\n\n<p align=\"center\">\n📃 <a href=\"https://arxiv.org/abs/2304.05977\" target=\"_blank\">Paper</a> • 🖼 <a href=\"https://huggingface.co/datasets/THUDM/ImageRewardDB\" target=\"_blank\">Dataset</a> • 🌐 <a href=\"https://zhuanlan.zhihu.com/p/639494251\" target=\"_blank\">中文博客</a> • 🤗 <a href=\"https://huggingface.co/THUDM/ImageReward\" target=\"_blank\">HF Repo</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a> <br>\n</p>\n\n🔥🔥 **News!** ```2024/12/31```: We released the **next generation of model, [VisionReward](https://github.com/THUDM/VisionReward)**, which is a fine-grained and multi-dimensional reward model for stable RLHF for visual generation (text-to-image / text-to-video)!\n\n🔥 **News!** ```2023/9/22```: The paper of ImageReward is accepted by NeurIPS 2023!\n\n**ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation**\n\nImageReward is the first general-purpose text-to-image human preference RM, which is trained on in total **137k pairs of expert comparisons**, outperforming existing text-image scoring methods, such as CLIP (by 38.6%), Aesthetic (by 39.6%), and BLIP (by 31.6%), in terms of understanding human preference in text-to-image synthesis.\n\nAdditionally, we introduce Reward Feedback Learning (ReFL) for direct optimizing a text-to-image diffusion model using ImageReward. ReFL-tuned Stable Diffusion wins against untuned version by 58.4% in human evaluation.\n\nBoth ImageReward and ReFL are all packed up to Python `image-reward` package now!\n\n[![PyPI](https://img.shields.io/pypi/v/image-reward)](https://pypi.org/project/image-reward/) [![Downloads](https://static.pepy.tech/badge/image-reward)](https://pepy.tech/project/image-reward)\n\nTry `image-reward` package in only 3 lines of code for ImageReward scoring!\n\n```python\n# pip install image-reward\nimport ImageReward as RM\nmodel = RM.loa"},{"ref":"P7","kind":"page","title":"zai-org/ChatGLM-6B repository metadata","date":"2026-06-11T01:46:00.770603+00:00","date_source":null,"source_url":"https://github.com/zai-org/ChatGLM-6B","signal_url":null,"signal_json_url":null,"text":"# zai-org/ChatGLM-6B\n\nDescription: ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 41048\n\nForks: 5144\n\nOpen issues: 608\n\nCreated: 2023-03-13T12:00:18Z\n\nPushed: 2024-06-27T04:05:25Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# ChatGLM-6B\n\n<p align=\"center\">\n🌐 <a href=\"https://chatglm.cn/blog\" target=\"_blank\">Blog</a> • 🤗 <a href=\"https://huggingface.co/THUDM/chatglm-6b\" target=\"_blank\">HF Repo</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a> • 📄<a href=\"https://arxiv.org/pdf/2406.12793\" target=\"_blank\"> Report </a> <br>\n</p>\n<p align=\"center\">\n👋 加入我们的 <a href=\"https://discord.gg/fK2dz4bg\" target=\"_blank\">Discord</a> 和 <a href=\"resources/WECHAT.md\" target=\"_blank\">WeChat</a>\n</p>\n<p align=\"center\">\n📍在 <a href=\"https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9\">智谱AI开放平台</a> 体验和使用更大规模的 GLM 商业模型。\n</p>\n\n*Read this in [English](README_en.md).*\n\n## GLM-4 开源模型和API\n\n我们已经发布最新的 **GLM-4** 大语言对话模型，该模型在多个指标上有了新的突破，您可以在以下两个渠道体验我们的最新模型。\n\n+ [GLM-4 开源模型](https://github.com/THUDM/GLM-4) 我们已经开源了 GLM-4-9B 系列模型，在各项指标的ce是上有明显提升，欢迎尝试。\n+ [智谱清言](https://chatglm.cn/main/detail?fr=ecology_x) 体验最新版 GLM-4，包括 **GLMs，All tools**等功能。\n+ [API平台](https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9) 新一代 API 平台已经上线，您可以直接在\nAPI\n平台上体验 `GLM-4-0520`、`GLM-4-air`、`GLM-4-airx`、`GLM-4-flash`、`GLM-4`、`GLM-3-Turbo`、`CharacterGLM-3`，`CogView-3`\n等新模型。\n其中`GLM-4`、`GLM-3-Turbo`两个模型支持了 `System Prompt`、`Function Call`、 `Retrieval`、`Web_Search`等新功能，欢迎体验。\n\n+ [GLM-4 API 开源教程](https://github.com/MetaGLM/glm-cookbook/) GLM-4 API教程和基础应用，欢迎尝试。\nAPI相关问题可以在本开源教程疑问，或者使用 [GLM-4 API AI助手](https://open.bigmodel.cn/shareapp/v1/?share_code=sQwt5qyqYVaNh1O_87p8O)\n来获得常见问题的帮助。\n\n-----\n## 介绍\n\nChatGLM-6B 是一个开源的、支持中英双语的对话语言模型，基于 [General Language Model (GLM)](https://github.com/THUDM/GLM) 架构，具有 62 亿参数。结合模型量化技术，用户可以在消费级的显卡上进行本地部署（INT4 量化级别下最低只需 6GB 显存）。\nChatGLM-6B 使用了和 ChatGPT 相似的技术，针对中文问答和对话进行了优化。经过约 1T 标识符的中英双语训练，辅以监督微调、反馈自助、人类反馈强化学习等技术的加持，62 亿参数的 ChatGLM-6B 已经能生成相当符合人类偏好的回答，更多信息请参考我们的[博客](https://chatglm.cn/blog)。欢迎通过 [chatglm.cn](https://chatglm.cn) 体验更大规模的 ChatGLM 模型。\n\n为了方便下游开发者针对自己的应用场景定制"},{"ref":"P8","kind":"page","title":"zai-org/VisualGLM-6B repository metadata","date":"2026-06-11T01:46:00.603493+00:00","date_source":null,"source_url":"https://github.com/zai-org/VisualGLM-6B","signal_url":null,"signal_json_url":null,"text":"# zai-org/VisualGLM-6B\n\nDescription: Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 4159\n\nForks: 425\n\nOpen issues: 275\n\nCreated: 2023-04-23T09:36:56Z\n\nPushed: 2024-08-23T07:45:06Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# VisualGLM-6B\n\n<p align=\"center\">\n🤗 <a href=\"https://huggingface.co/THUDM/visualglm-6b\" target=\"_blank\">HF Repo</a> • ⚒️ <a href=\"https://github.com/THUDM/SwissArmyTransformer\" target=\"_blank\">SwissArmyTransformer (sat)</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a> \n</p>\n<p align=\"center\">\n• 📃 <a href=\"https://arxiv.org/abs/2105.13290\" target=\"_blank\">[CogView@NeurIPS 21]</a> <a href=\"https://github.com/THUDM/CogView\" target=\"_blank\">[GitHub]</a> • 📃 <a href=\"https://arxiv.org/abs/2103.10360\" target=\"_blank\">[GLM@ACL 22]</a> <a href=\"https://github.com/THUDM/GLM\" target=\"_blank\">[GitHub]</a> <br>\n</p>\n<p align=\"center\">\n👋 加入我们的 <a href=\"https://join.slack.com/t/chatglm/shared_invite/zt-1y7pqoloy-9b1g6T6JjA8J0KxvUjbwJw\" target=\"_blank\">Slack</a> 和 <a href=\"examples/WECHAT.md\" target=\"_blank\">WeChat</a>\n</p>\n<!-- <p align=\"center\">\n🤖<a href=\"https://huggingface.co/spaces/THUDM/visualglm-6b\" target=\"_blank\">VisualGLM-6B在线演示网站</a>\n</p> -->\n\n## News\n[2023.10] 欢迎关注智谱AI新一代多模态对话模型CogVLM（ https://github.com/THUDM/CogVLM ），采用视觉专家新架构，在10项权威经典多模态任务上取得第一名。目前开源CogVLM-17B英文模型，即将基于GLM开源中文模型。\n\n## 介绍\n\nVisualGLM-6B is an open-source, multi-modal dialog language model that supports **images, Chinese, and English**. The language model is based on [ChatGLM-6B](https://github.com/THUDM/ChatGLM-6B) with 6.2 billion parameters; the image part builds a bridge between the visual model and the language model through the training of [BLIP2-Qformer](https://arxiv.org/abs/2301.12597), with the total model comprising 7.8 billion parameters. **[Click here for English version.](README_en.md)**\n\nVisualGLM-6B 是一个开源的，支持**图像、中文和英文**的多模态对话语言模型，语言模型基于 [ChatGLM-6B](https://github.com/THUDM/ChatGLM-6B)，具有 62 亿参数；图像部分通过训练 [BLIP2-Qformer](https://arxiv.org/abs/2301.12597) 构建起视觉模型与语言模型的桥梁，整体模型共78亿参数。\n\nVisualGLM-6B 依靠来自于 [CogView](https://arxiv.org/abs/2105.13290) 数据集"},{"ref":"P9","kind":"page","title":"zai-org/ChatGLM2-6B repository metadata","date":"2026-06-11T01:46:00.545298+00:00","date_source":null,"source_url":"https://github.com/zai-org/ChatGLM2-6B","signal_url":null,"signal_json_url":null,"text":"# zai-org/ChatGLM2-6B\n\nDescription: ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型\n\nLanguage: Python\n\nLicense: NOASSERTION\n\nStars: 15568\n\nForks: 1804\n\nOpen issues: 452\n\nCreated: 2023-06-24T06:21:34Z\n\nPushed: 2024-06-27T04:05:08Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# ChatGLM2-6B\n\n<p align=\"center\">\n🤗 <a href=\"https://huggingface.co/THUDM/chatglm2-6b\" target=\"_blank\">HF Repo</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a> • 📃 <a href=\"https://arxiv.org/abs/2103.10360\" target=\"_blank\">[GLM@ACL 22]</a> <a href=\"https://github.com/THUDM/GLM\" target=\"_blank\">[GitHub]</a> • 📃 <a href=\"https://arxiv.org/abs/2210.02414\" target=\"_blank\">[GLM-130B@ICLR 23]</a> <a href=\"https://github.com/THUDM/GLM-130B\" target=\"_blank\">[GitHub]</a> <br>\n</p>\n<p align=\"center\">\n👋 加入我们的 <a href=\"https://discord.gg/fK2dz4bg\" target=\"_blank\">Discord</a> 和 <a href=\"resources/WECHAT.md\" target=\"_blank\">WeChat</a>\n</p>\n<p align=\"center\">\n📍在 <a href=\"https://www.chatglm.cn\">chatglm.cn</a> 体验更大规模的 ChatGLM 模型。\n</p>\n\n*Read this in [English](README_EN.md)*\n\n## GLM-4 开源模型和API\n\n我们已经发布最新的 **GLM-4** 模型，该模型在多个指标上有了新的突破，您可以在以下两个渠道体验我们的最新模型。\n\n+ [GLM-4 开源模型](https://github.com/THUDM/GLM-4) 我们已经开源了 GLM-4-9B 系列模型，在各项指标的ce是上有明显提升，欢迎尝试。\n+ [智谱清言](https://chatglm.cn/main/detail?fr=ecology_x) 体验最新版 GLM-4，包括 **GLMs，All tools**等功能。\n+ [API平台](https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9) 新一代 API 平台已经上线，您可以直接在\nAPI\n平台上体验 `GLM-4-0520`、`GLM-4-air`、`GLM-4-airx`、`GLM-4-flash`、`GLM-4`、`GLM-3-Turbo`、`CharacterGLM-3`，`CogView-3`\n等新模型。\n其中`GLM-4`、`GLM-3-Turbo`两个模型支持了 `System Prompt`、`Function Call`、 `Retrieval`、`Web_Search`等新功能，欢迎体验。\n\n+ [GLM-4 API 开源教程](https://github.com/MetaGLM/glm-cookbook/) GLM-4 API教程和基础应用，欢迎尝试。\nAPI相关问题可以在本开源教程疑问，或者使用 [GLM-4 API AI助手](https://open.bigmodel.cn/shareapp/v1/?share_code=sQwt5qyqYVaNh1O_87p8O)\n来获得常见问题的帮助。\n\n-----\n\n## 介绍\n\nChatGLM**2**-6B 是开源中英双语对话模型 [ChatGLM-6B](https://github.com/THUDM/ChatGLM-6B) 的第二代版本，在保留了初代模型对话流畅、部署门槛较低等众多优秀特性的基础之上，ChatGLM**2**-6B 引入了如下新特性：\n\n1. **更强大的性能**：基于 ChatGLM 初代模型的开发经验，我们全面升级了 ChatGLM2-6B 的基座模型。ChatGLM2-6B 使用了 [GLM](https://github.com/THUDM/GLM) 的混合目标函数，经过了 1.4T 中英标识符的预训练与人类偏好对齐训练，[评测结果]("},{"ref":"P10","kind":"page","title":"zai-org/CodeGeeX2 repository metadata","date":"2026-06-11T01:46:00.079278+00:00","date_source":null,"source_url":"https://github.com/zai-org/CodeGeeX2","signal_url":null,"signal_json_url":null,"text":"# zai-org/CodeGeeX2\n\nDescription: CodeGeeX2: A More Powerful Multilingual Code Generation Model\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 7553\n\nForks: 535\n\nOpen issues: 211\n\nCreated: 2023-07-23T18:26:53Z\n\nPushed: 2024-07-10T12:34:55Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n![](resources/codegeex_logo.png)\n\n<p align=\"center\">\n🏠 <a href=\"https://codegeex.cn\" target=\"_blank\">主页</a>｜🛠 插件 <a href=\"https://marketplace.visualstudio.com/items?itemName=aminer.codegeex\" target=\"_blank\">VS Code</a>, <a href=\"https://plugins.jetbrains.com/plugin/20587-codegeex\" target=\"_blank\">Jetbrains</a>｜🤗 <a href=\"https://huggingface.co/THUDM/codegeex2-6b\" target=\"_blank\">模型下载</a>｜📄 <a href=\"https://arxiv.org/abs/2303.17568\" target=\"_blank\">论文</a>｜👋 加入<a href=\"resources/wechat.md\"target=\"_blank\">微信开发者交流群</a>\n</p>\n\nRead this in [English](README_EN.md)<br>\n[日本語](README_JA.md)で読む<br>\nLire en [Français](README_FR.md)\n\n⭐️ 最新一代 [CodeGeeX4](https://github.com/THUDM/CodeGeeX4) 模型已经正式开源。\nThe newest [CodeGeeX4](https://github.com/THUDM/CodeGeeX4) has been released.\n\n# CodeGeeX2: 更强大的多语言代码生成模型\n\nCodeGeeX2 是多语言代码生成模型 [CodeGeeX](https://github.com/THUDM/CodeGeeX) ([KDD’23](https://arxiv.org/abs/2303.17568)) 的第二代模型。不同于一代 CodeGeeX（完全在国产华为昇腾芯片平台训练） ，CodeGeeX2 是基于 [ChatGLM2](https://github.com/THUDM/ChatGLM2-6B) 架构加入代码预训练实现，得益于 ChatGLM2 的更优性能，CodeGeeX2 在多项指标上取得性能提升（+107% > CodeGeeX；仅60亿参数即超过150亿参数的 StarCoder-15B 近10%），更多特性包括：\n\n* **更强大的代码能力**：基于 ChatGLM2-6B 基座语言模型，CodeGeeX2-6B 进一步经过了 600B 代码数据预训练，相比一代模型，在代码能力上全面提升，[HumanEval-X](https://huggingface.co/datasets/THUDM/humaneval-x) 评测集的六种编程语言均大幅提升 (Python +57%, C++ +71%, Java +54%, JavaScript +83%, Go +56%, Rust +321\\%)，在Python上达到 35.9\\% 的 Pass@1 一次通过率，超越规模更大的 StarCoder-15B。\n* **更优秀的模型特性**：继承 ChatGLM2-6B 模型特性，CodeGeeX2-6B 更好支持中英文输入，支持最大 8192 序列长度，推理速度较一代 CodeGeeX-13B 大幅提升，量化后仅需6GB显存即可运行，支持轻量级本地化部署。\n* **更全面的AI编程助手**：CodeGeeX插件（[VS Code](https://marketplace.visualstudio.com/items?itemName=aminer.codegeex), [Jetbrains](https://plugins.jetbrains.com/plugin/20587-codegeex)）后端升级，支持超过100种编程语言，新增上下文补全、跨文件补全等实用功能。结合 Ask CodeGeeX 交互式AI编程助手，支持中英文对话解决各种编程问题，包括且不限于代码解释、代码翻译、代码纠错、文档生成等，帮助程序员更高效开发。\n* **更开放的协议**：CodeGeeX2-6B 权重对学术研究完全开放，填写[登记表](https"},{"ref":"P11","kind":"page","title":"zai-org/RelayDiffusion repository metadata","date":"2026-06-11T01:45:59.936732+00:00","date_source":null,"source_url":"https://github.com/zai-org/RelayDiffusion","signal_url":null,"signal_json_url":null,"text":"# zai-org/RelayDiffusion\n\nDescription: The official implementation of \"Relay Diffusion: Unifying diffusion process across resolutions for image synthesis\" [ICLR 2024 Spotlight]\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 316\n\nForks: 19\n\nOpen issues: 2\n\nCreated: 2023-09-04T14:28:18Z\n\nPushed: 2024-04-29T09:29:51Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n## Relay Diffusion: Unifying diffusion process across resolutions for image synthesis <br><sub>Official Pytorch Implementation 🌐[[WiseModel]](https://www.wisemodel.cn/models/ZhipuAI/RelayDiffsuon/intro) 🌐[[Model Scope]](https://www.modelscope.cn/models/ZhipuAI/RelayDiffusion/summary)</sub>\n\n🎉**News!** The paper of RelayDiffusion has been accepted by ICLR 2024 (**Spotlight**)!\n\n![](resources/samples.jpg)\n\nWe propose ***Relay Diffusion Model (RDM)*** as a better framework for diffusion generation. ***RDM*** transfers a low-resolution image or noise into an equivalent high-resolution one via blurring diffusion and block noise. Therefore, the diffusion process can continue seamlessly in any new resolution or model without restarting from pure noise or low-resolution conditioning.\n\nRDM achieved **state-of-the-art** FID on CelebA-HQ and sFID ImageNet-256 (FID=1.87)!\n\nFor a formal introduction, Read our paper: [Relay Diffusion: Unifying diffusion process across resolutions for image synthesis](https://arxiv.org/abs/2309.03350).\n\n## Setup\n\n### Environment\n\nDownload the repo and setup the environment with:\n\n```bash\ngit clone https://github.com/THUDM/RelayDiffusion.git\ncd RelayDiffusion\nconda env create -f environment.yml\nconda activate rdm\n```\n\nWe enable `xformers.ops.memory_efficient_attention` to reduce about 15% training cost. If there is no need you can also remove `xformers` from `environment.yml`.\n\nLinux servers with Nvidia A100s are recommended. However, by setting smaller `--batch-gpu` (batch size on a single gpu), you can still run the inference and training scripts on less powerful GPUs.\n\n### Dataset\n\nWe preprocess and implement datasets with the same format as [EDM](https://github.com/NVlabs/edm). For CelebA-HQ, follow [*Progressive Growing of GANs for Improved Quality, Stability, and Variatio"},{"ref":"P12","kind":"page","title":"zai-org/CogVLM repository metadata","date":"2026-06-11T01:45:59.883545+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogVLM","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogVLM\n\nDescription: a state-of-the-art-level open visual language model | 多模态预训练模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 6738\n\nForks: 453\n\nOpen issues: 70\n\nCreated: 2023-09-18T02:12:50Z\n\nPushed: 2024-05-29T10:01:33Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# CogVLM & CogAgent\n\n📗 [中文版README](./README_zh.md)\n\n🌟 **Jump to detailed introduction: [Introduction to CogVLM](#introduction-to-cogvlm)，\n🆕 [Introduction to CogAgent](#introduction-to-cogagent)**\n\n📔 For more detailed usage information, please refer to: [CogVLM & CogAgent's technical documentation (in Chinese)](https://zhipu-ai.feishu.cn/wiki/LXQIwqo1OiIVTykMh9Lc3w1Fn7g) \n\n<table>\n<tr>\n<td>\n<h2> CogVLM </h2>\n<p> 📖 Paper: <a href=\"https://arxiv.org/abs/2311.03079\">CogVLM: Visual Expert for Pretrained Language Models</a></p>\n<p><b>CogVLM</b> is a powerful open-source visual language model (VLM). CogVLM-17B has 10 billion visual parameters and 7 billion language parameters, <b>supporting image understanding and multi-turn dialogue with a resolution of 490*490</b>.</p>\n<p><b>CogVLM-17B achieves state-of-the-art performance on 10 classic cross-modal benchmarks</b>, including NoCaps, Flicker30k captioning, RefCOCO, RefCOCO+, RefCOCOg, Visual7W, GQA, ScienceQA, VizWiz VQA and TDIUC.</p>\n</td>\n<td>\n<h2> CogAgent </h2>\n<p> 📖 Paper: <a href=\"https://arxiv.org/abs/2312.08914\">CogAgent: A Visual Language Model for GUI Agents </a></p>\n<p><b>CogAgent</b> is an open-source visual language model improved based on CogVLM. CogAgent-18B has 11 billion visual parameters and 7 billion language parameters, <b>supporting image understanding at a resolution of 1120*1120</b>. <b>On top of the capabilities of CogVLM, it further possesses GUI image Agent capabilities</b>.</p>\n<p> <b>CogAgent-18B achieves state-of-the-art generalist performance on 9 classic cross-modal benchmarks</b>, including VQAv2, OK-VQ, TextVQA, ST-VQA, ChartQA, infoVQA, DocVQA, MM-Vet, and POPE. <b>It significantly surpasses existing models on GUI operation datasets</b> including AITW and Mind2Web.</p>\n</td>\n</tr>\n<tr>\n<td colspan=\"2\" align=\"center\">\n<p>🌐 Web Demo for both CogVLM2: <a href=\"http://36.103.203.44:7861\">this lin"},{"ref":"P13","kind":"page","title":"zai-org/CogAgent repository metadata","date":"2026-06-11T01:45:59.695138+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogAgent","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogAgent\n\nDescription: An open-sourced end-to-end VLM-based GUI Agent\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 1183\n\nForks: 99\n\nOpen issues: 29\n\nCreated: 2023-11-28T09:28:08Z\n\nPushed: 2025-04-04T13:29:55Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# CogAgent: An open-sourced VLM-based GUI Agent\n\n[中文文档](README_zh.md)\n\n- 🔥 🆕 **December 2024:** We open-sourced **the latest version of the CogAgent-9B-20241220 model**. Compared to the\nprevious version of CogAgent, `CogAgent-9B-20241220` features significant improvements in GUI perception, reasoning\naccuracy, action space completeness, task universality, and generalization. It supports bilingual (Chinese and\nEnglish) interaction through both screen captures and natural language.\n\n- 🏆 **June 2024:** CogAgent was accepted by **CVPR 2024** and recognized as a conference Highlight (top 3%).\n\n- **December 2023:** We **open-sourced the first GUI Agent**: **CogAgent** (with the former repository\navailable [here](https://github.com/THUDM/CogVLM)) and **published the corresponding paper:\n📖 [CogAgent Paper](https://arxiv.org/abs/2312.08914)**.\n\n## Model Introduction\n\n| Model | Model Download Links | Technical Documentation | Online Demo | \n|:--------------------:|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| \n| cogagent-9b-20241220 | [🤗 HuggingFace](https://huggingface.co/THUDM/cogagent-9b-20241220)<br> [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/cogagent-9b-20"},{"ref":"P14","kind":"page","title":"zai-org/ChatGLM3 repository metadata","date":"2026-06-11T01:45:59.619514+00:00","date_source":null,"source_url":"https://github.com/zai-org/ChatGLM3","signal_url":null,"signal_json_url":null,"text":"# zai-org/ChatGLM3\n\nDescription: ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 13679\n\nForks: 1591\n\nOpen issues: 35\n\nCreated: 2023-10-26T06:22:47Z\n\nPushed: 2025-01-13T06:55:26Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# ChatGLM3\n\n<p align=\"center\">\n📄<a href=\"https://arxiv.org/pdf/2406.12793\" target=\"_blank\"> Report </a> • 🤗 <a href=\"https://huggingface.co/THUDM/chatglm3-6b\" target=\"_blank\">HF Repo</a> • 🤖 <a href=\"https://modelscope.cn/models/ZhipuAI/chatglm3-6b\" target=\"_blank\">ModelScope</a> • 🟣 <a href=\"https://www.wisemodel.cn/models/ZhipuAI/chatglm3-6b\" target=\"_blank\">WiseModel</a> • 📔 <a href=\"https://lslfd0slxc.feishu.cn/wiki/WvQbwIJ9tiPAxGk8ywDck6yfnof\" target=\"_blank\">Document</a> • 🧰 <a href=\"https://openxlab.org.cn/models/hot/THUDM\" target=\"_blank\">OpenXLab</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a><br>\n</p>\n<p align=\"center\">\n👋 加入我们的 <a href=\"https://discord.gg/fK2dz4bg\" target=\"_blank\">Discord</a> 和 <a href=\"resources/WECHAT.md\" target=\"_blank\">微信</a>\n</p>\n<p align=\"center\">\n📍在 <a href=\"https://www.chatglm.cn\">chatglm.cn</a> 体验更大规模的 ChatGLM 模型。\n</p>\n\n[Read this in English.](./README_en.md)\n\n📔 关于`ChatGLM3-6B` 更为详细的使用信息，可以参考\n\n+ [ChatGLM3 开放技术文档](https://lslfd0slxc.feishu.cn/wiki/WvQbwIJ9tiPAxGk8ywDck6yfnof?from=from_copylink)\n+ [Bilibili video](https://www.bilibili.com/video/BV1uC4y1J7yA)\n+ [YouTube video](https://www.youtube.com/watch?v=Pw9PB6R7ORA)\n\n## GLM-4 开源模型和API\n\n我们已经发布最新的 **GLM-4** 模型，该模型在多个指标上有了新的突破，您可以在以下两个渠道体验我们的最新模型。\n\n+ [GLM-4 开源模型](https://github.com/THUDM/GLM-4) 我们已经开源了 GLM-4-9B 系列模型，在各项指标的测试上有明显提升，欢迎尝试。\n+ [智谱清言](https://chatglm.cn/main/detail?fr=ecology_x) 体验最新版 GLM-4，包括 **GLMs，All tools**等功能。\n+ [API平台](https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9) 新一代 API 平台已经上线，您可以直接在\nAPI\n平台上体验 `GLM-4-0520`、`GLM-4-air`、`GLM-4-airx`、`GLM-4-flash`、`GLM-4`、`GLM-3-Turbo`、`CharacterGLM-3`，`CogView-3`\n等新模型。\n其中`GLM-4`、`GLM-3-Turbo`两个模型支持了 `System Prompt`、`Function Call`、 `Retrieval`、`Web_Search`等新功能，欢迎体验。\n\n+ [GLM-4 API 开源教程](https://github.com/MetaGLM/glm-cookbook/) GLM-4 API教程和基础应用，欢迎尝试。\nAPI相关问题可以在本开源教程疑问，或者使用 [GLM-4 API A"},{"ref":"P15","kind":"page","title":"zai-org/CogCoM repository metadata","date":"2026-06-11T01:45:59.226122+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogCoM","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogCoM\n\nLanguage: Jupyter Notebook\n\nLicense: NOASSERTION\n\nStars: 223\n\nForks: 14\n\nOpen issues: 20\n\nCreated: 2024-02-02T11:18:30Z\n\nPushed: 2024-07-05T09:56:19Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n<h2 align=\"center\"> <a href=\"https://arxiv.org/pdf/2402.04236\">CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations</a></h2>\n<h5 align=\"center\"> If you like our project, please give us a star ⭐ on GitHub for latest update.<br>\n\n[![hf_space](https://img.shields.io/badge/🤗-Open%20In%20Spaces-blue.svg)](https://huggingface.co/qijimrc/CogCoM)\n[![arXiv](https://img.shields.io/badge/Arxiv-2401.15947-b31b1b.svg?logo=arXiv)](https://arxiv.org/pdf/2402.04236) \n[![License](https://img.shields.io/badge/License-Apache%202.0-yellow)](https://github.com/THUDM/CogCoM/blob/main/LICENSE)\n</h5>\n\n<details><summary>💡 We also have other vision-language projects that may interest you ✨. </summary><p>\n<!-- may -->\n\n> [**CogVLM: Visual Expert for Pretrained Language Models**](https://github.com/THUDM/CogVLM) <br>\n[![github](https://img.shields.io/badge/-Github-black?logo=github)](https://github.com/THUDM/CogVLM) [![github](https://img.shields.io/github/stars/THUDM/CogVLM.svg?style=social&label=Star)](https://github.com/THUDM/CogVLM) <br>\n> [**CogAgent: A Visual Language Model for GUI Agents**](https://github.com/THUDM/CogVLM) <br>\n[![github](https://img.shields.io/badge/-Github-black?logo=github)](https://github.com/THUDM/CogVLM) [![github](https://img.shields.io/github/stars/THUDM/CogVLM.svg?style=social&label=Star)](https://github.com/THUDM/CogVLM) <br>\n\n</p></details>\n\n## 📣 News\n* **[2024/6/15]** 🎉 Release our prepared datasets, the synthesized 84K data and manually annotated 7K math data (see in [Data](/cogcom/data) or [HuggingFace](https://huggingface.co/datasets/qijimrc/CoMDataset)).\n* **[2024/2/26]** 🎉 Release the chat model CogCoM-chat-17b.\n* **[2024/2/26]** 🎉 Release the grounding model CogCoM-grounding-17b.\n* **[2024/2/4]** 🎉 Release the base model CogCoM-base-17b.\n\n## 😮 Highlights\n\nCogCoM enables VLMs to solve various visual problems step-by-step with evidence, without involving external tools.\n\n<p align=\"ce"},{"ref":"P16","kind":"page","title":"zai-org/CogVLM2 repository metadata","date":"2026-06-11T01:45:59.131512+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogVLM2","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogVLM2\n\nDescription: GPT4V-level open-source multi-modal model based on Llama3-8B\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 2438\n\nForks: 163\n\nOpen issues: 61\n\nCreated: 2024-05-10T09:07:11Z\n\nPushed: 2025-03-03T03:01:31Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# CogVLM2 & CogVLM2-Video\n\n[中文版README](./README_zh.md)\n\n<div align=\"center\">\n<img src=resources/logo.svg width=\"40%\"/> \n</div>\n\n<p align=\"center\">\n👋 Join our <a href=\"resources/WECHAT.md\" target=\"_blank\">Wechat</a> · 💡Try CogVLM2 <a href=\"http://cogvlm2-online.cogviewai.cn:7861/\" target=\"_blank\">Online</a> 💡Try CogVLM2-Video <a href=\"http://cogvlm2-online.cogviewai.cn:7868/\" target=\"_blank\">Online</a>\n</p>\n<p align=\"center\">\n📍Experience the larger-scale CogVLM model on the <a href=\"https://open.bigmodel.cn/?utm_campaign=open&_channel_track_key=OWTVNma9\">ZhipuAI Open Platform</a>.\n</p>\n\n## Recent updates\n- 🔥 **News**: ``2024/8/30``: The [CogVLM2 paper](https://arxiv.org/abs/2408.16500) has been published on arXiv.\n- 🔥 **News**: ``2024/7/12``: We have released CogVLM2-Video [online web demo](http://cogvlm2-online.cogviewai.cn:7868/), welcome to experience it.\n- 🔥 **News**: ``2024/7/8``: We released the video understanding version of the CogVLM2 model, the CogVLM2-Video model.\nBy extracting keyframes, it can interpret continuous images. The model can support videos of up to 1 minute. See more\nin our [blog](https://cogvlm2-video.github.io/).\n- 🔥 **News**: ``2024/6/8``:We release [CogVLM2 TGI Weight](https://huggingface.co/THUDM/cogvlm2-llama3-chat-19B-tgi),\nwhich is a model can be inferred in [TGI](https://huggingface.co/docs/text-generation-inference/en/index). See\nInference Code in [here](https://github.com/leizhao1234/cogvlm2)\n- 🔥 **News**: ``2024/6/5``:We release [GLM-4V-9B](https://huggingface.co/THUDM/glm-4v-9b), which use the same data and\ntraining recipes as CogVLM2 but with GLM-9B as the language backbone. We removed visual experts to reduce the model\nsize to 13B. More details at [GLM-4 repo](https://github.com/THUDM/GLM-4/).\n- 🔥 **News**: ``2024/5/24``: We have released\nthe [Int4 version model](https://huggingface.co/THUDM/cogvlm2-llama3-chat-19B-int4), which re"},{"ref":"P17","kind":"page","title":"zai-org/Inf-DiT repository metadata","date":"2026-06-11T01:45:59.008486+00:00","date_source":null,"source_url":"https://github.com/zai-org/Inf-DiT","signal_url":null,"signal_json_url":null,"text":"# zai-org/Inf-DiT\n\nDescription: Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 445\n\nForks: 25\n\nOpen issues: 24\n\nCreated: 2024-05-07T07:02:59Z\n\nPushed: 2024-07-05T10:35:10Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Inf-DiT\n\n[![arXiv](https://img.shields.io/badge/arXiv-2405.04312-b31b1b.svg)](https://arxiv.org/abs/2405.04312)[![Page Views Count](https://badges.toozhao.com/badges/01HXBVPE6J3YKGEWCFSBRAXFAK/blue.svg)](https://badges.toozhao.com/stats/01HXBVPE6J3YKGEWCFSBRAXFAK \"Get your own page views count badge on badges.toozhao.com\")\n\nOfficial implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer\n\n![1715078130760](image/README/frontpage.png)\n\n## 🆕 News\n\n* **2024.07.01**: Inf-DiT has been accepted by ECCV2024!\n* **2024.05.20**: This code and model weight is released.\n* **2024.05.08**: This repo is released.\n\n## ⏳ TODO\n\n- [x] Code release\n- [x] Model weight release\n- [x] Complete the explanation for the inference code and hyperparameter\n- [ ] Demo\n- [ ] Comfyui\n\n## 🔆 Abstract\n\nDiffusion models have shown remarkable performance in image generation in recent years. However, due to a quadratic increase in memory during generating ultra-high-resolution images (e.g. 4096 × 4096), the resolution of generated images is often limited to 1024×1024. In this work, we propose a unidirectional block attention mechanism that can adaptively adjust the memory overhead during the inference process and handle global dependencies. Building on this module, we adopt the DiT structure for upsampling and develop an infinite super-resolution model capable of upsampling images of various shapes and resolutions. Comprehensive experiments show that our model achieves excellent performance in generating ultra-high-resolution images. Compared to commonly used UNet structures, our model can save more than 5× memory when generating 4096 × 4096 images.\n\n## 📚 Model Inference\nModel weights can be downloaded from [here](https://cloud.tsinghua.edu.cn/f/6e313f7e1236468e973b/?dl=1)\n\n1. Download the model weights and put them i"},{"ref":"P18","kind":"page","title":"zai-org/GLM-4 repository metadata","date":"2026-06-11T01:45:58.868982+00:00","date_source":null,"source_url":"https://github.com/zai-org/GLM-4","signal_url":null,"signal_json_url":null,"text":"# zai-org/GLM-4\n\nDescription: GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 7065\n\nForks: 616\n\nOpen issues: 38\n\nCreated: 2024-05-15T05:17:08Z\n\nPushed: 2025-07-04T04:16:36Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# GLM-4-0414 Model Series\n\n<p align=\"center\">\n👋 Join our <a href=\"https://discord.gg/8cnQKdAprg\" target=\"_blank\">Discord</a>, <a href=\"https://x.com/Zai_org\" target=\"_blank\">X</a> and <a href=\"resources/WECHAT.md\" target=\"_blank\"> WeChat (Chinese) </a>\n</p>\n<p align=\"center\">\n📍The open-source models released this time can be experienced for free at <a href=\"https://chat.z.ai\">Z.ai</a>; for GLM commercial model services, please visit <a href=\"https://bigmodel.cn\">bigmodel.cn</a>.\n</p>\n\nRead this in [中文](README_zh.md)\n\n## Project Updates\n\n- 🔥 **News**: ```2025/07/02```: We are releasing the [GLM-4.1V-9B-Thinking](https://huggingface.co/collections/THUDM/glm-41v-thinking-6862bbfc44593a8601c2578d) series VLM, check [this github repo](https://github.com/THUDM/GLM-4.1V-Thinking) to get more information.\n- **News**: ```2025/04/14```: We are releasing the [GLM-4-32B-0414](https://huggingface.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e) series models, scaled up to 32B parameters, including models with capabilities for dialogue, reasoning, and rumination.\n- **News**: ``2024/06/18``: We have released our [Technical Report](https://arxiv.org/pdf/2406.12793), feel free to check it out.\n- **News**: ``2024/06/05``: We released the `GLM-4-9B` series of open-source models. Details can be found [here](README_20240605.md).\n\n## Model Introduction\n\nThe GLM family welcomes new members, the **GLM-4-32B-0414** series models, featuring 32 billion parameters. Its performance is comparable to OpenAI’s GPT series and DeepSeek’s V3/R1 series. It also supports very user-friendly local deployment features. GLM-4-32B-Base-0414 was pre-trained on 15T of high-quality data, including substantial reasoning-type synthetic data. This lays the foundation for subsequent reinforcement learning extensions. In the post-training stage, we employed human preference alignment for dialogue scenarios. Add"},{"ref":"P19","kind":"page","title":"zai-org/LVBench repository metadata","date":"2026-06-11T01:45:58.837142+00:00","date_source":null,"source_url":"https://github.com/zai-org/LVBench","signal_url":null,"signal_json_url":null,"text":"# zai-org/LVBench\n\nDescription: [ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark\n\nLanguage: Python\n\nStars: 143\n\nForks: 6\n\nOpen issues: 12\n\nCreated: 2024-05-29T07:48:02Z\n\nPushed: 2025-07-09T07:47:57Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# LVBench: An Extreme Long Video Understanding Benchmark\n\n![VideoQA](https://img.shields.io/badge/Task-VideoQA-red)\n![Multi-Modal](https://img.shields.io/badge/Task-Multi--Modal-red)\n![LVBench](https://img.shields.io/badge/Dataset-LVBench-blue) \n![Gemini](https://img.shields.io/badge/Model-Gemini-green)\n![GPT-4o](https://img.shields.io/badge/Model-GPT--4o-green)\n![LLaVA-NEXT](https://img.shields.io/badge/Model-LLaVA--NeXT-green)\n![MovieChat](https://img.shields.io/badge/Model-MovieChat-green)\n\n<font size=7><div align='center' > [[🍎 Project Page](https://lvbench.github.io/)] [[📖 arXiv Paper](https://arxiv.org/abs/2406.08035)] [[📊 Dataset](https://huggingface.co/datasets/THUDM/LVBench)][[🏆 Leaderboard](https://lvbench.github.io/#leaderboard)][[🏆 Huggingface Leaderboard](https://huggingface.co/spaces/THUDM/LVBench)] </div></font>\n\n<p align=\"center\">\n<img src=\"./docs/images/cover.png\" width=\"96%\" height=\"50%\">\n</p>\n\nLVBench is a benchmark designed to evaluate and enhance the capabilities of multimodal models in understanding and\nextracting information from long videos up to two hours in duration.\n\n---\n\n## 🔥 News\n\n* **`2024.08.2`** 🌟 We setup the LVBench Leaderboard on Huggingface Spaces! Have a check on the [Leaderboard](https://huggingface.co/spaces/THUDM/LVBench).\n* **`2024.06.11`** 🌟 We released LVBench, a new benchmark for long video understanding!\n\n## 👀 Introduce to LVBench\n\nLVBench is a benchmark designed to evaluate the capabilities of models in understanding long videos. We collected\nextensive long video data from public sources, annotated through a mix of manual effort and model assistance. Our\nbenchmark provides a robust foundation for testing models on extended temporal contexts, ensuring high-quality\nassessment through meticulous human annotation and multi-stage quality control.\n\n### Features\n\n1. **Core Capabilities**: Six core capabilities for long video understanding, enabling"},{"ref":"P20","kind":"page","title":"zai-org/CogView4 repository metadata","date":"2026-06-11T01:45:58.358175+00:00","date_source":null,"source_url":"https://github.com/zai-org/CogView4","signal_url":null,"signal_json_url":null,"text":"# zai-org/CogView4\n\nDescription: CogView4, CogView3-Plus and CogView3(ECCV 2024)\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 1103\n\nForks: 81\n\nOpen issues: 12\n\nCreated: 2024-09-23T06:38:31Z\n\nPushed: 2025-03-29T07:09:58Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# CogView4 & CogView3 & CogView-3Plus\n\n[阅读中文版](./README_zh.md)\n[日本語で読む](./README_ja.md)\n\n<div align=\"center\">\n<img src=resources/logo.svg width=\"50%\"/>\n</div>\n\n<p align=\"center\">\n<a href=\"https://huggingface.co/spaces/THUDM-HF-SPACE/CogView4\" target=\"_blank\"> 🤗 HuggingFace Space</a>\n<a href=\"https://modelscope.cn/studios/ZhipuAI/CogView4\" target=\"_blank\"> 🤖ModelScope Space</a>\n<a href=\"https://zhipuaishengchan.datasink.sensorsdata.cn/t/4z\" target=\"_blank\"> 🛠️ZhipuAI MaaS(Faster)</a>\n<br>\n<a href=\"resources/WECHAT.md\" target=\"_blank\"> 👋 WeChat Community</a> <a href=\"https://arxiv.org/abs/2403.05121\" target=\"_blank\">📚 CogView3 Paper</a>\n</p>\n\n![showcase.png](resources/showcase.png)\n\n## Project Updates\n\n- 🔥🔥 ```2025/03/24```: We are launching [CogKit](https://github.com/THUDM/CogKit), a powerful toolkit for fine-tuning and inference of the **CogView4** and **CogVideoX** series, allowing you to fully explore our multimodal generation models.\n- ```2025/03/04```: We've adapted and open-sourced the [diffusers](https://github.com/huggingface/diffusers) version\nof **CogView-4** model, which has 6B parameters, supports native Chinese input, and Chinese text-to-image generation.\nYou can try it [online](https://huggingface.co/spaces/THUDM-HF-SPACE/CogView4).\n- ```2024/10/13```: We've adapted and open-sourced the [diffusers](https://github.com/huggingface/diffusers) version of\n**CogView-3Plus-3B** model. You can try\nit [online](https://huggingface.co/spaces/THUDM-HF-SPACE/CogView3-Plus-3B-Space).\n- ```2024/9/29```: We've open-sourced **CogView3** and **CogView-3Plus-3B**. **CogView3** is a text-to-image system\nbased on cascading diffusion, using a relay diffusion framework. **CogView-3Plus** is a series of newly developed\ntext-to-image models based on Diffusion Transformer.\n\n## Project Plan\n\n- [X] Diffusers workflow adaptation\n- [X] Cog series fine-tuning kits (coming soon)\n- [ ] ControlNet model"},{"ref":"P21","kind":"page","title":"zai-org/GLM-4-Voice repository metadata","date":"2026-06-11T01:45:58.293995+00:00","date_source":null,"source_url":"https://github.com/zai-org/GLM-4-Voice","signal_url":null,"signal_json_url":null,"text":"# zai-org/GLM-4-Voice\n\nDescription: GLM-4-Voice | 端到端中英语音对话模型\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 3187\n\nForks: 281\n\nOpen issues: 68\n\nCreated: 2024-10-24T12:12:32Z\n\nPushed: 2024-12-05T07:10:01Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# GLM-4-Voice\n<p align=\"center\">\n📄<a href=\"https://arxiv.org/abs/2412.02612\" target=\"_blank\"> Report </a> • 🤗 <a href=\"https://huggingface.co/THUDM/glm-4-voice-9b\" target=\"_blank\">HF Repo</a> • 🤖 <a href=\"https://modelscope.cn/studios/ZhipuAI/GLM-4-Voice-Demo\" target=\"_blank\">Demo</a> • 🐦 <a href=\"https://twitter.com/thukeg\" target=\"_blank\">Twitter</a>\n</p>\n\nRead this in [English](./README_en.md)\n\nGLM-4-Voice 是智谱 AI 推出的端到端语音模型。GLM-4-Voice 能够直接理解和生成中英文语音，进行实时语音对话，并且能够遵循用户的指令要求改变语音的情感、语调、语速、方言等属性。\n\n## Model Architecture\n![Model Architecture](./resources/architecture.jpeg)\n\nGLM-4-Voice 由三个部分组成：\n* GLM-4-Voice-Tokenizer: 通过在 [Whisper](https://github.com/openai/whisper) 的 Encoder 部分增加 Vector Quantization 并在 ASR 数据上有监督训练，将连续的语音输入转化为离散的 token。每秒音频平均只需要用 12.5 个离散 token 表示。\n* GLM-4-Voice-Decoder: 基于 [CosyVoice](https://github.com/FunAudioLLM/CosyVoice) 的 Flow Matching 模型结构训练的支持流式推理的语音解码器，将离散化的语音 token 转化为连续的语音输出。最少只需要 10 个语音 token 即可开始生成，降低端到端对话延迟。\n* GLM-4-Voice-9B: 在 [GLM-4-9B](https://github.com/THUDM/GLM-4) 的基础上进行语音模态的预训练和对齐，从而能够理解和生成离散化的语音 token。\n\n预训练方面，为了攻克模型在语音模态下的智商和合成表现力两个难关，我们将 Speech2Speech 任务解耦合为“根据用户音频做出文本回复”和“根据文本回复和用户语音合成回复语音”两个任务，并设计两种预训练目标，分别基于文本预训练数据和无监督音频数据合成语音-文本交错数据以适配这两种任务形式。GLM-4-Voice-9B 在 GLM-4-9B 的基座模型基础之上，经过了数百万小时音频和数千亿 token 的音频文本交错数据预训练，拥有很强的音频理解和建模能力。\n\n对齐方面，为了支持高质量的语音对话，我们设计了一套流式思考架构：根据用户语音，GLM-4-Voice 可以流式交替输出文本和语音两个模态的内容，其中语音模态以文本作为参照保证回复内容的高质量，并根据用户的语音指令要求做出相应的声音变化，在最大程度保留语言模型智商的情况下仍然具有端到端建模的能力，同时具备低延迟性，最低只需要输出 20 个 token 便可以合成语音。\n\n## Model List\n\n| Model | Type | Download |\n|:---------------------:|:----------------:|:------------------------------------------------------------------------------------------------------------------------------------------------:|\n| GLM-4-Voice-Tokenizer | Speech Tokenizer | [🤗 Huggingface](https://huggingface.co/THUDM/glm-4-voice-tokenizer) [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/glm-4-voice-tokenizer) |\n| GLM-4-Voice-9B | Chat Model | "},{"ref":"P22","kind":"page","title":"zai-org/CodeGeeX4 repository metadata","date":"2026-06-11T01:45:58.288131+00:00","date_source":null,"source_url":"https://github.com/zai-org/CodeGeeX4","signal_url":null,"signal_json_url":null,"text":"# zai-org/CodeGeeX4\n\nDescription: CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 2530\n\nForks: 264\n\nOpen issues: 120\n\nCreated: 2024-07-03T11:01:55Z\n\nPushed: 2024-08-25T14:26:16Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n![](resources/logo.jpeg)\n\n<p align=\"center\">\n🏠 <a href=\"https://codegeex.cn\" target=\"_blank\">Homepage</a>｜🛠 Extensions <a href=\"https://marketplace.visualstudio.com/items?itemName=aminer.codegeex\" target=\"_blank\">VS Code</a>, <a href=\"https://plugins.jetbrains.com/plugin/20587-codegeex\" target=\"_blank\">Jetbrains</a>｜🤗 <a href=\"https://huggingface.co/THUDM/codegeex4-all-9b\" target=\"_blank\">HF Repo</a> | 🪧 <a href=\"https://huggingface.co/spaces/THUDM/CodeGeeX\" target=\"_blank\">HF DEMO</a>\n</p>\n\n[English](./README.md) | [中文](./README_zh.md) | [日本語](./README_ja.md)\n\n# CodeGeeX4: Open Multilingual Code Generation Model\n\nWe introduce CodeGeeX4-ALL-9B, the open-source version of the latest CodeGeeX4 model series. It is a multilingual code generation model continually trained on the [GLM-4-9B](https://github.com/THUDM/GLM-4), significantly enhancing its code generation capabilities. Using a single CodeGeeX4-ALL-9B model, it can support comprehensive functions such as code completion and generation, code interpreter, web search, function call, repository-level code Q&A, covering various scenarios of software development. CodeGeeX4-ALL-9B has achieved highly competitive performance on public benchmarks, such as [BigCodeBench](https://huggingface.co/datasets/bigcode/bigcodebench) and [NaturalCodeBench](https://github.com/THUDM/NaturalCodeBench). It is currently the most powerful code generation model with less than 10B parameters, even surpassing much larger general-purpose models, achieving the best balance in terms of inference speed and model performance.\n\n## Model List\n\n| Model | Type | Seq Length | Download |\n|-------------------|------|------------|-------------------------------------------------------------------------------------------------------"},{"ref":"P23","kind":"page","title":"zai-org/GLM-Edge repository metadata","date":"2026-06-11T01:45:58.135892+00:00","date_source":null,"source_url":"https://github.com/zai-org/GLM-Edge","signal_url":null,"signal_json_url":null,"text":"# zai-org/GLM-Edge\n\nDescription: GLM Series Edge Models\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 163\n\nForks: 17\n\nOpen issues: 7\n\nCreated: 2024-11-19T03:26:48Z\n\nPushed: 2025-06-12T02:10:02Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# GLM-Edge\n\nRead this in [English](README_en.md)\n\n在 <a href=\"https://huggingface.co/spaces/THUDM-HF-SPACE/GLM-Edge-1.5B-Chat-Space\" target=\"_blank\"> 🤗 这里</a> 体验 GLM-Edge-1.5B-Chat 端侧模型\n\n在 <a href=\"https://huggingface.co/spaces/THUDM-HF-SPACE/GLM-Edge-V-5B-Space\" target=\"_blank\"> 🤗 这里</a> 或者 <a href=\"https://modelscope.cn/studios/ZhipuAI/GLM-Edge-V-5B-Demo\" target=\"_blank\"> 🤖 这里</a> 体验 GLM-Edge-V-5B 端侧模型\n\n## 模型介绍\n\n**GLM-Edge** 系列是我们在面向端侧真实落地使用的场景下的一次尝试，由两种尺寸的大语言对话模型和多模态理解模型组成（\n`GLM-Edge-1.5B-Chat`，`GLM-Edge-4B-Chat`，`GLM-Edge-V-2B`，`GLM-Edge-V-5B`）。其中，`1.5B / 2B`模型主要面向手机、车机等平台，\n`4B / 5B` 模型主要面向PC等平台。\n\n基于GLM-4系列的技术积累，我们针对端侧实际部署情况，对模型结构和尺寸做了针对性的调整，以求在模型表现、实机推理效果和落地便利度之间达到平衡。同时，通过与伙伴企业的深入合作和在推理优化上的不懈努力，在一些端侧平台上，GLM-Edge系列模型能以极快的速度运行。\n\n例如，在高通骁龙8 Elite平台上，借助其强大的NPU算力，GLM-Edge通过混合量化方案，1.5B对话模型、2B多模态模型能实现每秒60\ntokens以上的解码速度。在应用投机采样技术之后，两个模型能以峰值每秒100 tokens以上的解码速度运行。**这些推理方案会由我们或合作伙伴后续放出。暂时不会在本仓库提供。**\n模型下载地址：\n\n| Model | HuggingFace Model | GGUF Model |\n|:------------------:|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------:|\n| GLM-Edge-1.5B-Chat | [🤗 Huggingface](https://huggingface.co/THUDM/glm-edge-1.5b-chat)<br> [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/glm-edge-1.5b-chat) <br> [🟣 WiseModel](https://wisemodel.cn/models/ZhipuAI/glm-edge-1.5b-chat) | [🤗 Huggingface](https://huggingface.co/THUDM/glm-edge-1.5b-chat-gguf)<br> [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/glm-edge-1.5b-chat-gguf) <br> [🟣 WiseModel](https://wisemodel.cn/models/ZhipuAI/glm-edge-1.5b-chat-gguf) |\n| GLM-Edge-4B-Chat | [🤗 Huggi"},{"ref":"P24","kind":"page","title":"zai-org/MotionBench repository metadata","date":"2026-06-11T01:45:57.878871+00:00","date_source":null,"source_url":"https://github.com/zai-org/MotionBench","signal_url":null,"signal_json_url":null,"text":"# zai-org/MotionBench\n\nDescription: Official code for MotionBench (CVPR 2025)\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 74\n\nForks: 3\n\nOpen issues: 9\n\nCreated: 2024-11-25T12:03:53Z\n\nPushed: 2025-03-03T06:36:49Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models\n\n![VideoQA](https://img.shields.io/badge/Task-VideoQA-red)\n![Multi-Modal](https://img.shields.io/badge/Task-Multi--Modal-red)\n![MotionBench](https://img.shields.io/badge/Dataset-MotionBench-blue) \n\n<font size=7><div align='center' > [[🍎 Project Page](https://motion-bench.github.io/)] [[📖 arXiv Paper](https://arxiv.org/abs/2501.02955)] [[📊 Dataset](https://huggingface.co/datasets/THUDM/MotionBench)][[🏆 Leaderboard](https://motion-bench.github.io/#leaderboard)][[🏆 Huggingface Leaderboard](https://huggingface.co/spaces/THUDM/MotionBench)] </div></font>\n\n<p align=\"center\">\n<img src=\"./docs/image1.png\" width=\"96%\" height=\"50%\">\n</p>\n\nMotionBench aims to guide and motivate the development of more capable video understanding models, emphasizing the importance of fine-grained motion comprehension.\n\n---\n\n## 🔥 News\n\n* **`2025.02.27`** 🎉🎉🎉 MotionBench is accepted by CVPR 2025!!\n* **`2025.01.06`** 🌟🌟🌟 We released MotionBench, a new benchmark for fine-grained motion comprehension!\n\n## Introduction\n\nIn recent years, vision language models (VLMs) have made significant advancements in video understanding. However, a crucial capability — fine-grained motion comprehension — remains under-explored in current benchmarks. To address this gap, we propose MotionBench, a comprehensive evaluation benchmark designed to assess the fine-grained motion comprehension of video understanding models. \n\n### Features\n\n1. **Core Capabilities**: Six core capabilities for fine-grained motion understanding, enabling the evaluation of motion-level perception.\n2. **Diverse Data**: MotionBench collects diverse video from the web, public datasets, and self-synthetic videos generated via Unity3, capturing a broad distribution of real-world\napplication.\n3. **High-Quality Annotations**: Reliable benchmark with meticulous hu"},{"ref":"P25","kind":"page","title":"zai-org/z-ai-sdk-python repository metadata","date":"2026-06-11T01:45:57.554667+00:00","date_source":null,"source_url":"https://github.com/zai-org/z-ai-sdk-python","signal_url":null,"signal_json_url":null,"text":"# zai-org/z-ai-sdk-python\n\nDescription: The official Python SDK for Z.ai's large model open interface, making it easier for developers to call Z.ai's open APIs.\n\nLanguage: Python\n\nLicense: MIT\n\nStars: 110\n\nForks: 28\n\nOpen issues: 8\n\nCreated: 2025-07-01T11:15:24Z\n\nPushed: 2026-06-03T09:59:27Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Z.ai Open Platform Python SDK\n\n[![PyPI version](https://img.shields.io/pypi/v/zai-sdk.svg)](https://pypi.org/project/zai-sdk/)\n[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)\n[![Python](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)\n\n[中文文档](README_CN.md) | English\n\n[Z.ai Open Platform](https://docs.z.ai/) The official Python SDK for Z.ai's large model open interface, making it easier for developers to call Z.ai's open APIs.\n\n## ✨ Core Features\n\n### 🤖 **Chat Completions**\n- **Standard Chat**: Create chat completions with various models including `glm-5.1`\n- **Streaming Support**: Real-time streaming responses for interactive applications\n- **Tool Calling**: Function calling capabilities for enhanced AI interactions\n- **Multimodal Chat**: Image understanding capabilities with vision models\n\n### 🧠 **Embeddings**\n- **Text Embeddings**: Generate high-quality vector embeddings for text\n- **Configurable Dimensions**: Customizable embedding dimensions\n- **Batch Processing**: Support for multiple inputs in a single request\n\n### 🎥 **Video Generation**\n- **Text-to-Video**: Generate videos from text prompts\n- **Image-to-Video**: Create videos from image inputs\n- **Customizable Parameters**: Control quality, duration, FPS, and size\n- **Audio Support**: Optional audio generation for videos\n\n### 🎵 **Audio Processing**\n- **Speech Transcription**: Convert audio files to text\n- **Multiple Formats**: Support for various audio file formats\n\n### 🤝 **Assistant API**\n- **Conversation Management**: Structured conversation handling\n- **Streaming Conversations**: Real-time assistant interactions\n- **Metadata Support**: Rich conversation context and user information\n\n### 🔧 **Advanced Tools**\n- **Web Search**: Integrated web search capabiliti"},{"ref":"P26","kind":"page","title":"zai-org/GLM-V repository metadata","date":"2026-06-11T01:45:57.533775+00:00","date_source":null,"source_url":"https://github.com/zai-org/GLM-V","signal_url":null,"signal_json_url":null,"text":"# zai-org/GLM-V\n\nDescription: GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 2326\n\nForks: 171\n\nOpen issues: 12\n\nCreated: 2025-06-28T08:44:06Z\n\nPushed: 2026-05-16T05:42:14Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# GLM-V\n\n[中文阅读.](./README_zh.md)\n\n<div align=\"center\">\n<img src=resources/logo.svg width=\"40%\"/>\n</div>\n<p align=\"center\">\n👋 Join our <a href=\"resources/WECHAT.md\" target=\"_blank\">WeChat</a> and <a href=\"https://discord.gg/eQbGCYS9ym\" target=\"_blank\">Discord</a> communities.\n<br>\n📖 Check out the GLM-4.6V <a href=\"https://z.ai/blog/glm-4.6v\" target=\"_blank\">blog</a> and GLM-4.5V & GLM-4.1V <a href=\"https://arxiv.org/abs/2507.01006\" target=\"_blank\">paper</a>.\n<br>\n📍 Try <a href=\"https://chat.z.ai/\" target=\"_blank\">online</a> or use the <a href=\"https://docs.z.ai/guides/vlm/glm-4.6v\" target=\"_blank\">API</a>.\n</p>\n\n## Introduction\n\nVision-language models (VLMs) have become a key cornerstone of intelligent systems. As real-world AI tasks grow\nincreasingly complex, VLMs urgently need to enhance reasoning capabilities beyond basic multimodal perception —\nimproving accuracy, comprehensiveness, and intelligence — to enable complex problem solving, long-context understanding,\nand multimodal agents.\n\nThrough our open-source work, we aim to explore the technological frontier together with the community while empowering\nmore developers to create exciting and innovative applications.\n\n**This open-source repository contains our `GLM-4.6V`, `GLM-4.5V` and `GLM-4.1V` series models.** For performance and\ndetails, see [Model Overview](#model-overview). For known issues,\nsee [Fixed and Remaining Issues](#fixed-and-remaining-issues).\n\n## Project Updates\n\n- **News**: `2026/04/02`: We released [GLM-5V-Turbo](https://docs.z.ai/guides/vlm/glm-5v-turbo) \nand [GLM-skills](https://github.com/zai-org/GLM-skills).\n- **News**: `2026/03/28`: We have released multiple GLM-V related Skills, covering several specialized areas\nsuch as GLM-V-Grounding and GLM-V-Prompt-Gen. You are welcome to try them [here](skills).\n- **News**: `2025/11/10`: We released **UI2Code^N**, a"},{"ref":"P27","kind":"page","title":"zai-org/VisionReward repository metadata","date":"2026-06-11T01:45:57.439451+00:00","date_source":null,"source_url":"https://github.com/zai-org/VisionReward","signal_url":null,"signal_json_url":null,"text":"# zai-org/VisionReward\n\nDescription: [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation\n\nLanguage: Python\n\nLicense: Apache-2.0\n\nStars: 407\n\nForks: 14\n\nOpen issues: 19\n\nCreated: 2024-12-12T11:20:46Z\n\nPushed: 2025-03-26T10:07:04Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# **VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation**\n\n<p align=\"center\">\n📃 <a href=\"https://arxiv.org/abs/2412.21059\" target=\"_blank\">Paper</a> • 🖼 <a href=\"https://huggingface.co/datasets/THUDM/VisionRewardDB-Video\" target=\"_blank\">Dataset</a> • 🤗 <a href=\"https://huggingface.co/THUDM/VisionReward-Video\" target=\"_blank\">HF Repo</a> • 🌐 <a href=\"https://zhuanlan.zhihu.com/p/16481080277\" target=\"_blank\">中文博客</a> <br>\n</p>\n\n**VisionReward** is a fine-grained, multi-dimensional reward model designed to capture human preferences in images and videos. By breaking down subjective judgments into interpretable dimensions with weighted scoring, it delivers precise and comprehensive evaluations. Excelling in video quality prediction, VisionReward sets a new benchmark by thoroughly analyzing dynamic video features. \n\n<p align=\"center\">\n<img src=\"asset/resource/TopDemo.jpg\" width=\"90%\">\n</p>\n\n✨ **Key Highlights**:\n- **New reward model& SOTA Performance:** VisionReward, a fine-grained, multi-dimensional, interpretable reward model, achieves **64.0 (Tau)** / **72.1 (Diff)** on **[Video Preference Test Set](https://huggingface.co/datasets/THUDM/VisionRewardDB-Video/viewer/test)**, surpassing **VideoScore** by 17.2% and setting a new **state-of-the-art**! \n- **Fine-Grained Multidimensional Dataset**: A rich, high-quality dataset with detailed annotations drives VisionReward’s precise understanding of human preferences across images and videos.\n- **Multi-objective preference optimization(MPO):** Achives stable and controllable RLHF, enabling the generate model to consider and balance multiple dimensions of human preferences simultaneously.\n\n<div align=\"center\">\n<img src=\"asset/resource/OverView.png\" width=\"90%\"/> \n</div>\n\n## 🚀 Release Information\n\n### ✨ **Models**\n<table style=\"bo"},{"ref":"P28","kind":"page","title":"zai-org/ComplexFuncBench repository metadata","date":"2026-06-11T01:45:57.344318+00:00","date_source":null,"source_url":"https://github.com/zai-org/ComplexFuncBench","signal_url":null,"signal_json_url":null,"text":"# zai-org/ComplexFuncBench\n\nDescription: Complex Function Calling Benchmark.\n\nLanguage: Python\n\nStars: 180\n\nForks: 31\n\nOpen issues: 8\n\nCreated: 2025-01-16T11:32:30Z\n\nPushed: 2025-01-20T03:04:21Z\n\nDefault branch: main\n\nFork: no\n\nArchived: no\n\nREADME:\n# Complex Function Calling Benchmark (ComplexFuncBench)\n\n<p align=\"center\">\n📄<a href=\"https://arxiv.org/abs/2501.10132\" target=\"_blank\"> Arxiv Paper </a> • 🤗 <a href=\"https://huggingface.co/papers/2501.10132\" target=\"_blank\">HF Paper</a> • 📊 <a href=\"https://huggingface.co/datasets/THUDM/ComplexFuncBench\" target=\"_blank\">Dataset</a>\n</p>\n\n## Table of Contents\n\n- [Introduction](#introduction)\n- [Leaderboard](#Leaderboard)\n- [Method](#Method)\n- [How to evaluate on ComplexFuncBench](#how-to-evaluate-on-complexfuncbench)\n- [Citation](#citation)\n\n## Introduction\n\nComplex Function Calling Benchmark (`ComplexFuncBench`) is specillly designed for complex function calling evaluation. The ComplexFuncBench dataset encompass 1,000 complex function calling samples from five aspects: (1) Function calling with **multi-step** in single turn; (2) Function calling with user-provided **constraints**; (3) Function calling that requires **parameter value reasoning** from implicit information; (4) Function calling with **long parameter values** that exceed 500 tokens; and (5) Function calling with **128k long-context** length.\n\n![Complex Example](./resources/complex-example.png)\n\nThe difference between `ComplexFuncBench` and other function calling benchmarks is shown in the following table.\n\n| | Real API Response | Multi-Step | Constraints | Parameter Value Reasoning | Long Parameter Reasoning | Long-Context |\n| :----------------: | :---------------: | :--------: | :---------: | :-----------------------: | :----------------------: | :----------: |\n| API-Bench | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |\n| ToolBench | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ |\n| T-Eval | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ |\n| BFCL | ❌ | ✅ | ❌ | ❌ | ✅ | ✅ |\n| Tool Sandbox | ❌ | ✅ | ❌ | ❌ | ❌ | ❌ |\n| `ComplexFuncBench` | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |\n\n## Leaderboard\n\n| Model | Overall Success Rate | Overall Call Acc. | Completeness | Correctness |\n| :--------------------------- | :------------------: | :-------"},{"ref":"E1","kind":"event","title":"zai-org/GLM-5","date":"2026-02-11T04:55:46+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-5","signal_url":"https://onlylabs.fyi/signals/72ee3bf6-586a-49e7-8f29-133f219c9fb7","signal_json_url":"https://onlylabs.fyi/signals/72ee3bf6-586a-49e7-8f29-133f219c9fb7/signal.json","text":"model_released · zai-org/GLM-5 · signal_desk=releases · occurred_at=2026-02-11T04:55:46+00:00 · url=https://huggingface.co/zai-org/GLM-5 · hf_downloads=102786 · hf_likes=2093 · hf_params=753864139008 · pipeline=text-generation · license=mit"},{"ref":"E2","kind":"event","title":"zai-org/GLM-4.7","date":"2025-12-22T07:45:52+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.7","signal_url":"https://onlylabs.fyi/signals/04722f66-05e2-4ba8-862c-2ba6fba5b149","signal_json_url":"https://onlylabs.fyi/signals/04722f66-05e2-4ba8-862c-2ba6fba5b149/signal.json","text":"model_released · zai-org/GLM-4.7 · signal_desk=releases · occurred_at=2025-12-22T07:45:52+00:00 · url=https://huggingface.co/zai-org/GLM-4.7 · hf_downloads=66288 · hf_likes=2041 · hf_params=358337791296 · pipeline=text-generation · license=mit"},{"ref":"E3","kind":"event","title":"zai-org/GLM-OCR","date":"2026-01-30T04:24:21+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-OCR","signal_url":"https://onlylabs.fyi/signals/8ed36a7c-374b-4509-9f39-7362837a795c","signal_json_url":"https://onlylabs.fyi/signals/8ed36a7c-374b-4509-9f39-7362837a795c/signal.json","text":"model_released · zai-org/GLM-OCR · signal_desk=releases · occurred_at=2026-01-30T04:24:21+00:00 · url=https://huggingface.co/zai-org/GLM-OCR · hf_downloads=3607869 · hf_likes=1820 · hf_params=1325258240 · pipeline=image-text-to-text · license=mit"},{"ref":"E4","kind":"event","title":"zai-org/GLM-4.7-Flash","date":"2026-01-19T06:28:10+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.7-Flash","signal_url":"https://onlylabs.fyi/signals/4df2cc6a-f8f6-4197-9766-c8ede6451b82","signal_json_url":"https://onlylabs.fyi/signals/4df2cc6a-f8f6-4197-9766-c8ede6451b82/signal.json","text":"model_released · zai-org/GLM-4.7-Flash · signal_desk=releases · occurred_at=2026-01-19T06:28:10+00:00 · url=https://huggingface.co/zai-org/GLM-4.7-Flash · hf_downloads=1100933 · hf_likes=1745 · hf_params=31221488576 · pipeline=text-generation · license=mit"},{"ref":"E5","kind":"event","title":"zai-org/GLM-5.1","date":"2026-04-03T09:28:47+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-5.1","signal_url":"https://onlylabs.fyi/signals/f4fb4df8-c759-44a5-bc98-bac22072971f","signal_json_url":"https://onlylabs.fyi/signals/f4fb4df8-c759-44a5-bc98-bac22072971f/signal.json","text":"model_released · zai-org/GLM-5.1 · signal_desk=releases · occurred_at=2026-04-03T09:28:47+00:00 · url=https://huggingface.co/zai-org/GLM-5.1 · hf_downloads=123470 · hf_likes=1754 · hf_params=753864139008 · pipeline=text-generation · license=mit"},{"ref":"E6","kind":"event","title":"zai-org/GLM-OCR","date":"2026-02-02T12:59:43+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-OCR","signal_url":"https://onlylabs.fyi/signals/96b785f6-e53f-411c-8db5-de694472d179","signal_json_url":"https://onlylabs.fyi/signals/96b785f6-e53f-411c-8db5-de694472d179/signal.json","text":"repo_new · zai-org/GLM-OCR · signal_desk=repos · occurred_at=2026-02-02T12:59:43+00:00 · url=https://github.com/zai-org/GLM-OCR · stars=6935 · hn=302 points/75 comments · raw={\"repo\":\"zai-org/GLM-OCR\",\"description\":\"GLM-OCR: Accurate ×  Fast × Comprehensive\",\"language\":\"Python\"}"},{"ref":"E7","kind":"event","title":"zai-org/GLM-4.5","date":"2025-07-20T03:25:36+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.5","signal_url":"https://onlylabs.fyi/signals/215ba15c-649f-49a6-b687-455309a9c640","signal_json_url":"https://onlylabs.fyi/signals/215ba15c-649f-49a6-b687-455309a9c640/signal.json","text":"model_released · zai-org/GLM-4.5 · signal_desk=releases · occurred_at=2025-07-20T03:25:36+00:00 · url=https://huggingface.co/zai-org/GLM-4.5 · hf_downloads=176737 · hf_likes=1402 · hf_params=358337791296 · pipeline=text-generation · license=mit"},{"ref":"E8","kind":"event","title":"zai-org/GLM-4.6","date":"2025-09-29T18:22:51+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.6","signal_url":"https://onlylabs.fyi/signals/fddefdbd-9857-4445-accd-5de1c68743bc","signal_json_url":"https://onlylabs.fyi/signals/fddefdbd-9857-4445-accd-5de1c68743bc/signal.json","text":"model_released · zai-org/GLM-4.6 · signal_desk=releases · occurred_at=2025-09-29T18:22:51+00:00 · url=https://huggingface.co/zai-org/GLM-4.6 · hf_downloads=13553 · hf_likes=1227 · hf_params=356785898816 · pipeline=text-generation · license=mit"},{"ref":"E9","kind":"event","title":"zai-org/GLM-Image","date":"2026-01-08T09:39:56+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-Image","signal_url":"https://onlylabs.fyi/signals/63a74604-38af-4f72-a107-9ed3b5efa558","signal_json_url":"https://onlylabs.fyi/signals/63a74604-38af-4f72-a107-9ed3b5efa558/signal.json","text":"model_released · zai-org/GLM-Image · signal_desk=releases · occurred_at=2026-01-08T09:39:56+00:00 · url=https://huggingface.co/zai-org/GLM-Image · hf_downloads=7925 · hf_likes=1073 · pipeline=text-to-image · license=mit"},{"ref":"E10","kind":"event","title":"zai-org/GLM-4.1V-9B-Thinking","date":"2025-06-28T14:24:10+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.1V-9B-Thinking","signal_url":"https://onlylabs.fyi/signals/bc302628-a073-47dc-9f70-7a3305ca9f56","signal_json_url":"https://onlylabs.fyi/signals/bc302628-a073-47dc-9f70-7a3305ca9f56/signal.json","text":"model_released · zai-org/GLM-4.1V-9B-Thinking · signal_desk=releases · occurred_at=2025-06-28T14:24:10+00:00 · url=https://huggingface.co/zai-org/GLM-4.1V-9B-Thinking · hf_downloads=461239 · hf_likes=776 · hf_params=10292777472 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E11","kind":"event","title":"zai-org/GLM-4.5V","date":"2025-08-10T13:55:30+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.5V","signal_url":"https://onlylabs.fyi/signals/662ae0ee-c232-4ca0-91e3-2b36d37fb07d","signal_json_url":"https://onlylabs.fyi/signals/662ae0ee-c232-4ca0-91e3-2b36d37fb07d/signal.json","text":"model_released · zai-org/GLM-4.5V · signal_desk=releases · occurred_at=2025-08-10T13:55:30+00:00 · url=https://huggingface.co/zai-org/GLM-4.5V · hf_downloads=167014 · hf_likes=718 · hf_params=107710933120 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E12","kind":"event","title":"zai-org/GLM-4.5-Air","date":"2025-07-20T03:25:55+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.5-Air","signal_url":"https://onlylabs.fyi/signals/8a24013c-90c7-4b04-98c7-fd4c65cef757","signal_json_url":"https://onlylabs.fyi/signals/8a24013c-90c7-4b04-98c7-fd4c65cef757/signal.json","text":"model_released · zai-org/GLM-4.5-Air · signal_desk=releases · occurred_at=2025-07-20T03:25:55+00:00 · url=https://huggingface.co/zai-org/GLM-4.5-Air · hf_downloads=319702 · hf_likes=608 · hf_params=110468824832 · pipeline=text-generation · license=mit"},{"ref":"E13","kind":"event","title":"zai-org/GLM-4.6V-Flash","date":"2025-12-07T06:30:04+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.6V-Flash","signal_url":"https://onlylabs.fyi/signals/7d186301-8e5c-44c2-89e3-647fe696d67a","signal_json_url":"https://onlylabs.fyi/signals/7d186301-8e5c-44c2-89e3-647fe696d67a/signal.json","text":"model_released · zai-org/GLM-4.6V-Flash · signal_desk=releases · occurred_at=2025-12-07T06:30:04+00:00 · url=https://huggingface.co/zai-org/GLM-4.6V-Flash · hf_downloads=57876 · hf_likes=609 · hf_params=10292777472 · pipeline=image-text-to-text · license=mit"},{"ref":"E14","kind":"event","title":"zai-org/AutoGLM-Phone-9B","date":"2025-12-08T08:20:34+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/AutoGLM-Phone-9B","signal_url":"https://onlylabs.fyi/signals/80c71616-6238-4b7c-b172-7e11b7474c53","signal_json_url":"https://onlylabs.fyi/signals/80c71616-6238-4b7c-b172-7e11b7474c53/signal.json","text":"model_released · zai-org/AutoGLM-Phone-9B · signal_desk=releases · occurred_at=2025-12-08T08:20:34+00:00 · url=https://huggingface.co/zai-org/AutoGLM-Phone-9B · hf_downloads=677 · hf_likes=437 · hf_params=934400 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E15","kind":"event","title":"zai-org/GLM-4.6V","date":"2025-12-07T07:20:45+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.6V","signal_url":"https://onlylabs.fyi/signals/002914ac-5247-47e5-90c0-3d99a4be657b","signal_json_url":"https://onlylabs.fyi/signals/002914ac-5247-47e5-90c0-3d99a4be657b/signal.json","text":"model_released · zai-org/GLM-4.6V · signal_desk=releases · occurred_at=2025-12-07T07:20:45+00:00 · url=https://huggingface.co/zai-org/GLM-4.6V · hf_downloads=2987 · hf_likes=392 · hf_params=107710933120 · pipeline=image-text-to-text · license=mit"},{"ref":"E16","kind":"event","title":"zai-org/GLM-ASR-Nano-2512","date":"2025-12-09T09:07:41+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-ASR-Nano-2512","signal_url":"https://onlylabs.fyi/signals/2addea5d-e5ad-4400-9692-6dcb25cade09","signal_json_url":"https://onlylabs.fyi/signals/2addea5d-e5ad-4400-9692-6dcb25cade09/signal.json","text":"model_released · zai-org/GLM-ASR-Nano-2512 · signal_desk=releases · occurred_at=2025-12-09T09:07:41+00:00 · url=https://huggingface.co/zai-org/GLM-ASR-Nano-2512 · hf_downloads=118352 · hf_likes=370 · hf_params=2257843200 · pipeline=automatic-speech-recognition · license=mit"},{"ref":"E17","kind":"event","title":"zai-org/GLM-TTS","date":"2025-12-10T09:59:48+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-TTS","signal_url":"https://onlylabs.fyi/signals/1550a4df-6582-44e0-846d-c01934c649f9","signal_json_url":"https://onlylabs.fyi/signals/1550a4df-6582-44e0-846d-c01934c649f9/signal.json","text":"model_released · zai-org/GLM-TTS · signal_desk=releases · occurred_at=2025-12-10T09:59:48+00:00 · url=https://huggingface.co/zai-org/GLM-TTS · hf_downloads=314 · hf_likes=338 · pipeline=text-to-speech · license=mit"},{"ref":"E18","kind":"event","title":"zai-org/AutoGLM-Phone-9B-Multilingual","date":"2025-12-09T04:16:21+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/AutoGLM-Phone-9B-Multilingual","signal_url":"https://onlylabs.fyi/signals/886bc9f2-dc7b-4935-ae5e-aa1ce2bbe3b1","signal_json_url":"https://onlylabs.fyi/signals/886bc9f2-dc7b-4935-ae5e-aa1ce2bbe3b1/signal.json","text":"model_released · zai-org/AutoGLM-Phone-9B-Multilingual · signal_desk=releases · occurred_at=2025-12-09T04:16:21+00:00 · url=https://huggingface.co/zai-org/AutoGLM-Phone-9B-Multilingual · hf_downloads=658 · hf_likes=234 · hf_params=934400 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E19","kind":"event","title":"zai-org/ChatGLM-6B","date":"2023-03-13T12:00:18+00:00","date_source":"source","source_url":"https://github.com/zai-org/ChatGLM-6B","signal_url":"https://onlylabs.fyi/signals/d48bf6d2-05da-4c99-b02b-ba379621db54","signal_json_url":"https://onlylabs.fyi/signals/d48bf6d2-05da-4c99-b02b-ba379621db54/signal.json","text":"repo_new · zai-org/ChatGLM-6B · signal_desk=repos · occurred_at=2023-03-13T12:00:18+00:00 · url=https://github.com/zai-org/ChatGLM-6B · stars=41047 · raw={\"repo\":\"zai-org/ChatGLM-6B\",\"description\":\"ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型\",\"language\":\"Python\"}"},{"ref":"E20","kind":"event","title":"zai-org/Open-AutoGLM","date":"2025-12-08T09:23:44+00:00","date_source":"source","source_url":"https://github.com/zai-org/Open-AutoGLM","signal_url":"https://onlylabs.fyi/signals/fe7d57d8-d5cf-461e-9a73-256085f50a1c","signal_json_url":"https://onlylabs.fyi/signals/fe7d57d8-d5cf-461e-9a73-256085f50a1c/signal.json","text":"repo_new · zai-org/Open-AutoGLM · signal_desk=repos · occurred_at=2025-12-08T09:23:44+00:00 · url=https://github.com/zai-org/Open-AutoGLM · stars=25486 · hn=2 points/0 comments · raw={\"repo\":\"zai-org/Open-AutoGLM\",\"description\":\"An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone\",\"language\":\"Python\"}"},{"ref":"E21","kind":"event","title":"zai-org/SCAIL-2","date":"2026-06-09T07:18:27+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/SCAIL-2","signal_url":"https://onlylabs.fyi/signals/868269e7-f86e-47ba-bdf0-df7043ea921b","signal_json_url":"https://onlylabs.fyi/signals/868269e7-f86e-47ba-bdf0-df7043ea921b/signal.json","text":"model_released · zai-org/SCAIL-2 · signal_desk=releases · occurred_at=2026-06-09T07:18:27+00:00 · url=https://huggingface.co/zai-org/SCAIL-2 · hf_likes=90 · pipeline=image-to-video · license=mit"},{"ref":"E22","kind":"event","title":"zai-org/GLM-Z1-Rumination-32B-0414","date":"2025-04-13T12:06:04+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-Z1-Rumination-32B-0414","signal_url":"https://onlylabs.fyi/signals/578a3d75-074b-4a32-bf9e-a88dd6001710","signal_json_url":"https://onlylabs.fyi/signals/578a3d75-074b-4a32-bf9e-a88dd6001710/signal.json","text":"model_released · zai-org/GLM-Z1-Rumination-32B-0414 · signal_desk=releases · occurred_at=2025-04-13T12:06:04+00:00 · url=https://huggingface.co/zai-org/GLM-Z1-Rumination-32B-0414 · hf_downloads=181 · hf_likes=118 · hf_params=33141749760 · pipeline=text-generation · license=mit"},{"ref":"E23","kind":"event","title":"zai-org/SCAIL-Preview","date":"2025-12-08T12:40:27+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/SCAIL-Preview","signal_url":"https://onlylabs.fyi/signals/79dd192e-4c67-474d-b7f1-9d1ea8c12950","signal_json_url":"https://onlylabs.fyi/signals/79dd192e-4c67-474d-b7f1-9d1ea8c12950/signal.json","text":"model_released · zai-org/SCAIL-Preview · signal_desk=releases · occurred_at=2025-12-08T12:40:27+00:00 · url=https://huggingface.co/zai-org/SCAIL-Preview · hf_likes=111 · license=mit"},{"ref":"E24","kind":"event","title":"zai-org/RealVideo","date":"2025-12-11T08:04:35+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/RealVideo","signal_url":"https://onlylabs.fyi/signals/35a8289c-ba50-433f-8b0b-c9b5be5a4618","signal_json_url":"https://onlylabs.fyi/signals/35a8289c-ba50-433f-8b0b-c9b5be5a4618/signal.json","text":"model_released · zai-org/RealVideo · signal_desk=releases · occurred_at=2025-12-11T08:04:35+00:00 · url=https://huggingface.co/zai-org/RealVideo · hf_likes=104 · pipeline=any-to-any · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E25","kind":"event","title":"zai-org/ChatGLM2-6B","date":"2023-06-24T06:21:34+00:00","date_source":"source","source_url":"https://github.com/zai-org/ChatGLM2-6B","signal_url":"https://onlylabs.fyi/signals/f5c0af47-7ec5-4126-b5cb-b00eb4aa3eb5","signal_json_url":"https://onlylabs.fyi/signals/f5c0af47-7ec5-4126-b5cb-b00eb4aa3eb5/signal.json","text":"repo_new · zai-org/ChatGLM2-6B · signal_desk=repos · occurred_at=2023-06-24T06:21:34+00:00 · url=https://github.com/zai-org/ChatGLM2-6B · stars=15568 · raw={\"repo\":\"zai-org/ChatGLM2-6B\",\"description\":\"ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型\",\"language\":\"Python\"}"},{"ref":"E26","kind":"event","title":"zai-org/Glyph","date":"2025-10-25T06:19:07+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/Glyph","signal_url":"https://onlylabs.fyi/signals/6c3899af-9e45-4ca1-b127-b42c4365bccb","signal_json_url":"https://onlylabs.fyi/signals/6c3899af-9e45-4ca1-b127-b42c4365bccb/signal.json","text":"model_released · zai-org/Glyph · signal_desk=releases · occurred_at=2025-10-25T06:19:07+00:00 · url=https://huggingface.co/zai-org/Glyph · hf_downloads=891 · hf_likes=73 · hf_params=10292777472 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E27","kind":"event","title":"zai-org/ChatGLM3","date":"2023-10-26T06:22:47+00:00","date_source":"source","source_url":"https://github.com/zai-org/ChatGLM3","signal_url":"https://onlylabs.fyi/signals/b874a054-71e6-48ab-b2e5-f193ec606c57","signal_json_url":"https://onlylabs.fyi/signals/b874a054-71e6-48ab-b2e5-f193ec606c57/signal.json","text":"repo_new · zai-org/ChatGLM3 · signal_desk=repos · occurred_at=2023-10-26T06:22:47+00:00 · url=https://github.com/zai-org/ChatGLM3 · stars=13679 · raw={\"repo\":\"zai-org/ChatGLM3\",\"description\":\"ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型\",\"language\":\"Python\"}"},{"ref":"E28","kind":"event","title":"zai-org/GLM-4.1V-9B-Base","date":"2025-06-28T14:23:46+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.1V-9B-Base","signal_url":"https://onlylabs.fyi/signals/d6905e2a-b787-4bc3-a827-1f9a08eb0b0c","signal_json_url":"https://onlylabs.fyi/signals/d6905e2a-b787-4bc3-a827-1f9a08eb0b0c/signal.json","text":"model_released · zai-org/GLM-4.1V-9B-Base · signal_desk=releases · occurred_at=2025-06-28T14:23:46+00:00 · url=https://huggingface.co/zai-org/GLM-4.1V-9B-Base · hf_downloads=2085 · hf_likes=66 · hf_params=10292777472 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E29","kind":"event","title":"zai-org/CogVideo","date":"2022-05-29T06:46:18+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogVideo","signal_url":"https://onlylabs.fyi/signals/2c95c9bb-86b9-4774-bb11-571611012cd9","signal_json_url":"https://onlylabs.fyi/signals/2c95c9bb-86b9-4774-bb11-571611012cd9/signal.json","text":"repo_new · zai-org/CogVideo · signal_desk=repos · occurred_at=2022-05-29T06:46:18+00:00 · url=https://github.com/zai-org/CogVideo · stars=12775 · raw={\"repo\":\"zai-org/CogVideo\",\"description\":\"text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)\",\"language\":\"Python\"}"},{"ref":"E30","kind":"event","title":"zai-org/GLM-4.5-Base","date":"2025-07-20T05:14:19+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.5-Base","signal_url":"https://onlylabs.fyi/signals/1220b8c8-3171-4bd4-8ed1-a67d5caad1f4","signal_json_url":"https://onlylabs.fyi/signals/1220b8c8-3171-4bd4-8ed1-a67d5caad1f4/signal.json","text":"model_released · zai-org/GLM-4.5-Base · signal_desk=releases · occurred_at=2025-07-20T05:14:19+00:00 · url=https://huggingface.co/zai-org/GLM-4.5-Base · hf_downloads=312 · hf_likes=60 · hf_params=358337791296 · pipeline=text-generation · license=mit"},{"ref":"E31","kind":"event","title":"zai-org/GLM-4.5-Air-Base","date":"2025-07-20T03:24:43+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/GLM-4.5-Air-Base","signal_url":"https://onlylabs.fyi/signals/486c9814-f670-4d32-88fc-243fd64d6bbd","signal_json_url":"https://onlylabs.fyi/signals/486c9814-f670-4d32-88fc-243fd64d6bbd/signal.json","text":"model_released · zai-org/GLM-4.5-Air-Base · signal_desk=releases · occurred_at=2025-07-20T03:24:43+00:00 · url=https://huggingface.co/zai-org/GLM-4.5-Air-Base · hf_downloads=1167 · hf_likes=46 · hf_params=110468824832 · pipeline=text-generation · license=mit"},{"ref":"E32","kind":"event","title":"zai-org/CodeGeeX","date":"2022-09-17T14:06:29+00:00","date_source":"source","source_url":"https://github.com/zai-org/CodeGeeX","signal_url":"https://onlylabs.fyi/signals/92b0330d-2543-4a51-ad0c-1b40108650f7","signal_json_url":"https://onlylabs.fyi/signals/92b0330d-2543-4a51-ad0c-1b40108650f7/signal.json","text":"repo_new · zai-org/CodeGeeX · signal_desk=repos · occurred_at=2022-09-17T14:06:29+00:00 · url=https://github.com/zai-org/CodeGeeX · stars=8791 · raw={\"repo\":\"zai-org/CodeGeeX\",\"description\":\"CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)\",\"language\":\"Python\"}"},{"ref":"E33","kind":"event","title":"zai-org/SCAIL-2","date":"2026-05-28T11:38:02+00:00","date_source":"source","source_url":"https://github.com/zai-org/SCAIL-2","signal_url":"https://onlylabs.fyi/signals/48759bf3-02e0-43d9-83c5-0fbb4f8168aa","signal_json_url":"https://onlylabs.fyi/signals/48759bf3-02e0-43d9-83c5-0fbb4f8168aa/signal.json","text":"repo_new · zai-org/SCAIL-2 · signal_desk=repos · occurred_at=2026-05-28T11:38:02+00:00 · url=https://github.com/zai-org/SCAIL-2 · stars=191 · raw={\"repo\":\"zai-org/SCAIL-2\",\"description\":\"Official Implementation of SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning\",\"language\":\"Python\"}"},{"ref":"E34","kind":"event","title":"zai-org/GLM-130B","date":"2022-08-03T20:21:58+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-130B","signal_url":"https://onlylabs.fyi/signals/43f65f4a-b053-46b0-a0e6-dc64ff400a6d","signal_json_url":"https://onlylabs.fyi/signals/43f65f4a-b053-46b0-a0e6-dc64ff400a6d/signal.json","text":"repo_new · zai-org/GLM-130B · signal_desk=repos · occurred_at=2022-08-03T20:21:58+00:00 · url=https://github.com/zai-org/GLM-130B · stars=7651 · raw={\"repo\":\"zai-org/GLM-130B\",\"description\":\"GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)\",\"language\":\"Python\"}"},{"ref":"E35","kind":"event","title":"zai-org/CodeGeeX2","date":"2023-07-23T18:26:53+00:00","date_source":"source","source_url":"https://github.com/zai-org/CodeGeeX2","signal_url":"https://onlylabs.fyi/signals/5fff96c1-bdc7-4b22-a759-2ef7afdcf0e4","signal_json_url":"https://onlylabs.fyi/signals/5fff96c1-bdc7-4b22-a759-2ef7afdcf0e4/signal.json","text":"repo_new · zai-org/CodeGeeX2 · signal_desk=repos · occurred_at=2023-07-23T18:26:53+00:00 · url=https://github.com/zai-org/CodeGeeX2 · stars=7552 · raw={\"repo\":\"zai-org/CodeGeeX2\",\"description\":\"CodeGeeX2: A More Powerful Multilingual Code Generation Model\",\"language\":\"Python\"}"},{"ref":"E36","kind":"event","title":"zai-org/GLM-4","date":"2024-05-15T05:17:08+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-4","signal_url":"https://onlylabs.fyi/signals/e27f1ae7-4209-4932-8a8c-262c50be3dc9","signal_json_url":"https://onlylabs.fyi/signals/e27f1ae7-4209-4932-8a8c-262c50be3dc9/signal.json","text":"repo_new · zai-org/GLM-4 · signal_desk=repos · occurred_at=2024-05-15T05:17:08+00:00 · url=https://github.com/zai-org/GLM-4 · stars=7064 · raw={\"repo\":\"zai-org/GLM-4\",\"description\":\"GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型\",\"language\":\"Python\"}"},{"ref":"E37","kind":"event","title":"zai-org/CogVLM","date":"2023-09-18T02:12:50+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogVLM","signal_url":"https://onlylabs.fyi/signals/78f58efd-8ac8-4996-aba3-723cfef3dbc7","signal_json_url":"https://onlylabs.fyi/signals/78f58efd-8ac8-4996-aba3-723cfef3dbc7/signal.json","text":"repo_new · zai-org/CogVLM · signal_desk=repos · occurred_at=2023-09-18T02:12:50+00:00 · url=https://github.com/zai-org/CogVLM · stars=6739 · raw={\"repo\":\"zai-org/CogVLM\",\"description\":\"a state-of-the-art-level open visual language model | 多模态预训练模型\",\"language\":\"Python\"}"},{"ref":"E38","kind":"event","title":"zai-org/Synapse","date":"2026-05-13T11:06:53+00:00","date_source":"source","source_url":"https://github.com/zai-org/Synapse","signal_url":"https://onlylabs.fyi/signals/d6c9e002-b775-427b-ad28-972e3ab76500","signal_json_url":"https://onlylabs.fyi/signals/d6c9e002-b775-427b-ad28-972e3ab76500/signal.json","text":"repo_new · zai-org/Synapse · signal_desk=repos · occurred_at=2026-05-13T11:06:53+00:00 · url=https://github.com/zai-org/Synapse · stars=46 · raw={\"repo\":\"zai-org/Synapse\",\"description\":\"Self-hosted AI workspace with shareable AI teammates, shared conversations, memory, and governed access to plugins, MCP tools, and local devices.\",\"language\":\"TypeScript\"}"},{"ref":"E39","kind":"event","title":"zai-org/UI2Code_N","date":"2025-11-11T11:18:32+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/UI2Code_N","signal_url":"https://onlylabs.fyi/signals/84f04b6d-f76c-406e-a02a-0a5932c824a7","signal_json_url":"https://onlylabs.fyi/signals/84f04b6d-f76c-406e-a02a-0a5932c824a7/signal.json","text":"model_released · zai-org/UI2Code_N · signal_desk=releases · occurred_at=2025-11-11T11:18:32+00:00 · url=https://huggingface.co/zai-org/UI2Code_N · hf_downloads=225 · hf_likes=23 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E40","kind":"event","title":"zai-org/ImageMining","date":"2026-04-28T10:33:43+00:00","date_source":"source","source_url":"https://github.com/zai-org/ImageMining","signal_url":"https://onlylabs.fyi/signals/1bc1e9fc-2c53-4c6f-8c88-f46d8ae7f504","signal_json_url":"https://onlylabs.fyi/signals/1bc1e9fc-2c53-4c6f-8c88-f46d8ae7f504/signal.json","text":"repo_new · zai-org/ImageMining · signal_desk=repos · occurred_at=2026-04-28T10:33:43+00:00 · url=https://github.com/zai-org/ImageMining · stars=10 · raw={\"repo\":\"zai-org/ImageMining\"}"},{"ref":"E41","kind":"event","title":"zai-org/GLM-4.5","date":"2025-07-20T09:24:09+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-4.5","signal_url":"https://onlylabs.fyi/signals/a4d678b4-e41d-48ac-9bd1-2bcbac3a88be","signal_json_url":"https://onlylabs.fyi/signals/a4d678b4-e41d-48ac-9bd1-2bcbac3a88be/signal.json","text":"repo_new · zai-org/GLM-4.5 · signal_desk=repos · occurred_at=2025-07-20T09:24:09+00:00 · url=https://github.com/zai-org/GLM-4.5 · stars=4353 · raw={\"repo\":\"zai-org/GLM-4.5\",\"description\":\"GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models\",\"language\":\"Python\"}"},{"ref":"E42","kind":"event","title":"zai-org/VisualGLM-6B","date":"2023-04-23T09:36:56+00:00","date_source":"source","source_url":"https://github.com/zai-org/VisualGLM-6B","signal_url":"https://onlylabs.fyi/signals/ffaab0c1-97c6-4bd7-9254-a16229c22eb5","signal_json_url":"https://onlylabs.fyi/signals/ffaab0c1-97c6-4bd7-9254-a16229c22eb5/signal.json","text":"repo_new · zai-org/VisualGLM-6B · signal_desk=repos · occurred_at=2023-04-23T09:36:56+00:00 · url=https://github.com/zai-org/VisualGLM-6B · stars=4159 · raw={\"repo\":\"zai-org/VisualGLM-6B\",\"description\":\"Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型\",\"language\":\"Python\"}"},{"ref":"E43","kind":"event","title":"zai-org/WebVIA-Agent","date":"2025-11-12T04:39:14+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/WebVIA-Agent","signal_url":"https://onlylabs.fyi/signals/8c77f8d3-0796-4cbd-9363-14cd0e1a7c64","signal_json_url":"https://onlylabs.fyi/signals/8c77f8d3-0796-4cbd-9363-14cd0e1a7c64/signal.json","text":"model_released · zai-org/WebVIA-Agent · signal_desk=releases · occurred_at=2025-11-12T04:39:14+00:00 · url=https://huggingface.co/zai-org/WebVIA-Agent · hf_downloads=92 · hf_likes=20 · hf_params=10292777472 · pipeline=image-text-to-text · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E44","kind":"event","title":"zai-org/Kaleido-14B-S2V","date":"2025-10-27T08:31:26+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/Kaleido-14B-S2V","signal_url":"https://onlylabs.fyi/signals/aacda9a0-d80c-4f4f-9e5d-48471039f783","signal_json_url":"https://onlylabs.fyi/signals/aacda9a0-d80c-4f4f-9e5d-48471039f783/signal.json","text":"model_released · zai-org/Kaleido-14B-S2V · signal_desk=releases · occurred_at=2025-10-27T08:31:26+00:00 · url=https://huggingface.co/zai-org/Kaleido-14B-S2V · hf_likes=19 · license=mit · raw={\"derived_reason\":\"first-party-finetune\"}"},{"ref":"E45","kind":"event","title":"zai-org/GLM-5","date":"2026-02-09T08:17:02+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-5","signal_url":"https://onlylabs.fyi/signals/24fe5750-0998-45fd-b227-3e58096c2732","signal_json_url":"https://onlylabs.fyi/signals/24fe5750-0998-45fd-b227-3e58096c2732/signal.json","text":"repo_new · zai-org/GLM-5 · signal_desk=repos · occurred_at=2026-02-09T08:17:02+00:00 · url=https://github.com/zai-org/GLM-5 · stars=3388 · raw={\"repo\":\"zai-org/GLM-5\",\"description\":\"GLM-5: From Vibe Coding to Agentic Engineering\"}"},{"ref":"E46","kind":"event","title":"zai-org/GLM-4-Voice","date":"2024-10-24T12:12:32+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-4-Voice","signal_url":"https://onlylabs.fyi/signals/178c8a76-8c99-4863-b696-07d577ade208","signal_json_url":"https://onlylabs.fyi/signals/178c8a76-8c99-4863-b696-07d577ade208/signal.json","text":"repo_new · zai-org/GLM-4-Voice · signal_desk=repos · occurred_at=2024-10-24T12:12:32+00:00 · url=https://github.com/zai-org/GLM-4-Voice · stars=3187 · raw={\"repo\":\"zai-org/GLM-4-Voice\",\"description\":\"GLM-4-Voice | 端到端中英语音对话模型\",\"language\":\"Python\"}"},{"ref":"E47","kind":"event","title":"zai-org/GLM-OCR v0.1.5","date":"2026-04-08T06:22:18+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-OCR/releases/tag/v0.1.5","signal_url":"https://onlylabs.fyi/signals/75e7ffd2-51ae-4e6d-a779-0b41c373bc86","signal_json_url":"https://onlylabs.fyi/signals/75e7ffd2-51ae-4e6d-a779-0b41c373bc86/signal.json","text":"release · zai-org/GLM-OCR v0.1.5 · signal_desk=releases · occurred_at=2026-04-08T06:22:18+00:00 · url=https://github.com/zai-org/GLM-OCR/releases/tag/v0.1.5 · raw={\"repo\":\"zai-org/GLM-OCR\"}"},{"ref":"E48","kind":"event","title":"zai-org/CodeGeeX4","date":"2024-07-03T11:01:55+00:00","date_source":"source","source_url":"https://github.com/zai-org/CodeGeeX4","signal_url":"https://onlylabs.fyi/signals/c219c2fe-67be-4e16-bd84-e016323a5a41","signal_json_url":"https://onlylabs.fyi/signals/c219c2fe-67be-4e16-bd84-e016323a5a41/signal.json","text":"repo_new · zai-org/CodeGeeX4 · signal_desk=repos · occurred_at=2024-07-03T11:01:55+00:00 · url=https://github.com/zai-org/CodeGeeX4 · stars=2531 · raw={\"repo\":\"zai-org/CodeGeeX4\",\"description\":\"CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.\",\"language\":\"Python\"}"},{"ref":"E49","kind":"event","title":"zai-org/CogVLM2","date":"2024-05-10T09:07:11+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogVLM2","signal_url":"https://onlylabs.fyi/signals/06e5d19c-c441-429a-a1d5-1d1ed89fd029","signal_json_url":"https://onlylabs.fyi/signals/06e5d19c-c441-429a-a1d5-1d1ed89fd029/signal.json","text":"repo_new · zai-org/CogVLM2 · signal_desk=repos · occurred_at=2024-05-10T09:07:11+00:00 · url=https://github.com/zai-org/CogVLM2 · stars=2438 · raw={\"repo\":\"zai-org/CogVLM2\",\"description\":\"GPT4V-level open-source multi-modal model based on Llama3-8B\",\"language\":\"Python\"}"},{"ref":"E50","kind":"event","title":"zai-org/SSVAE","date":"2025-12-04T06:22:58+00:00","date_source":"source","source_url":"https://huggingface.co/zai-org/SSVAE","signal_url":"https://onlylabs.fyi/signals/90530351-c08d-4a44-a0dc-083dff68f9a0","signal_json_url":"https://onlylabs.fyi/signals/90530351-c08d-4a44-a0dc-083dff68f9a0/signal.json","text":"model_released · zai-org/SSVAE · signal_desk=releases · occurred_at=2025-12-04T06:22:58+00:00 · url=https://huggingface.co/zai-org/SSVAE · hf_likes=12 · license=mit"},{"ref":"E51","kind":"event","title":"zai-org/GLM-V","date":"2025-06-28T08:44:06+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-V","signal_url":"https://onlylabs.fyi/signals/fd80c955-0d17-41f4-8cd0-3dffb54f8776","signal_json_url":"https://onlylabs.fyi/signals/fd80c955-0d17-41f4-8cd0-3dffb54f8776/signal.json","text":"repo_new · zai-org/GLM-V · signal_desk=repos · occurred_at=2025-06-28T08:44:06+00:00 · url=https://github.com/zai-org/GLM-V · stars=2327 · raw={\"repo\":\"zai-org/GLM-V\",\"description\":\"GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning\",\"language\":\"Python\"}"},{"ref":"E52","kind":"event","title":"zai-org/GLM-skills","date":"2026-03-30T08:34:04+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-skills","signal_url":"https://onlylabs.fyi/signals/0530d1b0-8293-4e6b-9f76-f1a059b43c41","signal_json_url":"https://onlylabs.fyi/signals/0530d1b0-8293-4e6b-9f76-f1a059b43c41/signal.json","text":"repo_new · zai-org/GLM-skills · signal_desk=repos · occurred_at=2026-03-30T08:34:04+00:00 · url=https://github.com/zai-org/GLM-skills · stars=410 · raw={\"repo\":\"zai-org/GLM-skills\",\"description\":\"Official skills for the GLM family of models.\",\"language\":\"Python\"}"},{"ref":"E53","kind":"event","title":"zai-org/CogView","date":"2021-05-25T14:48:31+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogView","signal_url":"https://onlylabs.fyi/signals/83495e63-0f09-4704-8ceb-93045d729ca6","signal_json_url":"https://onlylabs.fyi/signals/83495e63-0f09-4704-8ceb-93045d729ca6/signal.json","text":"repo_new · zai-org/CogView · signal_desk=repos · occurred_at=2021-05-25T14:48:31+00:00 · url=https://github.com/zai-org/CogView · stars=1797 · raw={\"repo\":\"zai-org/CogView\",\"description\":\"Text-to-Image generation. The repo for NeurIPS 2021 paper \\\"CogView: Mastering Text-to-Image Generation via Transformers\\\".\",\"language\":\"Python\"}"},{"ref":"E54","kind":"event","title":"zai-org/ImageReward","date":"2023-04-01T09:04:17+00:00","date_source":"source","source_url":"https://github.com/zai-org/ImageReward","signal_url":"https://onlylabs.fyi/signals/9d46b90f-3ef6-427c-a54c-a1877590beb7","signal_json_url":"https://onlylabs.fyi/signals/9d46b90f-3ef6-427c-a54c-a1877590beb7/signal.json","text":"repo_new · zai-org/ImageReward · signal_desk=repos · occurred_at=2023-04-01T09:04:17+00:00 · url=https://github.com/zai-org/ImageReward · stars=1681 · data_radar_lanes=Evals and quality · data_radar_terms=eval · data_radar_reason=Zhipu AI (GLM) has a repo signal matching evals and quality. · raw={\"repo\":\"zai-org/ImageReward\",\"description\":\"[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation\",\"language\":\"Python\"}"},{"ref":"E55","kind":"event","title":"zai-org/Vision2Web","date":"2026-03-29T15:31:33+00:00","date_source":"source","source_url":"https://github.com/zai-org/Vision2Web","signal_url":"https://onlylabs.fyi/signals/7d1f43fb-6ff0-47a5-86c4-7222cb22ad17","signal_json_url":"https://onlylabs.fyi/signals/7d1f43fb-6ff0-47a5-86c4-7222cb22ad17/signal.json","text":"repo_new · zai-org/Vision2Web · signal_desk=repos · occurred_at=2026-03-29T15:31:33+00:00 · url=https://github.com/zai-org/Vision2Web · stars=48 · raw={\"repo\":\"zai-org/Vision2Web\",\"language\":\"Python\"}"},{"ref":"E56","kind":"event","title":"zai-org/CogAgent","date":"2023-11-28T09:28:08+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogAgent","signal_url":"https://onlylabs.fyi/signals/ebdb2c69-1053-4be0-a0a7-121965b47fe0","signal_json_url":"https://onlylabs.fyi/signals/ebdb2c69-1053-4be0-a0a7-121965b47fe0/signal.json","text":"repo_new · zai-org/CogAgent · signal_desk=repos · occurred_at=2023-11-28T09:28:08+00:00 · url=https://github.com/zai-org/CogAgent · stars=1183 · raw={\"repo\":\"zai-org/CogAgent\",\"description\":\"An open-sourced end-to-end VLM-based GUI Agent\",\"language\":\"Python\"}"},{"ref":"E57","kind":"event","title":"zai-org/CogView4","date":"2024-09-23T06:38:31+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogView4","signal_url":"https://onlylabs.fyi/signals/e52eab96-5dec-4ddb-af70-f13e518425c4","signal_json_url":"https://onlylabs.fyi/signals/e52eab96-5dec-4ddb-af70-f13e518425c4/signal.json","text":"repo_new · zai-org/CogView4 · signal_desk=repos · occurred_at=2024-09-23T06:38:31+00:00 · url=https://github.com/zai-org/CogView4 · stars=1103 · raw={\"repo\":\"zai-org/CogView4\",\"description\":\"CogView4, CogView3-Plus and CogView3(ECCV 2024)\",\"language\":\"Python\"}"},{"ref":"E58","kind":"event","title":"zai-org/GLM-TTS","date":"2025-12-06T04:50:56+00:00","date_source":"source","source_url":"https://github.com/zai-org/GLM-TTS","signal_url":"https://onlylabs.fyi/signals/7589a3bf-b65a-4ac9-bd26-c296fa2d0d09","signal_json_url":"https://onlylabs.fyi/signals/7589a3bf-b65a-4ac9-bd26-c296fa2d0d09/signal.json","text":"repo_new · zai-org/GLM-TTS · signal_desk=repos · occurred_at=2025-12-06T04:50:56+00:00 · url=https://github.com/zai-org/GLM-TTS · stars=1021 · raw={\"repo\":\"zai-org/GLM-TTS\",\"description\":\"GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning\",\"language\":\"Python\"}"},{"ref":"E59","kind":"event","title":"zai-org/SCAIL","date":"2025-12-01T06:42:54+00:00","date_source":"source","source_url":"https://github.com/zai-org/SCAIL","signal_url":"https://onlylabs.fyi/signals/1609b84d-bfd5-4243-909a-5b3ffc6f0f3c","signal_json_url":"https://onlylabs.fyi/signals/1609b84d-bfd5-4243-909a-5b3ffc6f0f3c/signal.json","text":"repo_new · zai-org/SCAIL · signal_desk=repos · occurred_at=2025-12-01T06:42:54+00:00 · url=https://github.com/zai-org/SCAIL · stars=986 · raw={\"repo\":\"zai-org/SCAIL\",\"description\":\"SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)\",\"language\":\"Python\"}"},{"ref":"E60","kind":"event","title":"zai-org/CogView2","date":"2022-04-25T10:15:47+00:00","date_source":"source","source_url":"https://github.com/zai-org/CogView2","signal_url":"https://onlylabs.fyi/signals/65508c76-4a5f-4d64-8c97-219b2d917847","signal_json_url":"https://onlylabs.fyi/signals/65508c76-4a5f-4d64-8c97-219b2d917847/signal.json","text":"repo_new · zai-org/CogView2 · signal_desk=repos · occurred_at=2022-04-25T10:15:47+00:00 · url=https://github.com/zai-org/CogView2 · stars=955 · raw={\"repo\":\"zai-org/CogView2\",\"description\":\"official code repo for paper \\\"CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers\\\"\",\"language\":\"Python\"}"}]}