What does this writing signal mean?

InclusionAI (Ant Group) published Agentic Learning. This writing signal gives public context for research themes, product direction, policy, or launch framing. High-signal details: blog post on agentic learning, no traction data. The captured page opens: "Agent exhibits powerful capabilities by interacting with the external environment and making...." onlylabs links this event to 1 captured evidence page and 6 related writing signals.

InclusionAI (Ant Group) Writing: Agentic Learning

Captured source

source ↗

inclusion-ai.org/inclusion-ai.org/blog/agenticlearning

Agentic Learning

published Apr 1, 2025 · seen 5d · captured 3d · http 200 · method plain

Introduction

An agent exhibits powerful capabilities by interacting with an external environment and making decisions based on the feedback it receives. For complex problems, the agent often needs multi-turn interactions with the environment to reach a solution. The complexity and dynamism of environments, coupled with the need for multi-turn interaction, pose numerous challenges in training agents.

We introduce AgenticLearning, an open-source agent training paradigm designed to help researchers train and evaluate autonomous agents effectively. AgenticLearning offers a framework for multi-turn interaction with the environment, so that models learn how to interact with it and make decisions based on its feedback. This improves the models' ability to use the environment to solve complex problems.
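The multi-turn interaction described above can be sketched as a simple rollout loop. This is a minimal illustration under stated assumptions, not AgenticLearning's actual API: `GuessNumberEnv`, `binary_search_policy`, and `rollout` are hypothetical names, and a hand-written policy stands in for the model.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class GuessNumberEnv:
    """Toy environment (hypothetical): the agent must find a hidden integer."""
    target: int
    max_turns: int = 10
    turns: int = 0

    def step(self, action: int) -> Tuple[str, bool]:
        """Apply one agent action and return (feedback, done)."""
        self.turns += 1
        if action == self.target:
            return "correct", True
        feedback = "higher" if action < self.target else "lower"
        return feedback, self.turns >= self.max_turns


def binary_search_policy(history: List[Tuple[int, str]], low: int, high: int) -> int:
    """Stand-in for a model: narrows the search range using past feedback."""
    for guess, feedback in history:
        if feedback == "higher":
            low = max(low, guess + 1)
        elif feedback == "lower":
            high = min(high, guess - 1)
    return (low + high) // 2


def rollout(env: GuessNumberEnv, low: int = 0, high: int = 100) -> List[Tuple[int, str]]:
    """Run one multi-turn episode and return the (action, feedback) trajectory."""
    history: List[Tuple[int, str]] = []
    done = False
    while not done:
        action = binary_search_policy(history, low, high)
        feedback, done = env.step(action)
        history.append((action, feedback))
    return history


trajectory = rollout(GuessNumberEnv(target=37))
```

In a training setting, a trajectory like this one would be what gets scored and fed back to the learner; how AgenticLearning does that is not described in this post.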

Advancements | Models | Tools | Environment | Training Framework
RAG-R1 | Qwen2.5-7b-instruct | offline retrieval, online search | AWorld | LLaMA-Factory, verl, AReaL
FunReason | Qwen2.5-7b-Coder-instruct | BFCL | AWorld | LLaMA-Factory, verl

News

[2025/07/01] 🔥🔥🔥 RAG-R1: We propose RAG-R1, a deepsearch training framework that incentivizes the search and reasoning capabilities of LLMs through multi-query parallelism.

[2025/05/16] 🔥🔥🔥 FunReason: We propose FunReason, a novel framework that enhances LLMs' function calling capabilities through an automated data refinement strategy and a Self-Refinement Multiscale Loss approach.

Advancements

Deepsearch

RAG-R1

Tools: Search Engines (offline or online)

LLM: Qwen2.5-7b-instruct

Overall framework of RAG-R1.
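The post credits RAG-R1's gains to multi-query parallelism but does not show how it works. One plausible reading is that the model emits several search queries in a single turn and the retriever serves them concurrently. The sketch below illustrates only that idea: the in-memory `CORPUS`, the word-overlap `search` function, and `parallel_search` are all hypothetical and are not taken from the RAG-R1 code.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List

# Hypothetical toy corpus standing in for an offline or online search index.
CORPUS = {
    "doc1": "Paris is the capital of France.",
    "doc2": "Berlin is the capital of Germany.",
    "doc3": "The Seine flows through Paris.",
}


def search(query: str, top_k: int = 2) -> List[str]:
    """Toy retriever: rank documents by the number of words shared with the query."""
    terms = set(query.lower().split())
    scored = []
    for doc_id, text in CORPUS.items():
        words = set(text.lower().rstrip(".").split())
        overlap = len(terms & words)
        if overlap:
            scored.append((overlap, doc_id))
    # Highest overlap first; ties broken by document id for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:top_k]]


def parallel_search(queries: List[str]) -> Dict[str, List[str]]:
    """Issue all queries from one reasoning turn concurrently and collect the results."""
    with ThreadPoolExecutor(max_workers=max(1, len(queries))) as pool:
        results = list(pool.map(search, queries))
    return dict(zip(queries, results))


hits = parallel_search(["capital of France", "capital of Germany"])
```

Compared with issuing one query per turn, batching queries this way reduces the number of retrieval round trips needed for a multi-hop question.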

Performance comparisons on QA benchmarks under the EM metric. The best and second best results are bold and underlined, respectively.
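For readers unfamiliar with the EM (exact match) metric, here is a minimal sketch. It assumes the common SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace). The post does not state which normalization RAG-R1 uses, so treat the details as an assumption.

```python
import re
import string
from typing import List


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and articles, and collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: List[str]) -> float:
    """Return 1.0 if the normalized prediction equals any normalized gold answer."""
    pred = normalize_answer(prediction)
    return float(any(pred == normalize_answer(gold) for gold in gold_answers))


def em_score(predictions: List[str], references: List[List[str]]) -> float:
    """Average exact match over a dataset, as a fraction in [0, 1]."""
    if not predictions:
        return 0.0
    total = sum(exact_match(p, g) for p, g in zip(predictions, references))
    return total / len(predictions)
```

EM is strict: a prediction that contains the gold answer plus extra words scores zero.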

FunctionCall

FunReason

Tools: Real Human Function calling (BFCLv2 live & non-live)

LLM: Qwen2.5-7b-Coder-instruct

FunReason is a framework designed to enhance LLMs' function calling capabilities. It achieves GPT-4o-comparable performance on BFCL, surpasses RL-based methods, and mitigates catastrophic forgetting on HumanEval and MBPP. Its data refinement strategy also shows that natural CoT data outperforms artificially constructed CoT data.

Data refinement pipeline of FunReason.

Overview of FunReason's data refinement pipeline. The pipeline consists of five stages: Function Call Classification, Query and Tool Identification, CoT Identification, Function and Parameter Identification, and Format Identification. Each stage ensures specific aspects of data quality, with failing examples either being discarded or regenerated.
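The five-stage flow above, with failing examples either discarded or regenerated, can be sketched as a chain of checks. Only the stage names come from the post. The check logic, the example fields (`kind`, `query`, `tools`, `cot`, `function`, `answer`), and the `regenerate` callback are hypothetical placeholders, since the post does not say what each stage actually tests.

```python
from typing import Callable, Dict, List, Optional, Tuple

Example = Dict[str, object]
Check = Callable[[Example], bool]

# Stage names follow the post; every check below is an illustrative placeholder.
STAGES: List[Tuple[str, Check]] = [
    ("Function Call Classification", lambda ex: ex.get("kind") in {"call", "no_call"}),
    ("Query and Tool Identification", lambda ex: bool(ex.get("query")) and bool(ex.get("tools"))),
    ("CoT Identification", lambda ex: bool(ex.get("cot"))),
    ("Function and Parameter Identification", lambda ex: ex.get("function") in ex.get("tools", [])),
    ("Format Identification", lambda ex: str(ex.get("answer", "")).startswith("{")
        and str(ex.get("answer", "")).endswith("}")),
]


def first_failure(example: Example) -> Optional[str]:
    """Return the name of the first stage the example fails, or None if it passes all."""
    for name, check in STAGES:
        if not check(example):
            return name
    return None


def refine(
    examples: List[Example],
    regenerate: Callable[[Example, str], Optional[Example]],
    max_retries: int = 1,
) -> List[Example]:
    """Keep examples that pass every stage; regenerate failures up to max_retries, else discard."""
    kept: List[Example] = []
    for example in examples:
        candidate: Optional[Example] = example
        for attempt in range(max_retries + 1):
            failed = first_failure(candidate)
            if failed is None:
                kept.append(candidate)
                break
            if attempt == max_retries:
                break  # out of retries: discard
            candidate = regenerate(candidate, failed)
            if candidate is None:
                break  # regeneration not possible: discard
    return kept
```

Running the stages in order means later, more specific checks only see examples that already passed the earlier ones.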

Performance of FunReason.

Citation

Please cite our repo if our work is helpful for your research.

@article{RAG-R1,
  title={RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism},
  author={Zhiwen Tan and Jiaming Huang and Qintong Wu and Hongxuan Zhang and Chenyi Zhuang and Jinjie Gu},
  journal={arXiv preprint arXiv:2507.02962},
  year={2025}
}

@article{FunReason,
  title={FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement},
  author={Bingguang Hao and Maolin Wang and Zengzhuang Xu and Cunyin Peng and Yicheng Chen and Xiangyu Zhao and Jinjie Gu and Chenyi Zhuang},
  journal={arXiv preprint arXiv:2505.20192},
  year={2025}
}

Contact

For any questions or feedback, please reach out to us at ender.tzw@antgroup.com or chenyi.zcy@antgroup.com.

License

This project is licensed under the MIT License - see the LICENSE file for details.


Notability

notability 5.0/10

Blog post on agentic learning, no traction data