WritingInclusionAI (Ant Group)InclusionAI (Ant Group)published Apr 1, 2025seen 5d

Agentic Learning

Open original ↗

Captured source

source ↗
published Apr 1, 2025seen 5dcaptured 3dhttp 200method plain

Agentic Learning | INCLUSION AI

Skip to main content Introduction ​

Agent exhibits powerful capabilities by interacting with the external environment and making decisions based on the feedback it receives from the environment. For complex problems, it is often necessary for an agent to have multi-turn interactions with the environment to reach a solution. The complexity and dynamism of environments, coupled with the necessity for multi-turn interactions, pose numerous challenges in training agents.

We introduce AgenticLearning , an open-source agent training paradigm designed to empower researchers to train and evaluate autonomous agents effectively. AgenticLearning offers a framework for multi-turn interactions with the environment, enabling models to learn how to interact with the environment and make decisions based on its feedback, thereby enhancing the models' ability to leverage the environment to solve complex problems.

Advancements Models Tools Environment Training Framework RAG-R1 Qwen2.5-7b-instruct offline retrieval online search AWorld LLaMA-Factory verl AReaL FunReason Qwen2.5-7b-Coder-instruct BFCL AWorld LLaMA-Factory verl

News ​

[2025/07/01] 🔥🔥🔥 RAG-R1 We propose RAG-R1 , a deepsearch training framework that incentivizing the search and reasoning capabilities of LLMs through multi-query parallelism.

[2025/05/16] 🔥🔥🔥 FunReason We propose FunReason , a novel framework that enhances LLMs' function calling capabilities through an automated data refinement strategy and a Self-Refinement Multiscale Loss approach.

Advancements ​

Deepsearch ​

RAG-R1 ​

Tools: Search Engines (offline or online )

LLM: Qwen2.5-7b-instruct

Overall framework of RAG-R1.

Performance comparisons on QA benchmarks under the EM metric. The best and second best results are bold and underlined, respectively.

FunctionCall ​

FunReason ​

Tools: Real Human Function calling (BFCLv2 live&non-live)

LLM: Qwen2.5-7b-Coder-instruct

FunReason is a framework designed to enhance LLMs' function calling capabilities, achieving GPT-4o-comparable performance on BFCL, surpassing RL-based methods, mitigating catastrophic forgetting on HumanEval and MBPP, and using a data refinement strategy where natural CoT data outperforms artificial ones.

Data refinement pipline of FunReason.

Overview of FunReason's data refinement pipeline. The pipeline consists of five stages: Function Call Classification, Query and Tool Identification, CoT Identification, Function and Parameter Identification, and Format Identification. Each stage ensures specific aspects of data quality, with failing examples either being discarded or regenerated.

Performance of FunReason.

Citation ​

Please cite our repo if our works are helpful for your research.

@article{RAG-R1, title={RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism}, author={Zhiwen Tan and Jiaming Huang and Qintong Wu and Hongxuan Zhang and Chenyi Zhuang and Jinjie Gu}, journal={arXiv preprint arXiv:2507.02962}, year={2025} }

@article{FunReason, title={FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement}, author={Bingguang Hao, Maolin Wang, Zengzhuang Xu, Cunyin Peng, Yicheng Chen, Xiangyu Zhao, Jinjie Gu, Chenyi Zhuang}, journal={arXiv preprint arXiv:2505.20192}, year={2025} }

Contact ​

For any question or feedback, please reach out to us at ender.tzw@antgroup.com or chenyi.zcy@antgroup.com

License ​

This project is licensed under the MIT License - see the LICENSE file for details.

Introduction News Advancements Deepsearch FunctionCall Citation

Contact License

Notability

notability 5.0/10

Blog post on agentic learning, no traction data