What does this repo signal mean?

InclusionAI (Ant Group) published inclusionAI/AudioMCQ (Python). Repository signals like this one expose tooling, evaluation, infrastructure, or model-adjacent work before it may appear in a launch post. High-signal details: repo inclusionAI/AudioMCQ · language Python · new repo with modest stars, not notable. onlylabs links this event to 1 captured evidence page and 6 related repo signals.

Captured source

GitHub: github.com/inclusionAI/AudioMCQ

inclusionAI/AudioMCQ repository metadata

published Nov 14, 2025 · seen Jun 5 · captured Jun 11 · http 200 · method plain

inclusionAI/AudioMCQ

Description: [ICLR 2026] AudioMCQ: A 571k audio multiple-choice question dataset for post-training Large Audio Language Models with dual CoT annotations and audio-contribution filtering. 🏆 1st place in DCASE 2025 Challenge.

Language: Python

License: Apache-2.0

Stars: 51

Forks: 4

Open issues: 1

Created: 2025-11-14T08:06:09Z

Pushed: 2026-04-21T15:34:43Z

Default branch: main

Fork: no

Archived: no

README:

[ICLR 2026] AudioMCQ: Audio Multiple-Choice Question Dataset

Official repository for the paper "Measuring Audio's Impact on Correctness: Audio-Contribution-Aware Post-Training of Large Audio Language Models"

News

[2026.04] Update on MMSU Metric of released models: Based on community feedback, we identified a flaw in our evaluation script that artificially inflated the MMSU scores of our released models by ignoring sequence order. We sincerely apologize for this oversight. Crucially, please note that our AudioMCQ training data, the paper's conclusions regarding audio-contribution, and the MMAR/MMAU metrics remain completely unaffected. When comparing against our work, we recommend reporting the MMAR/MMAU results or re-evaluating our published checkpoints using your own exact-match algorithm. We deeply apologize for any inconvenience this may have caused to the research community.
[2026.03] 🔥 We released [AudioMCQ-StrongAC-GeminiCoT](https://huggingface.co/datasets/Harland/AudioMCQ-StrongAC-GeminiCoT), a highly curated subset featuring native CoT reasoning from Gemini 3.1 Pro! It proudly serves as the official training set for [DCASE 2026 Challenge Task 5](https://dcase.community/challenge2026/index#task5).
[2026.02] Paper accepted by ICLR 2026.
[2025.09] Paper published on arXiv.
[2025.09] AudioMCQ dataset released with 571k samples!
[2025.07] We achieved [1st place](https://dcase.community/challenge2025/task-audio-question-answering-results) in the DCASE 2025 Audio-Question-Answering challenge by using AudioMCQ!
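
The [2026.04] note above recommends re-evaluating the released checkpoints with an exact-match algorithm, because the original script ignored sequence order. The README does not show that script, so the following is only a minimal sketch of the general failure mode: a hypothetical order-insensitive matcher (`order_insensitive_match`) accepts a prediction whose words are merely shuffled, while a strict matcher (`exact_match`) rejects it. All function names and the sample pairs are illustrative assumptions, not the authors' code or MMSU data.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> bool:
    """Strict comparison: the same tokens in the same order."""
    return normalize(prediction) == normalize(reference)


def order_insensitive_match(prediction: str, reference: str) -> bool:
    """Hypothetical flawed comparison: the same tokens in any order."""
    pred_tokens = sorted(normalize(prediction).split())
    ref_tokens = sorted(normalize(reference).split())
    return pred_tokens == ref_tokens


def accuracy(pairs, matcher) -> float:
    """Fraction of (prediction, reference) pairs accepted by the matcher."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    return sum(matcher(pred, ref) for pred, ref in pairs) / len(pairs)


# Made-up examples: the second prediction reverses the order of events.
pairs = [
    ("the dog barks before the bell", "The dog barks before the bell."),
    ("the bell before the dog barks", "The dog barks before the bell."),
]

strict_score = accuracy(pairs, exact_match)
loose_score = accuracy(pairs, order_insensitive_match)
print(f"exact match: {strict_score:.2f}, order-insensitive: {loose_score:.2f}")
```

On these two made-up pairs the strict matcher scores lower than the order-insensitive one, which is the kind of inflation the note describes.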

Quick Links

Dataset: https://huggingface.co/datasets/inclusionAI/AudioMCQ
Paper: https://arxiv.org/abs/2509.21060
DCASE 2025 Challenge 1st Place Results: https://dcase.community/challenge2025/task-audio-question-answering-results

Overview

AudioMCQ is an audio multiple-choice question dataset with 571k samples, designed for post-training Large Audio Language Models (LALMs). The dataset features dual chain-of-thought annotations and audio-contribution filtering, and models post-trained on it achieve state-of-the-art results on audio understanding tasks.

Key Highlights

571k high-quality samples across sound, music, speech, and temporal domains
Dual CoT annotations: Structured and unstructured reasoning paths
Audio-Contribution filtering: Weak (54.8%) and strong (45.2%) splits
Pre-trained models available: Weak-to-Strong and Mixed-to-Strong paradigms
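
As a rough check on the split percentages above, the sketch below converts them into approximate sample counts. It assumes the rounded total of 571,000 implied by "571k"; the exact sample count is not stated here, so the results are estimates only. The names `TOTAL_SAMPLES`, `SPLIT_SHARES`, and `approximate_split_sizes` are illustrative.

```python
# "571k" from the README, taken as a rounded total (assumption).
TOTAL_SAMPLES = 571_000

# Audio-contribution split shares as listed in the highlights.
SPLIT_SHARES = {"weak": 0.548, "strong": 0.452}


def approximate_split_sizes(total: int, shares: dict) -> dict:
    """Return the estimated number of samples per split, rounded to integers."""
    if abs(sum(shares.values()) - 1.0) > 1e-9:
        raise ValueError("split shares must sum to 1")
    return {name: round(total * share) for name, share in shares.items()}


sizes = approximate_split_sizes(TOTAL_SAMPLES, SPLIT_SHARES)
for name, count in sizes.items():
    print(f"{name}: about {count:,} samples")
```

Under that assumption the weak split holds roughly 313k samples and the strong split roughly 258k.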

Dataset Access

For complete dataset information, statistics, data format, and download instructions, please visit:

Hugging Face Dataset Repository: https://huggingface.co/datasets/inclusionAI/AudioMCQ

The Hugging Face repository contains:

Full dataset documentation
Detailed statistics and examples
Data format specifications
Download links for audio files
Usage instructions
Model checkpoints

Model Checkpoints

We provide trained model checkpoints for two post-training paradigms: Weak-to-Strong and Mixed-to-Strong.

Training Scripts

All training code used for this project can be found in the /training_scripts directory.

Contact

Haolin He: [harlandzzc@link.cuhk.edu.hk](mailto:harlandzzc@link.cuhk.edu.hk)

Citation

If you find AudioMCQ useful in your research, please cite:

@article{he2025measuring,
  title={Measuring Audio's Impact on Correctness: Audio-Contribution-Aware Post-Training of Large Audio Language Models},
  author={He, Haolin and Du, Xingjian and Sun, Renhe and Dai, Zheqi and Xiao, Yujia and Yang, Mingru and Zhou, Jiayi and Li, Xiquan and Liu, Zhengxi and Liang, Zining and others},
  journal={arXiv preprint arXiv:2509.21060},
  year={2025}
}

Acknowledgements

We thank the organizers of DCASE 2025 and the research community for their valuable feedback and support.

Notability

notability 4.0/10

New repo with modest stars, not notable