A curated companion repository for AI4AIR: A Comprehensive Survey on Large Language Models for AI Research, focusing on how LLMs support data engineering, model design and optimization, model evaluation, and closed-loop AI research automation.
- [2026/06/01] Initial framework released with the AI4AIR survey PDF, bilingual README, and project page.
- Coming soon: Curated paper entries, topic tags, and resource metadata.
- Overview
- Abstract
- Taxonomy of AI4AIR
- Resources
- How to Contribute
- Citation
- Star History
- Contact
- Maintainers
Language-mediated automation is beginning to complement the human-centered trial-and-error process in AI research. Among current AI tools, large language models (LLMs) have become a central interface for generation, knowledge synthesis, and reasoning in research workflows. While LLMs are now widely used to support general scientific workflows such as literature review and scientific writing, their specific roles and deeper contributions to the core lifecycle of AI research itself remain insufficiently explored in a systematic manner. To bridge this gap, this survey introduces AI4AIR (short for AI for AI Research), which comprehensively reviews LLMs as pivotal components within machine learning research pipelines. We construct a structured two-dimensional taxonomy. One axis spans major research domains including natural language processing, computer vision, data mining, and general machine learning. The other follows the research pipeline stages, encompassing data engineering, model design and optimization, model evaluation, and the cross-stage closed-loop automation that connects them. Within this framework, we identify five recurring roles of LLMs, namely annotator, synthesizer, optimizer, evaluator, and orchestrator, through which LLMs contribute to AI research workflows. We further discuss bottlenecks such as contamination, hallucination, and reliability under feedback-driven use, and outline future directions for improving both the efficiency and the reliability of AI research and discovery.
Fig. 2. A two-dimensional taxonomy of AI4AIR. The domain-oriented view summarizes representative AI sub-domains and tasks, while the pipeline-oriented view maps recurring LLM roles to ML-centered pipeline stages.
- Survey PDF: AI4AIR: A Comprehensive Survey on Large Language Models for AI Research
- Project Page: Awesome LLMs for AI Research
Contributions are welcome. When suggesting a paper or resource, please include:
- Paper title and link
- Code, project page, or dataset link if available
- Suggested category or topic tag
- A short reason for inclusion
Issues and pull requests are also welcome for missing references, metadata corrections, and project-page improvements.
If this survey or repository is useful for your research, please cite the survey. The formal citation will be updated after the archival version is available.
@article{ao2026ai4air,
title = {AI4AIR: A Comprehensive Survey on Large Language Models for AI Research},
author = {Ao, Xiang and Lian, Junhong and Li, Hanyang and Wang, Siyi and Qiao, Yiran and Qiao, Yi and Xu, Jiaqi and He, Qing and Cheng, Xueqi},
year = {2026},
note = {Preprint, under review}
}If you have any questions or suggestions, please contact us via:
- Email: Xiang Ao (aoxiang@ict.ac.cn)
- 🐛 Issues: GitHub Issues
- Junhong Lian@T-Atlas
⭐ If this project helps you, please give us a Star!
Made with ❤️ by FinD Lab @ ICT, CAS

