ICT-FinD-Lab/Awesome-LLMs-for-AI-Research

Awesome LLMs for AI Research 🤗

Chinese | Website | Paper | GitHub

AI4AIR Figure

🎯 Overview

A curated companion repository for AI4AIR: A Comprehensive Survey on Large Language Models for AI Research, focusing on how LLMs support data engineering, model design and optimization, model evaluation, and closed-loop AI research automation.

🔥 Update Records

  • [2026/06/01] Initial framework released with the AI4AIR survey PDF, bilingual README, and project page.
  • Coming soon: Curated paper entries, topic tags, and resource metadata.

📄 Abstract

Language-mediated automation is beginning to complement the human-centered trial-and-error process in AI research. Among current AI tools, large language models (LLMs) have become a central interface for generation, knowledge synthesis, and reasoning in research workflows. While LLMs are now widely used to support general scientific workflows such as literature review and scientific writing, their specific roles and deeper contributions to the core lifecycle of AI research itself remain insufficiently explored in a systematic manner. To bridge this gap, this survey introduces AI4AIR (short for AI for AI Research), which comprehensively reviews LLMs as pivotal components within machine learning research pipelines. We construct a structured two-dimensional taxonomy. One axis spans major research domains including natural language processing, computer vision, data mining, and general machine learning. The other follows the research pipeline stages, encompassing data engineering, model design and optimization, model evaluation, and the cross-stage closed-loop automation that connects them. Within this framework, we identify five recurring roles of LLMs, namely annotator, synthesizer, optimizer, evaluator, and orchestrator, through which LLMs contribute to AI research workflows. We further discuss bottlenecks such as contamination, hallucination, and reliability under feedback-driven use, and outline future directions for improving both the efficiency and the reliability of AI research and discovery.

🧭 Taxonomy of AI4AIR

Fig. 2. A two-dimensional taxonomy of AI4AIR. The domain-oriented view summarizes representative AI sub-domains and tasks, while the pipeline-oriented view maps recurring LLM roles to ML-centered pipeline stages.
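
As a rough illustration of how the two axes and five roles named in the abstract could be used to tag entries, here is a minimal Python sketch. The class names (`Domain`, `Stage`, `Role`, `TaxonomyTag`), the helper `group_by_cell`, and the placeholder paper titles are all hypothetical and are not part of this repository. The tags on the placeholders are arbitrary and do not reflect the role-to-stage mapping in the figure.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Domain(Enum):
    """Research domains listed in the abstract."""
    NLP = "natural language processing"
    CV = "computer vision"
    DATA_MINING = "data mining"
    GENERAL_ML = "general machine learning"


class Stage(Enum):
    """Pipeline stages listed in the abstract."""
    DATA_ENGINEERING = "data engineering"
    MODEL_DESIGN_OPTIMIZATION = "model design and optimization"
    MODEL_EVALUATION = "model evaluation"
    CLOSED_LOOP_AUTOMATION = "closed-loop automation"


class Role(Enum):
    """The five recurring LLM roles listed in the abstract."""
    ANNOTATOR = "annotator"
    SYNTHESIZER = "synthesizer"
    OPTIMIZER = "optimizer"
    EVALUATOR = "evaluator"
    ORCHESTRATOR = "orchestrator"


@dataclass(frozen=True)
class TaxonomyTag:
    """Hypothetical tag placing one paper on both axes plus a role."""
    domain: Domain
    stage: Stage
    role: Role


def group_by_cell(tagged):
    """Group (title, tag) pairs by their (domain, stage) cell."""
    cells = defaultdict(list)
    for title, tag in tagged:
        cells[(tag.domain, tag.stage)].append(title)
    return dict(cells)


# Placeholder entries for illustration only; not real papers.
examples = [
    ("Placeholder paper A",
     TaxonomyTag(Domain.NLP, Stage.DATA_ENGINEERING, Role.ANNOTATOR)),
    ("Placeholder paper B",
     TaxonomyTag(Domain.NLP, Stage.DATA_ENGINEERING, Role.SYNTHESIZER)),
    ("Placeholder paper C",
     TaxonomyTag(Domain.CV, Stage.MODEL_EVALUATION, Role.EVALUATOR)),
]

for (domain, stage), titles in group_by_cell(examples).items():
    print(f"{domain.value} / {stage.value}: {titles}")
```

Keeping the role separate from the (domain, stage) cell lets the same grid be read either by domain or by pipeline stage, as the caption describes.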

🔗 Resources

🤝 How to Contribute

Contributions are welcome. When suggesting a paper or resource, please include:

  • Paper title and link
  • Code, project page, or dataset link if available
  • Suggested category or topic tag
  • A short reason for inclusion

Issues and pull requests are also welcome for missing references, metadata corrections, and project-page improvements.
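
The four requested items can be checked before opening an issue or pull request. The sketch below is one possible way to do that, not a format the maintainers have specified: the `Suggestion` class, its field names, the helper functions, the rendered line layout, and the `example.org` URL are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Suggestion:
    """Hypothetical record mirroring the items requested above."""
    title: str
    paper_url: str
    category: str                       # suggested category or topic tag
    reason: str                         # short reason for inclusion
    resource_url: Optional[str] = None  # code, project page, or dataset


def missing_fields(s: Suggestion) -> List[str]:
    """Return the names of required fields that are empty or blank."""
    required = {
        "title": s.title,
        "paper_url": s.paper_url,
        "category": s.category,
        "reason": s.reason,
    }
    return [name for name, value in required.items() if not value.strip()]


def to_markdown(s: Suggestion) -> str:
    """Render the suggestion as one markdown bullet (layout is an assumption)."""
    line = f"- [{s.title}]({s.paper_url}) | {s.category} | {s.reason}"
    if s.resource_url:
        line += f" | [resource]({s.resource_url})"
    return line


# Placeholder values for illustration only.
draft = Suggestion(
    title="Placeholder paper title",
    paper_url="https://example.org/paper",
    category="model evaluation",
    reason="Placeholder reason for inclusion.",
)

problems = missing_fields(draft)
print(to_markdown(draft) if not problems else f"Missing: {problems}")
```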

📝 Citation

If this survey or repository is useful for your research, please cite the survey. The formal citation will be updated after the archival version is available.

@article{ao2026ai4air,
  title   = {AI4AIR: A Comprehensive Survey on Large Language Models for AI Research},
  author  = {Ao, Xiang and Lian, Junhong and Li, Hanyang and Wang, Siyi and Qiao, Yiran and Qiao, Yi and Xu, Jiaqi and He, Qing and Cheng, Xueqi},
  year    = {2026},
  note    = {Preprint, under review}
}

📧 Contact

If you have any questions or suggestions, please contact us via:

👨‍👨‍👧‍👦 Maintainers


⭐ If this project helps you, please give us a Star!

Made with ❤️ by FinD Lab @ ICT, CAS

About

Awesome-LLMs-for-AI-Research is a collection of state-of-the-art, novel, and representative works on large language models for AI research, covering data engineering, model design and optimization, model evaluation, and AI research automation.
