基于大语言模型智能体的代码生成综述
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP311

基金项目:

国家自然科学基金 (62192733, 62192730, 62192731); 国家重点研发计划 (2023YFB4503801); 湖北省重大项目(JD2023BAA024); 北京市重大科技项目 (Z251100008425005)


Survey on Code Generation with LLM-based Agents
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    基于大语言模型的代码生成智能体正在深刻地变革软件开发范式. 相较于之前的代码生成技术, 代码生成智能体展现出3大核心特征: 首先是自主性, 智能体能独立执行从任务分解到编码、调试的完整工作流; 其次是任务范围的广泛性, 其能力从生成代码片段扩展至覆盖软件开发的全生命周期; 最后是工程实践性的增强, 研究重心从模型算法创新转向流程管理、系统可靠性与工具集成等工程挑战. 近年来, 这一技术方向发展迅猛, 展现出巨大的应用潜力, 相关研究呈爆发式增长. 为此, 对基于大语言模型的代码生成智能体领域进行系统性的综述. 追溯该技术自诞生以来的发展脉络, 全面梳理并从方法论的视角对其核心技术(涵盖单智能体与多智能体系统)进行归纳和分类. 此外, 还总结代码生成智能体在软件开发全周期中的各项应用, 整理主流的评估基准与指标, 并盘点代表性的工具. 最后, 通过分析关键挑战, 展望该领域未来的长期核心研究方向.

    Abstract:

    Code generation agents based on large language models (LLMs) are profoundly revolutionizing the software development paradigm. Compared with previous code generation techniques, code generation agents have the following three core features. The first feature is autonomy. The agents can independently execute the entire workflow from task decomposition to coding and debugging. The second is expanded task scope. The agents’ capabilities have extended from generating code snippets to encompassing the full software development life cycle (SDLC). The third is the enhancement of engineering practicality. The research focus has shifted from model algorithmic innovation toward engineering challenges such as process management, system reliability, and tool integration. In recent years, this technical domain has witnessed rapid development and demonstrated tremendous application potential, with explosive growth in related research. To this end, this study presents a systematic review of the field of LLM-based code generation agents. The technology’s developmental trajectory since its inception is traced, and its core techniques including both single-agent and multi-agent systems are sorted out and categorized. Furthermore, this study summarizes both various applications of code generation agents in the full SDLC and the mainstream evaluation benchmarks and metrics, and reviews representative tools. Finally, by analyzing the key challenges, the long-term core research directions in the future for this field are pointed out.

    参考文献
    相似文献
    引证文献
引用本文

董益宏,姜雪,钱家如,王天,张克驰,金芝,李戈.基于大语言模型智能体的代码生成综述.软件学报,2026,37(8):1-27

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-07
  • 最后修改日期:2025-10-28
  • 录用日期:
  • 在线发布日期: 2025-12-24
  • 出版日期: 2026-08-06
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号