大语言模型赋能软件形式化验证研究综述
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP311

基金项目:

国家自然科学基金(62302375, 62472399, 62372304, 62192734); 中国博士后科学基金(023M723736); 中央高校基本科研业务费专项资金(20103247934); 深圳市基础研究专项自然科学基金计划(JCYJ20250604184202003)


A Review on Software Formal Verification Empowered by Large Language Models
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    软件形式化验证是通过数学方法和逻辑推理确保软件系统的正确性和可靠性,广泛应用于高安全性要求领域。然而,传统形式化验证技术面临自动化程度低、推理效率不足、规模化困难等挑战,难以满足复杂软件系统快速发展的需求。近年来,大语言模型(LLMs)的快速发展为自然语言处理、代码理解与生成等领域带来了革命性突破,也为形式化验证领域提供了新的自动化解决方案。为了全面梳理和分析大语言模型赋能软件形式化验证领域的研究现状与发展趋势,本文对相关研究成果进行综述。首先,概述软件形式化验证技术的核心流程与方法,以及大语言模型在该领域应用的关键技术。进一步,聚焦LLMs在两大核心场景中的应用:1)自然语言到形式化规约的转换:通过分析LLMs如何将模糊的自然语言形式的需求自动转化为形式规约,降低形式化验证中的模型构造与性质规约转换的门槛;2)在程序验证中辅助推理与证明:探讨LLMs在定理证明器技术中辅助生成证明过程,包括通过代码分析自动提取前/后置条件与不变量等用于辅助性规约的潜力,以及在模型检测中约简状态空间或优化验证策略的可能性。最后,本文总结了大语言模型技术在软件形式化验证中面临的共性挑战与未来可能的发展方向。

    Abstract:

    Formal software verification ensures the correctness and reliability of software systems through mathematical methods and logical reasoning, and it is widely applied in domains with high safety requirements. However, traditional formal verification techniques face challenges such as low levels of automation, insufficient reasoning efficiency, and difficulties in scaling, making it hard to meet the rapidly evolving demands of complex software systems. In recent years, the rapid development of large language models (LLMs) has brought revolutionary breakthroughs in fields such as natural language processing, code understanding, and code generation, offering new automated solutions for formal verification. To systematically review and analyze the current research status and future trends of LLM-powered software formal verification, this paper provides a comprehensive survey of relevant studies. First, it outlines the core processes and methods of software formal verification, as well as the key LLM-related technologies applied in this domain. The paper then focuses on two major application scenarios of LLMs: (1) the transformation from natural language to formal specifications, which explores how LLMs can automatically convert ambiguous natural language requirements into formal specifications, thereby lowering the barriers to model construction and property specification in formal verification; (2) specification synthesis and reasoning support in program verification, which investigates the potential of LLMs to assist in proof generation within theorem prover technologies, including the automatic extraction of intermediate properties such as preconditions, postconditions, and invariants through code analysis, and their potential to optimize verification strategies or reduce state space exploration in model checking. Finally, this paper summarizes the challenges faced by LLM technologies in software formal verification and discusses potential future research directions.

    参考文献
    相似文献
    引证文献
引用本文

文成,马智,胡俊杰,王竟亦,苏杰,许智武,刘杜钢,田聪,秦胜潮,杨孟飞.大语言模型赋能软件形式化验证研究综述.软件学报,2026,37(9):

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-07
  • 最后修改日期:2025-10-28
  • 录用日期:
  • 在线发布日期: 2025-12-24
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号