基于个性化联邦学习的跨项目软件缺陷预测方法
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP311

基金项目:

国家自然科学基金(62077029,62277030); 江苏省教育科学规划重点项目(B-b/2024/01/47); 南京航空航天大学基本科研业务费科研基地创新基金(NJ2020022); 江苏师范大学研究生科研创新项目(2025XKT1437)


Cross-project Software Defect Prediction Method Based on Personalized Federated Learning
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对跨项目软件缺陷预测中数据隐私与项目异构性的双重挑战,本文提出一个名为PRIDE-SDP的创新框架.该框架的核心贡献在于深度融合了三种关键技术:采用个性化联邦学习范式为每个异构项目定制专属预测模型,集成提供严格数学保障的(ε,δ)-差分隐私机制保护数据不出本地,并设计了一个专用的时间-上下文融合网络(TCFN)以高效捕捉软件度量特征.在覆盖27个开源项目和3个企业项目的6个数据集组上的实验验证了本框架的有效性:与先进的跨项目缺陷预测基线相比,PRIDE-SDP平均AUC提升10.7%,F1-score提升7.3%;在企业数据集上表现更加突出,相比所有先进的基线方法,MCC平均提升45.2%,Effort@20%平均提升29.5%,F1-Score平均提升35.4%.同时,框架在提供较强隐私保障时,其平均性能保持率仍能达到最佳性能的98%以上,并且在成员推理攻击实验中能将攻击者的攻击准确率平均降低超过36%.实验结果表明,PRIDE-SDP在保持高性能的同时,有效兼顾了隐私保护与个性化适应能力.

    Abstract:

    To address the dual challenges of data privacy and project heterogeneity in cross-project software defect prediction, this paper proposes an innovative framework named PRIDE-SDP. The core contribution of this framework lies in the deep integration of three key technologies: adopting a personalized federated learning paradigm to customize dedicated prediction models for each heterogeneous project, integrating (ε,δ)-differential privacy mechanisms that provide rigorous mathematical guarantees to protect data locally, and designing a dedicated Temporal-Contextual Fusion Network (TCFN) to efficiently capture software metric features. Experiments on six dataset groups covering 27 open-source projects and 3 enterprise projects validate the effectiveness of this framework: compared with state-of-the-art cross-project defect prediction baselines, PRIDE-SDP achieves an average AUC improvement of 10.7% and F1-score improvement of 7.3%; performance on enterprise datasets is even more outstanding, with an average MCC improvement of 45.2%, Effort@20% improvement of 29.5%, and F1-Score improvement of 35.4% compared to advanced baseline methods. Meanwhile, when providing strong privacy guarantees, the framework's average performance retention rate can still reach over 98% of the optimal performance, and in membership inference attack experiments, it can reduce the attacker's attack accuracy by an average of more than 36%. Experimental results demonstrate that PRIDE-SDP effectively balances privacy protection and personalized adaptation capabilities while maintaining high performance.

    参考文献
    相似文献
    引证文献
引用本文

刘子扬,祝义,周湘,李建豪,袁春鸿,郝国生.基于个性化联邦学习的跨项目软件缺陷预测方法.软件学报,2026,37(7):

复制
相关视频

分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-09-05
  • 最后修改日期:2025-10-20
  • 录用日期:
  • 在线发布日期: 2025-12-26
  • 出版日期:
文章二维码
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号