基于个性化联邦学习的跨项目软件缺陷预测方法

doi:10.13328/j.cnki.jos.007582

微信小程序

微信服务号

微信订阅号

首页 > 过刊浏览>2026年第37卷第7期 >. DOI:10.13328/j.cnki.jos.007582

PDF HTML阅读 XML下载导出引用引用提醒

基于个性化联邦学习的跨项目软件缺陷预测方法
DOI:
                        10.13328/j.cnki.jos.007582
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TP311
基金项目:国家自然科学基金(62077029,62277030); 江苏省教育科学规划重点项目(B-b/2024/01/47); 南京航空航天大学基本科研业务费科研基地创新基金(NJ2020022); 江苏师范大学研究生科研创新项目(2025XKT1437)

Cross-project Software Defect Prediction Method Based on Personalized Federated Learning

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对跨项目软件缺陷预测中数据隐私与项目异构性的双重挑战,本文提出一个名为PRIDE-SDP的创新框架.该框架的核心贡献在于深度融合了三种关键技术：采用个性化联邦学习范式为每个异构项目定制专属预测模型,集成提供严格数学保障的(ε,δ)-差分隐私机制保护数据不出本地,并设计了一个专用的时间-上下文融合网络（TCFN）以高效捕捉软件度量特征.在覆盖27个开源项目和3个企业项目的6个数据集组上的实验验证了本框架的有效性：与先进的跨项目缺陷预测基线相比,PRIDE-SDP平均AUC提升10.7%,F1-score提升7.3%；在企业数据集上表现更加突出,相比所有先进的基线方法,MCC平均提升45.2%,Effort@20%平均提升29.5%,F1-Score平均提升35.4%.同时,框架在提供较强隐私保障时,其平均性能保持率仍能达到最佳性能的98%以上,并且在成员推理攻击实验中能将攻击者的攻击准确率平均降低超过36%.实验结果表明,PRIDE-SDP在保持高性能的同时,有效兼顾了隐私保护与个性化适应能力.

Abstract:

To address the dual challenges of data privacy and project heterogeneity in cross-project software defect prediction, this paper proposes an innovative framework named PRIDE-SDP. The core contribution of this framework lies in the deep integration of three key technologies: adopting a personalized federated learning paradigm to customize dedicated prediction models for each heterogeneous project, integrating (ε,δ)-differential privacy mechanisms that provide rigorous mathematical guarantees to protect data locally, and designing a dedicated Temporal-Contextual Fusion Network (TCFN) to efficiently capture software metric features. Experiments on six dataset groups covering 27 open-source projects and 3 enterprise projects validate the effectiveness of this framework: compared with state-of-the-art cross-project defect prediction baselines, PRIDE-SDP achieves an average AUC improvement of 10.7% and F1-score improvement of 7.3%; performance on enterprise datasets is even more outstanding, with an average MCC improvement of 45.2%, Effort@20% improvement of 29.5%, and F1-Score improvement of 35.4% compared to advanced baseline methods. Meanwhile, when providing strong privacy guarantees, the framework's average performance retention rate can still reach over 98% of the optimal performance, and in membership inference attack experiments, it can reduce the attacker's attack accuracy by an average of more than 36%. Experimental results demonstrate that PRIDE-SDP effectively balances privacy protection and personalized adaptation capabilities while maintaining high performance.

参考文献

相似文献

引证文献

引用本文

刘子扬,祝义,周湘,李建豪,袁春鸿,郝国生.基于个性化联邦学习的跨项目软件缺陷预测方法.软件学报,2026,37(7):

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-09-05
最后修改日期:2025-10-20
录用日期:
在线发布日期: 2025-12-26
出版日期:

微信小程序

微信服务号

微信订阅号

引用本文

相关视频

分享

文章指标

历史

文章二维码