基于文件粒度的多目标软件缺陷预测方法实证研究

doi:10.13328/j.cnki.jos.005604

微信服务号

微信订阅号

首页 > 过刊浏览>2019年第30卷第12期 >3694-3713. DOI:10.13328/j.cnki.jos.005604

PDF HTML阅读 XML下载导出引用引用提醒

基于文件粒度的多目标软件缺陷预测方法实证研究
DOI:
                        10.13328/j.cnki.jos.005604
                    
作者:
                        
                        
                    
作者单位:
作者简介:陈翔(1980-),男,江苏南通人,博士,副教授,CCF高级会员,主要研究领域为软件缺陷预测,软件缺陷定位,回归测试和组合测试;赵英全(1994-),男,硕士生,主要研究领域为软件缺陷预测;顾庆(1972-),男,博士,教授,博士生导师,CCF高级会员,主要研究领域为软件质量保障,分布式计算;倪超(1990-),男,博士生,主要研究领域为软件缺陷预测;王赞(1979-),男,博士,副教授,CCF专业会员,主要研究领域为软件测试优化,软件缺陷定位,软件缺陷修复.
通讯作者:陈翔,E-mail:xchencs@ntu.edu.cn
中图分类号:TP311
基金项目:国家自然科学基金（61702041，61602267，61202006）；南京大学计算机软件新技术国家重点实验室开放课题（KFKT2019B14）；广西可信软件重点实验室研究课题（kx201610）；南通市应用研究计划（JC2018134）；江苏省政府留学奖学金

Empirical Studies on Multi-objective File-level Software Defect Prediction Method

Author:

Affiliation:

Fund Project:

National Natural Science Foundation of China (61702041, 61602267, 61202006); Open Program of the State Key Laboratory for Novel Software Technology (Nanjing University) (KFKT2019B14); Guangxi Key Laboratory of Trusted Software (kx201610); Nantong Application Research Plan (JC2018134); Jiangsu Government Scholarship for Overseas Studies

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

软件缺陷预测技术通过挖掘和分析软件库训练出软件缺陷预测模型，随后利用该模型来预测出被测软件项目内的缺陷程序模块，因此可以有效地优化测试资源的分配.在基于代价感知的评测指标下，有监督学习方法与无监督学习方法之间的预测性能比较是最近的一个热门研究话题.其中在基于文件粒度的缺陷预测问题中，Yan等人最近对Yang等人考虑的无监督学习方法和有监督学习方法展开了大规模实证研究，结果表明存在一些无监督学习方法，其性能要优于有监督方法.基于来自开源社区的10个项目展开了实证研究.结果表明：在同项目缺陷预测场景中，若基于ACC评测指标，MULTI方法与最好的无监督方法和有监督方法相比，其预测性能平均有105.81%和123.84%的提高；若基于P_OPT评测指标，MULTI方法与最好的无监督方法和有监督方法相比，其预测性能平均有35.61%和38.70%的提高.在跨项目缺陷预测场景中，若基于ACC评测指标，MULTI方法与最好的无监督方法和有监督方法相比，其预测性能平均有22.42%和34.95%的提高.若基于P_OPT评测指标，MULTI方法与最好的无监督方法和有监督方法相比，其预测性能平均有11.45%和17.92%的提高.同时，基于Huang等人提出的PMI和IFA评测指标，MULTI方法的表现与代价感知的指标相比存在一定的折衷问题，但仍好于在ACC和P_OPT评测指标下表现最好的两种无监督学习方法.除此之外，将MULTI方法与最新提出的OneWay和CBS方法进行了比较，结果表明，MULTI方法在性能上仍然可以显著优于这两种方法.同时，基于F1评测指标的结果也验证了MULTI方法在预测性能上的显著优越性.最后，通过分析模型构建的时间开销，表明MULTI方法的模型构建开销对开发人员来说处于可接受的范围之内.

Abstract:

By mining software repositories, software defect prediction can construct models to predict potential defective modules of projects under testing in advance and then optimize the allocation of test resources. When considering effort-aware performance measures, the performance comparison between supervised methods and unsupervised methods has been a recent hot topic. In the recent study for file-level defect prediction problem, Yan et al. conducted empirical studies by using unsupervised and supervised methods considered by Yang et al. and obtained the conclusion that some unsupervised methods can outperform the supervised methods. The empirical studies based on 10 projects from the open source community were conducted. Final results show that under the within-project defect prediction scenario, MULTI method can improve 105.81% and 123.84% respectively on average when compared to the best unsupervised method and the best supervised method based on ACC performance measure. While MULTI method can improve 35.61% and 38.70% respectively on average when compared to the best unsupervised method and the best supervised method based on P_OPT performance measure. Under the cross- project defect prediction scenario, MULTI method can improve 22.42% and 34.95% respectively on average when compared to the best unsupervised method and the best supervised method based on ACC performance measure. While MULTI method can improve 11.45% and 17.92% respectively on average when compared to the best unsupervised method and the best supervised method based on P_OPT performance measure. Based on PMI and IFA performance measures proposed by Huang et al., it is found that MULTI method has the issue of trade-off, but it is still better than the best two unsupervised methods when considering ACC and P_OPT performance measures. Besides, MULTI method is compared with the recently proposed OneWay and CBS methods. The results show that MULTI performs significantly better than these two methods. Based on F1 performance measure, MULTI method also shows the superiority. Finally, the analysis on the time cost of the model construction shows that the overhead of MULTI method is acceptable.

参考文献

相似文献

引证文献

引用本文

陈翔,赵英全,顾庆,倪超,王赞.基于文件粒度的多目标软件缺陷预测方法实证研究.软件学报,2019,30(12):3694-3713

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2017-12-18
最后修改日期:2018-05-06
录用日期:
在线发布日期: 2019-12-05
出版日期:

微信服务号

微信订阅号

引用本文

分享

文章指标

历史