
    Method-Level Bug Prediction Using Code Commit Information

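    • Summary: This paper proposes a method-level bug prediction framework based on multi-dimensional commit features. It introduces a new set of features derived from code commit information and combines them with traditional code and history features to build a more comprehensive multi-dimensional feature space; the resulting model significantly outperforms existing techniques on 17 open-source projects. SHAP feature importance analysis confirms that the commit features have strong predictive power and improves model interpretability. The model is then simplified around the identified key features, balancing efficiency and accuracy. Experimental results show that the model incorporating commit information improves AUC, F1, and MCC by 4.3%, 8.4%, and 17.7%, respectively.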

       

      Abstract: Software bug prediction is a vital part of software quality assurance and a key research area in software engineering. Current prediction techniques, however, face two main challenges. First, coarse-grained bug prediction often fails to meet the practical needs of industry. Second, existing models adapt poorly to dynamic development processes: they rely heavily on static code features and historical data, which makes it difficult to capture code changes and commit information effectively. To address these issues, this paper presents a method-level bug prediction framework that uses multi-dimensional commit features to improve prediction accuracy. Its primary contribution is a novel set of features derived from code commit information, which are combined with traditional code and history features to create a more comprehensive feature space. The resulting model significantly outperforms existing techniques across 17 open-source projects. SHAP-based feature importance analysis further confirms that the commit features have strong predictive power, and it improves model interpretability. Using the key features identified in this way, the model is simplified while balancing efficiency and accuracy. Experimental results show that incorporating code commit information increases AUC by an average of 4.3%, F1 score by 8.4%, and MCC by 17.7%.
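Two of the reported metrics, F1 and MCC, are computed from a binary confusion matrix (buggy vs. clean methods). The following is a minimal sketch of their standard definitions, not the paper's evaluation code; the function name `f1_and_mcc` and the counts in the usage lines are hypothetical and chosen only for illustration.

```python
import math


def f1_and_mcc(tp, fp, fn, tn):
    """Return (F1, MCC) for a binary confusion matrix.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives, and true negatives. Undefined ratios
    (zero denominators) are reported as 0.0 by convention here.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # MCC uses all four cells, so it also reflects how well the
    # majority (clean) class is predicted.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return f1, mcc


# Hypothetical counts for 200 methods, 60 of which are buggy.
f1, mcc = f1_and_mcc(tp=40, fp=10, fn=20, tn=130)
print(f"F1={f1:.3f}  MCC={mcc:.3f}")
```

Because MCC accounts for true negatives while F1 does not, the two metrics can move by different amounts on imbalanced data, which is typical of bug prediction datasets where buggy methods are a minority.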

       
