Software Defects Prediction At Method Level Using Ensemble Learning Techniques

Document Type : Original Article

Authors

1 Software Engineering Faculty of Computers and Information Technology, Egyptian E-learning University, Assuit,

2 Faculty of Computers and Information Technology Cairo-Egypt, Egyptian E-learning University

3 Faculty of Computers and Information , Assuit University

Abstract

Abstract: Creating error-free software artifacts is essential to increase software quality and potential re-usability. However, testing software artifacts to find defects and fix them is time-consuming and costly, so predicting the most error-prone software components can optimise the testing process by focusing testing resources on those components to save time and money. Much software defect prediction research has focused on higher granularities, e.g., file and package levels, and fewer have focused on the method level due to the lack of method-level  bug-related datasets . In this paper, software defect prediction will be performed on highly imbalanced  method-level datasets extracted from 23 open source Java projects . Eight ensemble learning algorithms will be applied to the datasets: Bagging, Ada-Boost, Random Forest, Random Under sampling Boost, Easy Ensemble, Balanced Bagging and Balanced Random Forest. The results showed that the Balanced Random Forest classifier achieved the best results regarding Recall and Roc_Auc values  .

Keywords