Abstract
Software defect prediction can help to allocate testing resources efficiently through ranking software modules according to their defects. Existing software defect prediction models that are optimized to predict explicitly the number of defects in a software module might fail to give an accurate order because it is very difficult to predict the exact number of defects in a software module due to noisy data. This paper introduces a learning-to-rank approach to construct software defect prediction models by directly optimizing the ranking performance. In this paper, we build on our previous work, and further study whether the idea of directly optimizing the model performance measure can benefit software defect prediction model construction. The work includes two aspects: one is a novel application of the learning-to-rank approach to real-world data sets for software defect prediction, and the other is a comprehensive evaluation and comparison of the learning-to-rank method against other algorithms that have been used for predicting the order of software modules according to the predicted number of defects. Our empirical studies demonstrate the effectiveness of directly optimizing the model performance measure for the learning-to-rank approach to construct defect prediction models for the ranking task. © 1963-2012 IEEE.
Field | Value |
---|---|
Original language | English |
Article number | 6996020 |
Pages (from-to) | 234-246 |
Number of pages | 13 |
Journal | IEEE Transactions on Reliability |
Volume | 64 |
Issue number | 1 |
Early online date | 23 Dec 2014 |
Publication status | Published - Mar 2015 |
Externally published | Yes |
Funding
This work was supported in part by the 973 Program of China (Grant No. 2011CB707006), the National Natural Science Foundation of China (Grant Nos. 61329302 and 61175065), the Program for New Century Excellent Talents in University (Grant No. NCET-12-0512), the Science and Technological Fund of Anhui Province for Outstanding Youth (Grant No. 1108085J16), EPSRC (Grant No. EP/J017515/1), and the European Union Seventh Framework Programme under grant agreements No. 247619 and No. 270428. The work of X. Yao was supported by a Royal Society Wolfson Research Merit Award.
Keywords
- count models
- learning-to-rank
- metric selection
- Software defect prediction
- software metrics