TY - JOUR
T1 - Cost-sensitive learning for defect escalation
AU - Sheng, Victor S.
AU - Gu, Bin
AU - Fang, Wei
AU - Wu, Jian
N1 - Funding Information:
The authors would like to thank the anonymous reviewers for their insightful and constructive comments and suggestions that have helped improve the quality of this paper. This research has been supported by the US National Science Foundation ( IIS-1115417 ).
PY - 2014/8
Y1 - 2014/8
N2 - While most software defects (i.e., bugs) are corrected and tested as part of the prolonged software development cycle, enterprise software venders often have to release software products before all reported defects are corrected, due to deadlines and limited resources. A small number of these reported defects will be escalated by customers whose businesses are seriously impacted. Escalated defects must be resolved immediately and individually by the software vendors at a very high cost. The total costs can be even greater, including loss of reputation, satisfaction, loyalty, and repeat revenue. In this paper, we develop a Software defecT Escalation Prediction (STEP) system to mine historical defect report data and predict the escalation risk of current defect reports for maximum net profit. More specifically, we first describe a simple and general framework to convert the maximum net profit problem to cost-sensitive learning. We then apply and compare four well-known cost-sensitive learning approaches for STEP. Our experiments suggest that cost-sensitive decision trees (CSTree) is the best methods for producing the highest positive net profit.
AB - While most software defects (i.e., bugs) are corrected and tested as part of the prolonged software development cycle, enterprise software venders often have to release software products before all reported defects are corrected, due to deadlines and limited resources. A small number of these reported defects will be escalated by customers whose businesses are seriously impacted. Escalated defects must be resolved immediately and individually by the software vendors at a very high cost. The total costs can be even greater, including loss of reputation, satisfaction, loyalty, and repeat revenue. In this paper, we develop a Software defecT Escalation Prediction (STEP) system to mine historical defect report data and predict the escalation risk of current defect reports for maximum net profit. More specifically, we first describe a simple and general framework to convert the maximum net profit problem to cost-sensitive learning. We then apply and compare four well-known cost-sensitive learning approaches for STEP. Our experiments suggest that cost-sensitive decision trees (CSTree) is the best methods for producing the highest positive net profit.
KW - Cost-sensitive learning
KW - Data mining
KW - Defect escalation
KW - Machine learning
KW - Software defect escalation prediction
UR - http://www.scopus.com/inward/record.url?scp=84902359961&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2014.04.033
DO - 10.1016/j.knosys.2014.04.033
M3 - Article
AN - SCOPUS:84902359961
SN - 0950-7051
VL - 66
SP - 146
EP - 155
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
ER -