Identifying effective test cases through K-means clustering for enhancing regression testing

Yulei Pang, Xiaozhen Xue, Akbar Siami Namin

Research output: Contribution to conferencePaper

17 Scopus citations

Abstract

Testing is the most time consuming and expensive process in the software development life cycle. In order to reduce the cost of regression testing, we propose a test case classification methodology based on k-means clustering with the purpose of classifying test cases into two groups of effective and non-effective test cases. The clustering strategy is based on Hamming distances measured over the differences between coverage information obtained for current and the previous releases of the program under test. Our empirical study shows that the clustering-based test case classification can identify effective test cases with high recall ratio and considerable accuracy percentage. The paper also investigates and compares the performance of the proposed clustering-based approach with various factors including coverage criteria and the weights factor used in measuring distances.

Original languageEnglish
Pages78-83
Number of pages6
DOIs
StatePublished - 2013
Event2013 12th International Conference on Machine Learning and Applications, ICMLA 2013 - Miami, FL, United States
Duration: Dec 4 2013Dec 7 2013

Conference

Conference2013 12th International Conference on Machine Learning and Applications, ICMLA 2013
CountryUnited States
CityMiami, FL
Period12/4/1312/7/13

Keywords

  • Test case classification
  • k-means clustering
  • regression testing

Fingerprint Dive into the research topics of 'Identifying effective test cases through K-means clustering for enhancing regression testing'. Together they form a unique fingerprint.

Cite this