Abstract
Effective methods for identification of software defects help minimize the business costs of software development. Classification methods can be used to perform software defect prediction. When cost-sensitive methods are used, the predictions are optimized for business cost. The data sets used as input for these methods typically suffer from the class imbalance problem. That is, there are many more defect-free code examples than defective code examples to learn from. This negatively impacts the classifier’s ability to correctly predict defective code examples. Cost-sensitive classification can also be used to mitigate the affects of the class imbalance problem by setting the costs to reflect the level of imbalance in the training data set. Through an experimental process, we have developed a method for combining these two different types of costs. We demonstrate that by using our proposed approach, we can produce more cost effective predictions than several recent cost-sensitive methods used for software defect prediction. Furthermore, we examine the software defect prediction models built by our method and present the discovered insights.
Original language | English |
---|---|
Title of host publication | Advanced data mining and applications |
Subtitle of host publication | 12th international conference, ADMA 2016, proceedings |
Editors | Jinyan Li, Shuliang Wang, Xue Li , Jianxin Li, Quan Z. Sheng |
Place of Publication | Switzerland |
Publisher | Springer |
Pages | 156-171 |
Number of pages | 16 |
Volume | 10086 |
ISBN (Electronic) | 9783319495866 |
ISBN (Print) | 9783319495859 |
DOIs | |
Publication status | Published - 2016 |
Event | Advanced Data Mining and Applications (ADMA) 12th International Conference - Mantra Legends Hotel, Gold Coast, Australia Duration: 12 Dec 2016 → 15 Dec 2016 https://cs.adelaide.edu.au/~adma2016/ https://cs.adelaide.edu.au/~adma2016/ADMA2016_program.pdf (Conference program) |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | Advanced Data Mining and Applications (ADMA) 12th International Conference |
---|---|
Country/Territory | Australia |
City | Gold Coast |
Period | 12/12/16 → 15/12/16 |
Other | The year 2016 marks the 12th aniversary of the International Conference on Advanced Data Mining and Applications (ADMA 2016). The conference aims at bringing together the experts on data mining from around the world, and providing a leading international forum for the dissemination of original research findings in data mining, spanning applications, algorithms, software and systems, as well as different applied disciplines with potential in data mining, such as smartphone and social network mining, bio-medical science and green computing. ADMA 2016 will promote the same close interaction and collaboration among practitioners and researchers. Published papers will go through a full peer review process. |
Internet address |