Skip to main navigation Skip to search Skip to main content

Discrimination analysis for predicting defect-prone software modules

  • Ying Ma
  • , Ke Qin
  • , Shunzhi Zhu*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Software defect prediction studies usually build models without analyzing the data used in the procedure. As a result, the same approach has different performances on different data sets. In this paper, we introduce discrimination analysis for providing a good method to give insight into the inherent property of the software data. Based on the analysis, we find that the data sets used in this field have nonlinearly separable and class-imbalanced problems. Unlike the prior works, we try to exploit the kernel method to nonlinearly map the data into a high-dimensional feature space. By combating these two problems, we propose an algorithm based on kernel discrimination analysis called KDC to build more effective prediction model. Experimental results on the data sets from different organizations indicate that KDC is more accurate in terms of F-measure than the state-of-the-art methods. We are optimistic that our discrimination analysis method can guide more studies on data structure, which may derive useful knowledge from data science for building more accurate prediction models.

Original languageEnglish
Article number675368
JournalJournal of Applied Mathematics
Volume2014
DOIs
StatePublished - 2014
Externally publishedYes

Fingerprint

Dive into the research topics of 'Discrimination analysis for predicting defect-prone software modules'. Together they form a unique fingerprint.

Cite this