Skip to main navigation Skip to search Skip to main content

A comparative study on clustering-based classification algorithms

  • Zhaocai Sun*
  • , Zhi Liu
  • , Yunming Ye
  • , Shengchun Deng
  • , Zhexue Huang
  • *Corresponding author for this work
  • Harbin Institute of Technology Shenzhen
  • Shandong University
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Shenzhen Institute of Advanced Technology

Research output: Contribution to journalArticlepeer-review

Abstract

For most classifiers, they are based on underlying assumptions or models. If the model or assumption matches the true data distribution, the accuracy is high, and vice versa. That is the problem of model-misfit. This paper uses clustering or cluster analysis in classification to solve the problem. For this reason, we propose a general clustering-based classification framework. In the framework, clustering algorithm is used to re-collect the data, like a filter. Thus, the diversities between classes are weakened. By that, Model-misfit is no problem any more. In our framework, almost all clustering and classification algorithms can be integrated together for the better performance. In this paper, we present an empirical study on four clustering-based classification methods. On complex data (e.g., non-linear), experimental results show that the clustering-based approach can improve the performance of the traditional classifier, especially for simple classifiers (e.g., k-NN).

Original languageEnglish
Pages (from-to)3987-3994
Number of pages8
JournalICIC Express Letters
Volume5
Issue number11
StatePublished - Nov 2011
Externally publishedYes

Keywords

  • Classification
  • Clustering
  • Decision tree
  • K-NN
  • K-means
  • Naive Bayes
  • SVM

Fingerprint

Dive into the research topics of 'A comparative study on clustering-based classification algorithms'. Together they form a unique fingerprint.

Cite this