Skip to main navigation Skip to search Skip to main content

ITCI:An information theory based classification algorithm for incomplete data

  • Harbin Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the field of data mining, classification is an important aspect which has been studied widely. However, most of the existing studies assumed the data for classification is complete, while in practice, a lot of data with missing values exists. When dealing with these data, deleting the incomplete instances will result in a reduction of available information and filling in missing values may introduce skew and errors. To avoid the above problems, it is of great importance to study how to classify directly with incomplete data. In the paper, an information theory based classification algorithm, ITCI, is proposed. ITCI calculates the initial uncertainty of each class and attributes' contribution to decrease class uncertainty in the training stage and then, in the testing stage, an instance is assigned to the class whose uncertainty is minimum after all of the attributes are taken into consideration. Extended experiments proved the effectiveness and feasibility of the proposed method.

Original languageEnglish
Title of host publicationWeb-Age Information Management - 15th International Conference, WAIM 2014, Proceedings
PublisherSpringer Verlag
Pages167-179
Number of pages13
ISBN (Print)9783319080093
DOIs
StatePublished - 2014
Event15th International Conference on Web-Age Information Management, WAIM 2014 - Macau, China
Duration: 16 Jun 201418 Jun 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8485 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Web-Age Information Management, WAIM 2014
Country/TerritoryChina
CityMacau
Period16/06/1418/06/14

Keywords

  • classification
  • incomplete data
  • information theory

Fingerprint

Dive into the research topics of 'ITCI:An information theory based classification algorithm for incomplete data'. Together they form a unique fingerprint.

Cite this