Skip to main navigation Skip to search Skip to main content

Text classification based on the TAN model

  • Hong Bo Shi*
  • , Zhi Hai Wang
  • , Hou Kuan Huang
  • , Li Ping Jing
  • *Corresponding author for this work
  • Beijing Jiaotong University

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper proposes a text classification method based on TAN model. Naive Bayesian classifier is the most effective and popular text classification method, but its attribute independence assumption makes it unable to express the dependence among text terms, TAN (Tree Augmented Naive Bayes) combines the simplicity of Naive Bayesian with the ability to express the dependence among attributes in Bayesian network. This paper reviews some existing text methods, introduces TAN model, and applies TAN model to text classification. Naive Bayesian and TAN classifiers are also compared by our experiments. Experimental results show TAN classifier has better performance.

Original languageEnglish
Pages43-46
Number of pages4
StatePublished - 2002
Externally publishedYes
Event2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering - Beijing, China
Duration: 28 Oct 200231 Oct 2002

Conference

Conference2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering
Country/TerritoryChina
CityBeijing
Period28/10/0231/10/02

Keywords

  • Bayesian Network
  • Feature selection
  • Naive Bayes
  • TAN
  • Text classification

Fingerprint

Dive into the research topics of 'Text classification based on the TAN model'. Together they form a unique fingerprint.

Cite this