Skip to main navigation Skip to search Skip to main content

A combined measure for text semantic similarity

  • Hao Di Li*
  • , Qing Cai Chen
  • , Xiao Long Wang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid development of artificial intelligence and natural language processing, text similarity calculation has become the core module of many applications such as semantic disambiguation, information retrieval, automatic question answering and data mining etc. Most of the existing semantic similarity algorithms are based on statistical methods or rule based methods that are conducted on ontology dictionaries and some kind of knowledge bases. Wherein the rule-based methods usually use the dictionary, the ontology tree or graph, or the co-occurrence number of attributes, while the statistical methods may choose to use or not use a knowledge base. While a statistical method of using a knowledge base incorporates more comprehensive knowledge and has the capability of reduces knowledge noise, it usually obtains better performance. Nevertheless, due to the imbalanced distribution of different items in a knowledge base, the semantic similarity calculation results for low-frequency words are usually poor.

Original languageEnglish
Title of host publicationProceedings - International Conference on Machine Learning and Cybernetics
PublisherIEEE Computer Society
Pages1869-1873
Number of pages5
ISBN (Electronic)9781479902576
DOIs
StatePublished - 2013
Externally publishedYes
Event12th International Conference on Machine Learning and Cybernetics, ICMLC 2013 - Tianjin, China
Duration: 14 Jul 201317 Jul 2013

Publication series

NameProceedings - International Conference on Machine Learning and Cybernetics
Volume4
ISSN (Print)2160-133X
ISSN (Electronic)2160-1348

Conference

Conference12th International Conference on Machine Learning and Cybernetics, ICMLC 2013
Country/TerritoryChina
CityTianjin
Period14/07/1317/07/13

Keywords

  • Combination of rule and statistical measure
  • Semantic similarity
  • Sentence level semantic similarity

Fingerprint

Dive into the research topics of 'A combined measure for text semantic similarity'. Together they form a unique fingerprint.

Cite this