Skip to main navigation Skip to search Skip to main content

Auto-POS templates and mixed metrics for recognizing terms in scientific literature

  • Hongliang You*
  • , Wei Zhang
  • , Junyi Shen
  • , Yang Yu
  • , Ting Liu
  • *Corresponding author for this work
  • Xi'an Jiaotong University
  • Beijing Document Service

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Automatic Term Recognition (ATR) is an important task for Knowledge Acquisition, which aims at acquiring formalized words which are not recorded in time in the glossary. In recent years, several statistical methods has proved to be effective, and emerging methods such as C-value, NC-Value, TermExtractor has shown great advantages on this task. However, few works have been done on the Metric mixing algorithm that combines those metrics as a whole. In this paper, we first collect part-of-speech tern plates from already-known terms automatically. namely Auto-POS templates. instead of artificial regular expressions, and then we match them with POS strings to acquire candidate terms. Finally we sort those candidates by metric mixing algorithm. Experimental results on IEEE2006-2007 metadata show that the metric mixing algorithm performs better than any separate metrics alone.

Original languageEnglish
Title of host publication2010 3rd International Symposium on Knowledge Acquisition and Modeling, KAM 2010
Pages84-87
Number of pages4
DOIs
StatePublished - 2010
Event2010 3rd International Symposium on Knowledge Acquisition and Modeling, KAM 2010 - Wuhan, China
Duration: 20 Oct 201021 Oct 2010

Publication series

Name2010 3rd International Symposium on Knowledge Acquisition and Modeling, KAM 2010

Conference

Conference2010 3rd International Symposium on Knowledge Acquisition and Modeling, KAM 2010
Country/TerritoryChina
CityWuhan
Period20/10/1021/10/10

Keywords

  • Automatic term recognition
  • Information extraction
  • Knowledge acquisition
  • Text mining

Fingerprint

Dive into the research topics of 'Auto-POS templates and mixed metrics for recognizing terms in scientific literature'. Together they form a unique fingerprint.

Cite this