Skip to main navigation Skip to search Skip to main content

Paraphrase collocation extraction based on binary classification

  • Shi Qi Zhao*
  • , Lin Zhao
  • , Ting Liu
  • , Sheng Li
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This paper addresses the problem of paraphrase collocation extraction by using "OBJ" relationship as a case study. Specifically, the proposed method recasts paraphrase collocation extraction as a binary classification problem, which combines multiple features based on translation, thesaurus, polarity words, and web mining. Experimental results show that the binary classification-based method is effective for paraphrase collocation extraction. Especially, the exploited features are all helpful for improving the extraction performance. With the proposed method, more than 280 000 pairs of paraphrase collocations are extracted, the precision of which is above 70%. Further experiments show that nearly 40% of sentences can be paraphrased by using the extracted paraphrase collocations, which demonstrates that the proposed method is useful in practice.

Original languageEnglish
Pages (from-to)1267-1276
Number of pages10
JournalRuan Jian Xue Bao/Journal of Software
Volume21
Issue number6
DOIs
StatePublished - Jun 2010
Externally publishedYes

Keywords

  • Binary classification
  • Paraphrase collocation
  • Paraphrase feature

Fingerprint

Dive into the research topics of 'Paraphrase collocation extraction based on binary classification'. Together they form a unique fingerprint.

Cite this