Skip to main navigation Skip to search Skip to main content

Improving Unsupervised Dependency Parsing with Knowledge from Query Logs

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Unsupervised dependency parsing becomes more and more popular in recent years because it does not need expensive annotations, such as treebanks, which are required for supervised and semi-supervised dependency parsing. However, its accuracy is still far below that of supervised dependency parsers, partly due to the fact that their parsing model is insufficient to capture linguistic phenomena underlying texts. The performance for unsupervised dependency parsing can be improved by mining knowledge from the texts and by incorporating it into the model. In this article, syntactic knowledge is acquired from query logs to help estimate better probabilities in dependency models with valence. The proposed method is language independent and obtains an improvement of 4.1% unlabeled accuracy on the Penn Chinese Treebank by utilizing additional dependency relations from the Sogou query logs and Baidu query logs. Morever, experiments show that the proposed model achieves improvements of 8.07% on CoNLL 2007 English using the AOL query logs. We believe query logs are useful sources of syntactic knowledge for many natural language processing (NLP) tasks.

Original languageEnglish
Article number3
JournalACM Transactions on Asian and Low-Resource Language Information Processing
Volume16
Issue number1
DOIs
StatePublished - 22 Jun 2016
Externally publishedYes

Keywords

  • Dependency parsing
  • additional knowledge
  • natural annotations
  • query logs

Fingerprint

Dive into the research topics of 'Improving Unsupervised Dependency Parsing with Knowledge from Query Logs'. Together they form a unique fingerprint.

Cite this