Skip to main navigation Skip to search Skip to main content

Two-stage approach to full Chinese parsing

  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Natural language parsing is a task of great importance and extreme difficulty. In this paper, we present a full Chinese parsing system based on a two-stage approach. Rather than identifying all phrases by a uniform model, we utilize a divide and conquer strategy. We propose an effective and fast method based on Markov model to identify the base phrases. Then we make the first attempt to extend one of the best English parsing models i.e. the head-driven model to recognize Chinese complex phrases. Our two-stage approach is superior to the uniform approach in two aspects. First, it creates synergy between the Markov model and the head-driven model. Second, it reduces the complexity of full Chinese parsing and makes the parsing system space and time efficient. We evaluate our approach in PARSEVAL measures on the open test set, the parsing system performances at 87.53% precision, 87.95% recall.

Original languageEnglish
Pages (from-to)359-363
Number of pages5
JournalHigh Technology Letters
Volume11
Issue number4
StatePublished - Dec 2005

Keywords

  • Markov model
  • Natural language processing systems
  • Parsing
  • Pattern recognition

Fingerprint

Dive into the research topics of 'Two-stage approach to full Chinese parsing'. Together they form a unique fingerprint.

Cite this