Skip to main navigation Skip to search Skip to main content

Refinements in BTG-based Statistical Machine Translation

  • Deyi Xiong
  • , Min Zhang
  • , Aiti Aw
  • , Haitao Mi
  • , Qun Liu
  • , Shouxun Lin
  • Agency for Science, Technology and Research, Singapore
  • CAS - Institute of Computing Technology

Research output: Contribution to conferencePaperpeer-review

Abstract

Bracketing Transduction Grammar (BTG) has been well studied and used in statistical machine translation (SMT) with promising results. However, there are two major issues for BTG-based SMT. First, there is no effective mechanism available for predicting orders between neighboring blocks in the original BTG. Second, the computational cost is high. In this paper, we introduce two refinements for BTG-based SMT to achieve better reordering and higher-speed decoding, which include (1) reordering heuristics to prevent incorrect swapping and reduce search space, and (2) special phrases with tags to indicate sentence beginning and ending. The two refinements are integrated into a well-established BTG-based Chinese-to- English SMT system that is trained on largescale parallel data. Experimental results on the NIST MT-05 task show that the proposed refinements contribute significant improvement of 2% in BLEU score over the baseline system.

Original languageEnglish
Pages505-512
Number of pages8
StatePublished - 2008
Externally publishedYes
Event3rd International Joint Conference on Natural Language Processing, IJCNLP 2008 - Hyderabad, India
Duration: 7 Jan 200812 Jan 2008

Conference

Conference3rd International Joint Conference on Natural Language Processing, IJCNLP 2008
Country/TerritoryIndia
CityHyderabad
Period7/01/0812/01/08

Fingerprint

Dive into the research topics of 'Refinements in BTG-based Statistical Machine Translation'. Together they form a unique fingerprint.

Cite this