Abstract
Bracketing Transduction Grammar (BTG) has been well studied and used in statistical machine translation (SMT) with promising results. However, there are two major issues for BTG-based SMT. First, there is no effective mechanism available for predicting orders between neighboring blocks in the original BTG. Second, the computational cost is high. In this paper, we introduce two refinements for BTG-based SMT to achieve better reordering and higher-speed decoding, which include (1) reordering heuristics to prevent incorrect swapping and reduce search space, and (2) special phrases with tags to indicate sentence beginning and ending. The two refinements are integrated into a well-established BTG-based Chinese-to- English SMT system that is trained on largescale parallel data. Experimental results on the NIST MT-05 task show that the proposed refinements contribute significant improvement of 2% in BLEU score over the baseline system.
| Original language | English |
|---|---|
| Pages | 505-512 |
| Number of pages | 8 |
| State | Published - 2008 |
| Externally published | Yes |
| Event | 3rd International Joint Conference on Natural Language Processing, IJCNLP 2008 - Hyderabad, India Duration: 7 Jan 2008 → 12 Jan 2008 |
Conference
| Conference | 3rd International Joint Conference on Natural Language Processing, IJCNLP 2008 |
|---|---|
| Country/Territory | India |
| City | Hyderabad |
| Period | 7/01/08 → 12/01/08 |
Fingerprint
Dive into the research topics of 'Refinements in BTG-based Statistical Machine Translation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver