Abstract
This paper presents two pivot strategies for statistical machine transliteration, namely system-based pivot strategy and model-based pivot strategy. Given two independent source-pivot and pivot-target name pair corpora, the model-based strategy learns a direct sourcetarget transliteration model while the system-based strategy learns a sourcepivot model and a pivot-target model, respectively. Experimental results on benchmark data show that the systembased pivot strategy is effective in reducing the high resource requirement of training corpus for low-density language pairs while the model-based pivot strategy performs worse than the system-based one.
| Original language | English |
|---|---|
| Pages | 1444-1452 |
| Number of pages | 9 |
| State | Published - 2010 |
| Externally published | Yes |
| Event | 23rd International Conference on Computational Linguistics, Coling 2010 - Beijing, China Duration: 23 Aug 2010 → 27 Aug 2010 |
Conference
| Conference | 23rd International Conference on Computational Linguistics, Coling 2010 |
|---|---|
| Country/Territory | China |
| City | Beijing |
| Period | 23/08/10 → 27/08/10 |
Fingerprint
Dive into the research topics of 'Machine transliteration: Leveraging on third languages'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver