Skip to main navigation Skip to search Skip to main content

Machine transliteration: Leveraging on third languages

  • Min Zhang*
  • , Xiangyu Duan
  • , Vladimir Pervouchine
  • , Haizhou Li
  • *Corresponding author for this work
  • Agency for Science, Technology and Research, Singapore

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper presents two pivot strategies for statistical machine transliteration, namely system-based pivot strategy and model-based pivot strategy. Given two independent source-pivot and pivot-target name pair corpora, the model-based strategy learns a direct sourcetarget transliteration model while the system-based strategy learns a sourcepivot model and a pivot-target model, respectively. Experimental results on benchmark data show that the systembased pivot strategy is effective in reducing the high resource requirement of training corpus for low-density language pairs while the model-based pivot strategy performs worse than the system-based one.

Original languageEnglish
Pages1444-1452
Number of pages9
StatePublished - 2010
Externally publishedYes
Event23rd International Conference on Computational Linguistics, Coling 2010 - Beijing, China
Duration: 23 Aug 201027 Aug 2010

Conference

Conference23rd International Conference on Computational Linguistics, Coling 2010
Country/TerritoryChina
CityBeijing
Period23/08/1027/08/10

Fingerprint

Dive into the research topics of 'Machine transliteration: Leveraging on third languages'. Together they form a unique fingerprint.

Cite this