Skip to main navigation Skip to search Skip to main content

Syllable-based Machine Transliteration with Extra Phrase Features

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper describes our syllable-based phrase transliteration system for the NEWS 2012 shared task on English-Chinese track and its back. Grapheme-based Transliteration maps the character(s) in the source side to the target character(s) directly. However, character-based segmentation on English side will cause ambiguity in alignment step. In this paper we utilize Phrase-based model to solve machine transliteration with the mapping between Chinese characters and English syllables rather than English characters. Two heuristic rulebased syllable segmentation algorithms are applied. This transliteration model also incorporates three phonetic features to enhance discriminative ability for phrase. The primary system achieved 0.330 on Chinese-English and 0.177 on English-Chinese in terms of top-1 accuracy.

Original languageEnglish
Title of host publicationProceedings of the 2012 Named Entity Workshop, NEWS 2012 at the 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012
EditorsMin Zhang, Haizhou Li, A Kumaran
PublisherAssociation for Computational Linguistics (ACL)
Pages52-56
Number of pages5
ISBN (Electronic)9781937284404
StatePublished - 2012
Event4th Named Entity Workshop, NEWS 2012 at the 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Jeju Island, Korea, Republic of
Duration: 12 Jul 2012 → …

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN (Print)0736-587X

Conference

Conference4th Named Entity Workshop, NEWS 2012 at the 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012
Country/TerritoryKorea, Republic of
CityJeju Island
Period12/07/12 → …

Fingerprint

Dive into the research topics of 'Syllable-based Machine Transliteration with Extra Phrase Features'. Together they form a unique fingerprint.

Cite this