Skip to main navigation Skip to search Skip to main content

A context-Aware topic model for statistical machine translation

  • Jinsong Su
  • , Deyi Xiong
  • , Yang Liu
  • , Xianpei Han
  • , Hongyu Lin
  • , Junfeng Yao
  • , Min Zhang
  • Xiamen University
  • Soochow University
  • Tsinghua University
  • CAS - Institute of Software

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Lexical selection is crucial for statistical machine translation. Previous studies separately exploit sentence-level contexts and documentlevel topics for lexical selection, neglecting their correlations. In this paper, we propose a context-Aware topic model for lexical selection, which not only models local contexts and global topics but also captures their correlations. The model uses target-side translations as hidden variables to connect document topics and source-side local contextual words. In order to learn hidden variables and distributions from data, we introduce a Gibbs sampling algorithm for statistical estimation and inference. A new translation probability based on distributions learned by the model is integrated into a translation system for lexical selection. Experiment results on NIST Chinese-English test sets demonstrate that 1) our model significantly outperforms previous lexical selection methods and 2) modeling correlations between local words and global topics can further improve translation quality.

Original languageEnglish
Title of host publicationACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages229-238
Number of pages10
ISBN (Electronic)9781941643723
DOIs
StatePublished - 2015
Externally publishedYes
Event53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL-IJCNLP 2015 - Beijing, China
Duration: 26 Jul 201531 Jul 2015

Publication series

NameACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference
Volume1

Conference

Conference53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL-IJCNLP 2015
Country/TerritoryChina
CityBeijing
Period26/07/1531/07/15

Fingerprint

Dive into the research topics of 'A context-Aware topic model for statistical machine translation'. Together they form a unique fingerprint.

Cite this