Skip to main navigation Skip to search Skip to main content

文档级神经机器翻译综述

Translated title of the contribution: Survey on Document-level Neural Machine Translation
  • Xing Lin Lü
  • , Jun Hui Li*
  • , Shi Min Tao
  • , Hao Yang
  • , Min Zhang
  • *Corresponding author for this work
  • Soochow University
  • Huawei Translation Service Center

Research output: Contribution to journalArticlepeer-review

Abstract

Machine translation (MT) aims to build an automatic translating system to transform a given sequence in the source language into another target language sequence that shares identical semantic information. MT has been an important research direction in natural language processing and artificial intelligence fields for its widely applied scenarios. In recent years, the performance of neural machine translation (NMT) greatly surpasses that of statistical machine translation (SMT), becoming the mainstream method in MT research. However, NMT generally takes the sentence as the translated unit, and in document-level translation scenarios, some discourse errors such as the mistranslation of words and incoherent sentences may occur due to the separation with discourse context if the sentence is translated independently. Therefore, incorporating document-level information into the procedure of translation may be a more reasonable and natural way to solve discourse errors. This conforms with the goal of document-level neural machine translation (DNMT) and has been a popular direction in MT research. This study reviews and summarizes works in DNMT research in terms of discourse evaluation methods, datasets and models applied, and other aspects to help the researchers efficiently learn the research status and further directions of DNMT. Meanwhile, this study also introduces the prospect and some challenges in DNMT, hoping to bring some inspiration to researchers.

Translated title of the contributionSurvey on Document-level Neural Machine Translation
Original languageChinese (Traditional)
Pages (from-to)152-183
Number of pages32
JournalRuan Jian Xue Bao/Journal of Software
Volume36
Issue number1
DOIs
StatePublished - 2025
Externally publishedYes

Fingerprint

Dive into the research topics of 'Survey on Document-level Neural Machine Translation'. Together they form a unique fingerprint.

Cite this