Skip to main navigation Skip to search Skip to main content

Semantic-based Pre-training for Dialogue Understanding

  • Xuefeng Bai*
  • , Linfeng Song
  • , Yue Zhang
  • *Corresponding author for this work
  • Westlake University
  • Tencent

Research output: Contribution to journalConference articlepeer-review

Abstract

Pre-trained language models have made great progress on dialogue tasks. However, these models are typically trained on surface dialogue text, thus are proven to be weak in understanding the main semantic meaning of a dialogue context. We investigate Abstract Meaning Representation (AMR) as explicit semantic knowledge for pre-training models to capture the core semantic information in dialogues during pre-training. In particular, we propose a semantic-based pre-training framework that extends the standard pre-training framework (Devlin et al., 2019) by three tasks for learning 1) core semantic units, 2) semantic relations and 3) the overall semantic representation according to AMR graphs. Experiments on the understanding of both chit-chats and task-oriented dialogues show the superiority of our model. To our knowledge, we are the first to leverage a deep semantic representation for dialogue pre-training.

Original languageEnglish
Pages (from-to)592-607
Number of pages16
JournalProceedings - International Conference on Computational Linguistics, COLING
Volume29
Issue number1
StatePublished - 2022
Externally publishedYes
Event29th International Conference on Computational Linguistics, COLING 2022 - Hybrid, Gyeongju, Korea, Republic of
Duration: 12 Oct 202217 Oct 2022

Fingerprint

Dive into the research topics of 'Semantic-based Pre-training for Dialogue Understanding'. Together they form a unique fingerprint.

Cite this