Skip to main navigation Skip to search Skip to main content

Localgan: Modeling local distributions for adversarial response generation

  • Baoxun Wang
  • , Zhen Xu
  • , Huan Zhang
  • , Kexin Qiu
  • , Deyuan Zhang
  • , Chengjie Sun
  • Tencent
  • Peking University
  • Columbia University
  • Shenyang Aerospace University
  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a new methodology for modeling the local semantic distribution of responses to a given query in the human-conversation corpus, and on this basis, explores a specified adversarial learning mechanism for training Neural Response Generation (NRG) models to build conversational agents. Our investigation begins with the thorough discussions upon the objective function of general Generative Adversarial Nets (GAN) architectures, and the training instability problem is proved to be highly relative with the special local distributions of conversational corpora. Consequently, an energy function is employed to estimate the status of a local area restricted by the query and its responses in the semantic space, and the mathematical approximation of this energy-based distribution is finally found. Building on this foundation, a local distribution oriented objective is proposed and combined with the original objective, working as a hybrid loss for the adversarial training of response generation models, named as LocalGAN. Our experimental results demonstrate that the reasonable local distribution modeling of the query-response corpus is of great importance to adversarial NRG, and our proposed LocalGAN is promising for improving both the training stability and the quality of generated results.

Original languageEnglish
JournalJournal of Machine Learning Research
Volume22
StatePublished - 2021
Externally publishedYes

Keywords

  • Adversarial Learning
  • Conversational Agents
  • Energy-based Distribution Modeling
  • Local Distribution
  • Neural Response Generation

Fingerprint

Dive into the research topics of 'Localgan: Modeling local distributions for adversarial response generation'. Together they form a unique fingerprint.

Cite this