Skip to main navigation Skip to search Skip to main content

A survey of LLM alignment: Instruction understanding, intention reasoning, and reliable dialogue generation

  • Zongyu Chang
  • , Feihong Lu
  • , Ziqin Zhu
  • , Qian Li*
  • , Cheng Ji
  • , Tao Yang
  • , Zhuo Chen
  • , Hao Peng
  • , Yang Liu
  • , Ruifeng Xu
  • , Yangqiu Song
  • , Jianxin Li
  • , Shangguang Wang
  • *Corresponding author for this work
  • Beijing University of Posts and Telecommunications
  • Beihang University
  • The University of Auckland
  • Chinese Academy of Sciences
  • Hong Kong University of Science and Technology

Research output: Contribution to journalReview articlepeer-review

Abstract

Large language models have demonstrated exceptional capabilities in understanding and generation. However, in real-world scenarios, users’ natural language expressions are often inherently fuzzy, ambiguous, and uncertain, leading to challenges such as vagueness, polysemy, and contextual ambiguity. This paper focuses on three challenges in LLM-based text generation tasks: instruction understanding, intention reasoning, and reliable dialogue generation. Regarding complex human instructions, LLMs have deficiencies in understanding long contexts and instructions in multi-round conversations. For intention reasoning, LLMs may exhibit inconsistent command reasoning, difficulty reasoning about commands containing incorrect information, difficulty understanding user ambiguous language commands, and a weak understanding of user intentions in commands. Besides, in terms of reliable dialogue generation, LLMs may produce unstable generated content and unethical outputs. To this end, we classify and analyze the performance of LLMs in challenging scenarios and conduct a comprehensive evaluation of existing solutions. Furthermore, we introduce benchmarks and categorize them based on the aforementioned three core challenges. Finally, we explore potential directions for future research to enhance the reliability and adaptability of LLMs in real-world applications.

Original languageEnglish
Article number133629
JournalNeurocomputing
Volume687
DOIs
StatePublished - 28 Jul 2026

Keywords

  • Instruction understanding
  • Intention reasoning
  • Reliable dialogue generation

Fingerprint

Dive into the research topics of 'A survey of LLM alignment: Instruction understanding, intention reasoning, and reliable dialogue generation'. Together they form a unique fingerprint.

Cite this