Skip to main navigation Skip to search Skip to main content

ESED: Emotion-Specific Evidence Decomposition for Uncertainty-Aware Multimodal Emotion Recognition in Conversation

  • Zechang Xiong
  • , Zhenyan Ji
  • , Wenkang Kong
  • , Jiuqian Dai
  • , Shen Yin
  • Beijing Jiaotong University
  • Norwegian University of Science and Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Multimodal emotion recognition in conversations is inherently challenging due to ambiguous cues, modality conflicts, and temporal dynamics, all of which contribute to complex and diverse uncertainty sources. While some recent methods incorporate uncertainty modeling, they often focus on overall prediction confidence, without explicitly distinguishing the different sources of uncertainty introduced by underlying factors. To address these challenges, we propose a novel Emotion-Specific Evidence Decomposition framework (ESED) that leverages evidential deep learning to explicitly model and disentangle multimodal emotional uncertainty. Rather than directly fusing features, ESED decomposes each modality's evidence into three interpretable components: (1) emotion-consistent evidence, capturing shared emotional cues across modalities; (2) emotion-specific evidence, highlighting the unique emotional role of each modality; and (3) dynamic evidence, modeling utterance-level temporal variations. These components are adaptively weighted based on emotional intensity, ambiguity, and dynamicity, quantified via prediction entropy, inter-modal divergence, and temporal variance. The final prediction is obtained through an adaptive fusion of these weighted components. Extensive experiments demonstrate that ESED outperforms the state-of-the-art methods on the MELD and IEMOCAP datasets, demonstrating the effectiveness of our proposed method.

Original languageEnglish
Title of host publicationCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery, Inc
Pages3582-3591
Number of pages10
ISBN (Electronic)9798400720406
DOIs
StatePublished - 10 Nov 2025
Externally publishedYes
Event34th ACM International Conference on Information and Knowledge Management, CIKM 2025 - Seoul, Korea, Republic of
Duration: 10 Nov 202514 Nov 2025

Publication series

NameCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management

Conference

Conference34th ACM International Conference on Information and Knowledge Management, CIKM 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period10/11/2514/11/25

Keywords

  • emotion recognition in conversation
  • evidential deep learning
  • multimodal fusion

Fingerprint

Dive into the research topics of 'ESED: Emotion-Specific Evidence Decomposition for Uncertainty-Aware Multimodal Emotion Recognition in Conversation'. Together they form a unique fingerprint.

Cite this