Skip to main navigation Skip to search Skip to main content

Dynamic Confidence-Aware Multi-Modal Emotion Recognition

  • Qi Zhu
  • , Chuhang Zheng
  • , Zheng Zhang*
  • , Wei Shao*
  • , Daoqiang Zhang*
  • *Corresponding author for this work
  • Nanjing University of Aeronautics and Astronautics
  • School of Computer Science and Technology, Harbin Institute of Technology
  • Peng Cheng Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

Multi-modal emotion recognition has attracted increasing attention in human-computer interaction, as it extracts complementary information from physiological and behavioral features. Compared to single modal approaches, multi-modal fusion methods are more susceptible to uncertainty in emotion recognition, such as heterogeneity and inconsistent predictions across different modalities. Previous multi-modal approaches ignore systematic modeling of uncertainty in fusion and revelation of dynamic variations in emotion process. In this article, we propose a dynamic confidence-aware fusion network for robust recognition of heterogeneous emotion features, including electroencephalogram (EEG) and facial expression. First, we develop a self-attention based multi-channel LSTM network to preliminarily align the heterogeneous emotion features. Second, we propose a confidence regression network to estimate true class probability (TCP) on each modality, which helps explore the uncertainty at modality level. Then, different modalities are weighted fused according to above two types of uncertainty. Finally, we adopt self-paced learning (SPL) mechanism to further improve the model robustness by alleviating negative effect from the hard learning samples. The experimental results on several multi-modal emotion datasets demonstrate the proposed method outperforms the state-of-the-art methods in emotion recognition performance and explicitly reveals the dynamic variation of emotion with uncertainty estimation.

Original languageEnglish
Pages (from-to)1358-1370
Number of pages13
JournalIEEE Transactions on Affective Computing
Volume15
Issue number3
DOIs
StatePublished - 2024
Externally publishedYes

Keywords

  • Multi-modal fusion
  • emotion recognition
  • self-attention mechanism
  • self-paced learning
  • uncertainty

Fingerprint

Dive into the research topics of 'Dynamic Confidence-Aware Multi-Modal Emotion Recognition'. Together they form a unique fingerprint.

Cite this