Skip to main navigation Skip to search Skip to main content

Hierarchical Multi-Criteria Representation Fusion for Robust Incomplete Multimodal Sentiment Analysis

  • Harbin Institute of Technology
  • Peng Cheng Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

The challenge of improving robustness to missing data in multimodal sentiment analysis (MSA) has recently attracted increasing attention. The existing work is based on sentence-level sentiment analysis, which requires extracting vector representations from input multimodal sentence sequences to perform sentiment regression or classification tasks. However, most studies focus primarily on exploring cross-modal correlations, neglecting the dynamic semantic variations across sequence frames during feature merging and fusion. In this paper, we propose the Hierarchical Multi-criteria Representation Fusion (HMRF) framework, which effectively captures dynamic semantic information across frames and modalities. Specifically, HMRF consists of two parts, i.e., sequence merging and information transfer. For sequence merging, we design a hierarchical cross-modal semantic perception mechanism and a multi-criteria feature merging mechanism, which together mitigate unimodal affective bias and progressively integrate incomplete feature sequences. Moreover, for information transfer, hierarchical information distillation and high-level semantic reconstruction are performed to transfer certain attributes and sentiment information from the complete view to the incomplete view during training. Extensive experiments on multiple benchmark datasets consistently demonstrate that the proposed HMRF outperforms existing baselines.

Original languageEnglish
Pages (from-to)290-302
Number of pages13
JournalIEEE Transactions on Affective Computing
Volume17
Issue number1
DOIs
StatePublished - 2026
Externally publishedYes

Keywords

  • Sentiment analysis
  • incomplete multimodal learning
  • information distillation
  • sequence learning

Fingerprint

Dive into the research topics of 'Hierarchical Multi-Criteria Representation Fusion for Robust Incomplete Multimodal Sentiment Analysis'. Together they form a unique fingerprint.

Cite this