Skip to main navigation Skip to search Skip to main content

Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis

  • School of Computer Science and Technology, Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Modeling intra-modal and cross-modal interactions poses significant challenges in multimodal sentiment analysis. Currently, graph-based methods like HGraph-CL achieve promising performance, which rely on two different levels of graph contrastive learning within and between modalities to explore sentiment correlations. However, HGraph-CL still faces the following drawbacks in graph construction: 1) nodes of the graph are represented at the frame level, only containing low-level information, neglecting the correlations among high-level semantics; 2) edges of the graph are based on the fixed dependency relations between words in the text sequence and the adjacent relations between frame-level nodes in the non-verbal sequences, failing to effectively capture implicit and long-distance correlations. To this end, this letter introduces capsule networks to construct high-level semantic nodes in a graph, uncovering deep sentimental structures. Furthermore, the learnable adjacency matrices are employed to construct edges of graph, thus adaptively learning the relations between nodes. Experimental results on several benchmark datasets for multimodal sentiment analysis demonstrate the effectiveness of the proposed method.

Original languageEnglish
Pages (from-to)561-565
Number of pages5
JournalIEEE Signal Processing Letters
Volume31
DOIs
StatePublished - 2024
Externally publishedYes

Keywords

  • Multimodal sentiment analysis
  • capsule networks
  • cross-modal interactions
  • graph networks
  • high-level semantic correlations

Fingerprint

Dive into the research topics of 'Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis'. Together they form a unique fingerprint.

Cite this