Abstract
Multimedia rumor content has been widely disseminated with the rise of generative technologies. Existing rumor detection approaches typically focus independently on multimodal data (such as text and images) or social structure analysis, and only a few researchers have attempted to integrate all three modalities for comprehensive rumor detection. Due to the complexity of the relationships between these heterogeneous data, combining them effectively remains a challenge. In this work, we present a novel Graph-Enhanced Multimodal Contrastive Learning (GMCL) to integrate textual, visual, and social graph features more efficiently for rumor detection. We utilize semantic correlation to assist cross-modal contrastive learning to capture fine-grained alignment between text and image and enhance node representations through graph contrastive learning without relying on negative samples. By aligning and integrating these different representations, our method can detect rumors more accurately. Extensive experimental results show that our model outperforms current state-of-the-art methods in multimodal rumor detection.
| Original language | English |
|---|---|
| Journal | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
| DOIs | |
| State | Published - 2025 |
| Externally published | Yes |
| Event | 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, India Duration: 6 Apr 2025 → 11 Apr 2025 |
Keywords
- Rumor detection
- alignment
- contrastive learning
- multimodal
Fingerprint
Dive into the research topics of 'GMCL: Graph-Enhanced Multimodal Contrastive Learning for Rumor Detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver