Skip to main navigation Skip to search Skip to main content

Semantic decomposition and enhancement hashing for deep cross-modal retrieval

  • Lunke Fei
  • , Zhihao He
  • , Wai Keung Wong*
  • , Qi Zhu
  • , Shuping Zhao
  • , Jie Wen
  • *Corresponding author for this work
  • Guangdong University of Technology
  • Hong Kong Polytechnic University
  • Nanjing University of Aeronautics and Astronautics
  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Deep hashing has garnered considerable interest and has shown impressive performance in the domain of retrieval. However, the majority of the current hashing techniques rely solely on binary similarity evaluation criteria to assess the semantic relationships between multi-label instances, which presents a challenge in overcoming the feature gap across various modalities. In this paper, we propose semantic decomposition and enhancement hashing (SDEH) by extensively exploring the multi-label semantic information shared by different modalities for cross-modal retrieval. Specifically, we first introduce two independent attention-based feature learning subnetworks to capture the modality-specific features with both global and local details. Subsequently, we exploit the semantic features from multi-label vectors by decomposing the shared semantic information among multi-modal features such that the associations of different modalities can be established. Finally, we jointly learn the common hash code representations of multimodal information under the guidelines of quadruple losses, making the hash codes informative while simultaneously preserving multilevel semantic relationships and feature distribution consistency. Comprehensive experiments on four commonly used multimodal datasets offer strong support for the exceptional effectiveness of our proposed SDEH.

Original languageEnglish
Article number111225
JournalPattern Recognition
Volume160
DOIs
StatePublished - Apr 2025
Externally publishedYes

Keywords

  • Cross-modal retrieval
  • Deep cross-modal hashing
  • Multi-label semantic learning
  • Semantic decomposition

Fingerprint

Dive into the research topics of 'Semantic decomposition and enhancement hashing for deep cross-modal retrieval'. Together they form a unique fingerprint.

Cite this