Skip to main navigation Skip to search Skip to main content

Supervised Distributed Hashing for Large-Scale Multimedia Retrieval

  • School of Computer Science and Technology, Harbin Institute of Technology
  • Tsinghua University
  • National Institute of Informatics
  • Peking University

Research output: Contribution to journalArticlepeer-review

Abstract

Recent years have witnessed the growing popularity of hashing for large-scale multimedia retrieval. Extensive hashing methods have been designed for data stored in a single machine, that is, centralized hashing. In many real-world applications, however, the large-scale data are often distributed across different locations, servers, or sites. Although hashing for distributed data can be implemented by assembling all distributed data together as a whole dataset in theory, it usually leads to prohibitive computation, communication, and storage costs in practice. Up to now, only a few methods were tailored for distributed hashing, which are all unsupervised approaches. In this paper, we propose an efficient and effective method called supervised distributed hashing (SupDisH), which learns discriminative hash functions by leveraging the semantic label information in a distributed manner. Specifically, we cast the distributed hashing problem into the framework of classification, where the learned binary codes are expected to be distinct enough for semantic retrieval. By introducing auxiliary variables, the distributed model is then separated into a set of decentralized subproblems with consistency constraints, which can be solved in parallel on each vertex of the distributed network. As such, we can obtain high-quality distinctive unbiased binary codes and consistent hash functions with low computational complexity, which facilitate tackling large-scale multimedia retrieval tasks involving distributed datasets. Experimental evaluations on three large-scale datasets show that SupDisH is competitive to centralized hashing methods and outperforms the state-of-The-Art unsupervised distributed method significantly.

Original languageEnglish
Pages (from-to)675-686
Number of pages12
JournalIEEE Transactions on Multimedia
Volume20
Issue number3
DOIs
StatePublished - Mar 2018
Externally publishedYes

Keywords

  • Hash function learning
  • large-scale distributed data
  • multimedia retrieval
  • supervised distributed hashing

Fingerprint

Dive into the research topics of 'Supervised Distributed Hashing for Large-Scale Multimedia Retrieval'. Together they form a unique fingerprint.

Cite this