Skip to main navigation Skip to search Skip to main content

Deep-like hashing-in-hash for visual retrieval: An embarrassingly simple method

  • Lei ZHANG*
  • , Ji LIU
  • , Fuxiang HUANG
  • , Yang YANG
  • , David ZHANG*
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

Abstract

Existing hashing methods have yielded significant performance in image and multimedia retrieval, which can be categorized into two groups: shallow hashing and deep hashing. However, there still exist some intrinsic limitations among them. The former generally adopts a one-step strategy to learn the hashing codes for discovering the discriminative binary feature, but the latent discriminative information in the learned hashing codes is not well exploited. The latter, as deep neural network based hashing models, can learn highly discriminative and compact features, but relies on large-scale data and computation resources for numerous network parameters tuning with back-propagation optimization. Straightforward training of deep hashing models from scratch on small-scale data is almost impossible. Therefore, in order to develop efficient but effective learning to hash algorithm that depends only on small-scale data, we propose a novel non-neural network based deep-like learning framework, i.e. multi-level cascaded hashing (MCH) approach with hierarchical learning strategy, for image retrieval. The contributions are threefold. First, a hashing-in-hash architecture is designed in MCH, which inherits the excellent traits of traditional neural networks based deep learning, such that discriminative binary features that are beneficial to image retrieval can be effectively captured. Second, in each level the binary features of all preceding levels and the visual appearance feature are simultaneously cascaded as inputs of all subsequent levels to retrain, which fully exploits the implicated discriminative information. Third, a basic learning to hash (BLH) model with label constraint is proposed for hierarchical learning. Without loss of generality, the existing hashing models can be easily integrated into our MCH framework. We show experimentally on small- and large-scale visual retrieval tasks that our method outperforms several state-of-the-arts.
Original languageEnglish
Article number9153110
Pages (from-to)8149-8162
Number of pages14
JournalIEEE Transactions on Image Processing
Volume29
DOIs
Publication statusPublished - 31 Jul 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 1992-2012 IEEE.

Funding

This work was supported in part by the National Natural Science Fund of China under Grant 61771079, in part by the Chongqing Natural Science Fund under Grant cstc2018jcyjAX0250, in part by the Chongqing Youth Talent Program, Sichuan Science and Technology Program, China, under Grants 2020YFS0057, and in part by the Fundamental Research Funds for the Central Universities under Project ZYGX2019Z015.

Keywords

  • Cascaded hashing
  • hashing-in-hash architecture
  • image retrieval
  • multi-level learning

Fingerprint

Dive into the research topics of 'Deep-like hashing-in-hash for visual retrieval: An embarrassingly simple method'. Together they form a unique fingerprint.

Cite this