Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning

Learning-based hashing is a leading approach of approximate nearest neighbor search for large-scale image retrieval. In this paper, we develop a deep supervised hashing method for multi-label image retrieval, in which we propose to learn a binary "mask" map that can identify the approximat...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 27(2018), 9 vom: 21. Sept., Seite 4490-4502
1. Verfasser: Huang, Chang-Qin (VerfasserIn)
Weitere Verfasser: Yang, Shang-Ming, Pan, Yan, Lai, Han-Jiang
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2018
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article
LEADER 01000naa a22002652 4500
001 NLM285422367
003 DE-627
005 20231225045418.0
007 cr uuu---uuuuu
008 231225s2018 xx |||||o 00| ||eng c
024 7 |a 10.1109/TIP.2018.2839522  |2 doi 
028 5 2 |a pubmed24n0951.xml 
035 |a (DE-627)NLM285422367 
035 |a (NLM)29897874 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Huang, Chang-Qin  |e verfasserin  |4 aut 
245 1 0 |a Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning 
264 1 |c 2018 
336 |a Text  |b txt  |2 rdacontent 
337 |a ƒaComputermedien  |b c  |2 rdamedia 
338 |a ƒa Online-Ressource  |b cr  |2 rdacarrier 
500 |a Date Completed 30.07.2018 
500 |a Date Revised 30.07.2018 
500 |a published: Print 
500 |a Citation Status PubMed-not-MEDLINE 
520 |a Learning-based hashing is a leading approach of approximate nearest neighbor search for large-scale image retrieval. In this paper, we develop a deep supervised hashing method for multi-label image retrieval, in which we propose to learn a binary "mask" map that can identify the approximate locations of objects in an image, so that we use this binary "mask" map to obtain length-limited hash codes which mainly focus on an image's objects but ignore the background. The proposed deep architecture consists of four parts: 1) a convolutional sub-network to generate effective image features; 2) a binary "mask" sub-network to identify image objects' approximate locations; 3) a weighted average pooling operation based on the binary "mask" to obtain feature representations and hash codes that pay most attention to foreground objects but ignore the background; and 4) the combination of a triplet ranking loss designed to preserve relative similarities among images and a cross entropy loss defined on image labels. We conduct comprehensive evaluations on four multi-label image data sets. The results indicate that the proposed hashing method achieves superior performance gains over the state-of-the-art supervised or unsupervised hashing baselines 
650 4 |a Journal Article 
700 1 |a Yang, Shang-Ming  |e verfasserin  |4 aut 
700 1 |a Pan, Yan  |e verfasserin  |4 aut 
700 1 |a Lai, Han-Jiang  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society  |d 1992  |g 27(2018), 9 vom: 21. Sept., Seite 4490-4502  |w (DE-627)NLM09821456X  |x 1941-0042  |7 nnns 
773 1 8 |g volume:27  |g year:2018  |g number:9  |g day:21  |g month:09  |g pages:4490-4502 
856 4 0 |u http://dx.doi.org/10.1109/TIP.2018.2839522  |3 Volltext 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |d 27  |j 2018  |e 9  |b 21  |c 09  |h 4490-4502