Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning

Learning-based hashing is a leading approach of approximate nearest neighbor search for large-scale image retrieval. In this paper, we develop a deep supervised hashing method for multi-label image retrieval, in which we propose to learn a binary "mask" map that can identify the approximat...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 27(2018), 9 vom: 21. Sept., Seite 4490-4502
1. Verfasser:	Huang, Chang-Qin (VerfasserIn)
Weitere Verfasser:	Yang, Shang-Ming, Pan, Yan, Lai, Han-Jiang
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2018
Zugriff auf das übergeordnete Werk:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:	Journal Article


LEADER	01000naa a22002652 4500
001	NLM285422367
003	DE-627
005	20231225045418.0
007	cr uuu---uuuuu
008	231225s2018 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TIP.2018.2839522 \|2 doi
028	5	2	\|a pubmed24n0951.xml
035			\|a (DE-627)NLM285422367
035			\|a (NLM)29897874
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Huang, Chang-Qin \|e verfasserin \|4 aut
245	1	0	\|a Object-Location-Aware Hashing for Multi-Label Image Retrieval via Automatic Mask Learning
264		1	\|c 2018
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 30.07.2018
500			\|a Date Revised 30.07.2018
500			\|a published: Print
500			\|a Citation Status PubMed-not-MEDLINE
520			\|a Learning-based hashing is a leading approach of approximate nearest neighbor search for large-scale image retrieval. In this paper, we develop a deep supervised hashing method for multi-label image retrieval, in which we propose to learn a binary "mask" map that can identify the approximate locations of objects in an image, so that we use this binary "mask" map to obtain length-limited hash codes which mainly focus on an image's objects but ignore the background. The proposed deep architecture consists of four parts: 1) a convolutional sub-network to generate effective image features; 2) a binary "mask" sub-network to identify image objects' approximate locations; 3) a weighted average pooling operation based on the binary "mask" to obtain feature representations and hash codes that pay most attention to foreground objects but ignore the background; and 4) the combination of a triplet ranking loss designed to preserve relative similarities among images and a cross entropy loss defined on image labels. We conduct comprehensive evaluations on four multi-label image data sets. The results indicate that the proposed hashing method achieves superior performance gains over the state-of-the-art supervised or unsupervised hashing baselines
650		4	\|a Journal Article
700	1		\|a Yang, Shang-Ming \|e verfasserin \|4 aut
700	1		\|a Pan, Yan \|e verfasserin \|4 aut
700	1		\|a Lai, Han-Jiang \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society \|d 1992 \|g 27(2018), 9 vom: 21. Sept., Seite 4490-4502 \|w (DE-627)NLM09821456X \|x 1941-0042 \|7 nnns
773	1	8	\|g volume:27 \|g year:2018 \|g number:9 \|g day:21 \|g month:09 \|g pages:4490-4502
856	4	0	\|u http://dx.doi.org/10.1109/TIP.2018.2839522 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 27 \|j 2018 \|e 9 \|b 21 \|c 09 \|h 4490-4502