Collective Reconstructive Embeddings for Cross-modal Hashing

In this paper, we study the problem of cross-modal retrieval by hashing-based approximate nearest neighbor (ANN) search techniques. Most existing cross-modal hashing work mainly addresses the issue of multi-modal integration complexity using the same mapping and similarity calculation for data from...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - (2018) vom: 28. Dez.
1. Verfasser:	Hu, Mengqiu (VerfasserIn)
Weitere Verfasser:	Yang, Yang, Shen, Fumin, Xie, Ning, Hong, Richang, Shen, Heng Tao
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2018
Zugriff auf das übergeordnete Werk:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:	Journal Article


LEADER	01000caa a22002652 4500
001	NLM292311524
003	DE-627
005	20240229162108.0
007	cr uuu---uuuuu
008	231225s2018 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TIP.2018.2890144 \|2 doi
028	5	2	\|a pubmed24n1308.xml
035			\|a (DE-627)NLM292311524
035			\|a (NLM)30602421
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Hu, Mengqiu \|e verfasserin \|4 aut
245	1	0	\|a Collective Reconstructive Embeddings for Cross-modal Hashing
264		1	\|c 2018
336			\|a Text \|b txt \|2 rdacontent
337			\|a ƒaComputermedien \|b c \|2 rdamedia
338			\|a ƒa Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Revised 27.02.2024
500			\|a published: Print-Electronic
500			\|a Citation Status Publisher
520			\|a In this paper, we study the problem of cross-modal retrieval by hashing-based approximate nearest neighbor (ANN) search techniques. Most existing cross-modal hashing work mainly addresses the issue of multi-modal integration complexity using the same mapping and similarity calculation for data from different media types. Nonetheless, this may cause information loss during the mapping process due to overlooking the specifics of each individual modality. In this work, we propose a simple yet effective cross-modal hashing approach, termed Collective Reconstructive Embeddings (CRE), which can simultaneously solve the heterogeneity and integration complexity of multi-modal data. To address the heterogeneity challenge, we propose to process heterogeneous types of data using different modalityspecific models. Specifically, we model textual data with cosine similarity based reconstructive embedding to alleviate the data sparsity to the greatest extent, while for image data we utilize the Euclidean distance to characterize the relationships of the projected hash codes. Meanwhile, we unify the projections of text and image to the Hamming space into a common reconstructive embedding through rigid mathematical reformulation, which not only reduces the optimization complexity significantly but also facilitates the inter-modal similarity preservation among different modalities. We further incorporate the code balance and uncorrelation criteria into the problem, and devise an efficient iterative algorithm for optimization. Comprehensive experiments on four widely-used multimodal benchmarks show that the proposed CRE can achieve superior performance compared to the state-of-the-arts on several challenging cross-modal tasks
650		4	\|a Journal Article
700	1		\|a Yang, Yang \|e verfasserin \|4 aut
700	1		\|a Shen, Fumin \|e verfasserin \|4 aut
700	1		\|a Xie, Ning \|e verfasserin \|4 aut
700	1		\|a Hong, Richang \|e verfasserin \|4 aut
700	1		\|a Shen, Heng Tao \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society \|d 1992 \|g (2018) vom: 28. Dez. \|w (DE-627)NLM09821456X \|x 1941-0042 \|7 nnns
773	1	8	\|g year:2018 \|g day:28 \|g month:12
856	4	0	\|u http://dx.doi.org/10.1109/TIP.2018.2890144 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|j 2018 \|b 28 \|c 12