Cross-Modal Subspace Learning via Pairwise Constraints

In multimedia applications, the text and image components in a web document form a pairwise constraint that potentially indicates the same semantic concept. This paper studies cross-modal learning via the pairwise constraint and aims to find the common structure hidden in different modalities. We fi...

Bibliographic Details
Published in:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 24(2015), no. 12, 26 Dec., pages 5543-56
Main author:	He, Ran (author)
Other authors:	Zhang, Man, Wang, Liang, Ji, Ye, Yin, Qiyue
Format:	Online article
Language:	English
Published:	2015
Access to the parent work:	IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Subjects:	Journal Article; Research Support, Non-U.S. Gov't


LEADER	01000naa a22002652 4500
001	NLM251697053
003	DE-627
005	20231224162549.0
007	cr uuu---uuuuu
008	231224s2015 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1109/TIP.2015.2466106 \|2 doi
028	5	2	\|a pubmed24n0839.xml
035			\|a (DE-627)NLM251697053
035			\|a (NLM)26259218
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a He, Ran \|e verfasserin \|4 aut
245	1	0	\|a Cross-Modal Subspace Learning via Pairwise Constraints
264		1	\|c 2015
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 03.02.2016
500			\|a Date Revised 27.01.2016
500			\|a published: Print-Electronic
500			\|a Citation Status PubMed-not-MEDLINE
520			\|a In multimedia applications, the text and image components in a web document form a pairwise constraint that potentially indicates the same semantic concept. This paper studies cross-modal learning via the pairwise constraint and aims to find the common structure hidden in different modalities. We first propose a compound regularization framework to address the pairwise constraint, which can be used as a general platform for developing cross-modal algorithms. For unsupervised learning, we propose a multi-modal subspace clustering method to learn a common structure for different modalities. For supervised learning, to reduce the semantic gap and the outliers in pairwise constraints, we propose a cross-modal matching method based on compound ℓ21 regularization. Extensive experiments demonstrate the benefits of joint text and image modeling with semantically induced pairwise constraints, and they show that the proposed cross-modal methods can further reduce the semantic gap between different modalities and improve the clustering/matching accuracy
650		4	\|a Journal Article
650		4	\|a Research Support, Non-U.S. Gov't
700	1		\|a Zhang, Man \|e verfasserin \|4 aut
700	1		\|a Wang, Liang \|e verfasserin \|4 aut
700	1		\|a Ji, Ye \|e verfasserin \|4 aut
700	1		\|a Yin, Qiyue \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t IEEE transactions on image processing : a publication of the IEEE Signal Processing Society \|d 1992 \|g 24(2015), 12 vom: 26. Dez., Seite 5543-56 \|w (DE-627)NLM09821456X \|x 1941-0042 \|7 nnns
773	1	8	\|g volume:24 \|g year:2015 \|g number:12 \|g day:26 \|g month:12 \|g pages:5543-56
856	4	0	\|u http://dx.doi.org/10.1109/TIP.2015.2466106 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a SYSFLAG_A
912			\|a GBV_NLM
912			\|a GBV_ILN_350
951			\|a AR
952			\|d 24 \|j 2015 \|e 12 \|b 26 \|c 12 \|h 5543-56