Cross-modal Guided Visual Representation Learning for Social Image Retrieval
Social images are often associated with rich but noisy tags from community contributions. Although social tags can potentially provide valuable semantic training information for image retrieval, existing studies all fail to effectively filter noises by exploiting the cross-modal correlation between...
Bibliographic details
Published in: IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2024), dated 16 Dec.
Main author: Guan, Ziyu (author)
Other authors: Zhao, Wanqing; Liu, Hongmin; Nakashima, Yuta; Babaguchi, Noboru; He, Xiaofei
Format: Online article
Language: English
Published: 2024
Access to parent work: IEEE transactions on pattern analysis and machine intelligence
Subjects: Journal Article