RoMo : Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels
The rise of the metaverse and the increasing volume of heterogeneous 2D and 3D data have created a growing demand for cross-modal retrieval, enabling users to query semantically relevant data across different modalities. Existing methods heavily rely on class labels to bridge semantic correlations;...
Bibliographic Details
Published in: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024), dated: 01., pages 5086-5097
Main author: Li, Yongxiang (author)
Other authors: Qin, Yang; Sun, Yuan; Peng, Dezhong; Peng, Xi; Hu, Peng
Format: Online article
Language: English
Published: 2024
Access to parent work: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Subjects: Journal Article