Disentangled Cross-Modal Transformer for RGB-D Salient Object Detection and Beyond

Previous multi-modal transformers for RGB-D salient object detection (SOD) generally directly connect all patches from two modalities to model cross-modal correlation and perform multi-modal combination without differentiation, which can lead to confusing and inefficient fusion. Instead, we disentan...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024) vom: 14., Seite 1699-1709
Auteur principal: Chen, Hao (Auteur)
Autres auteurs: Shen, Feihong, Ding, Ding, Deng, Yongjian, Li, Chao
Format: Article en ligne
Langue:English
Publié: 2024
Accès à la collection:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Sujets:Journal Article