Disentangled Cross-Modal Transformer for RGB-D Salient Object Detection and Beyond
Previous multi-modal transformers for RGB-D salient object detection (SOD) generally directly connect all patches from two modalities to model cross-modal correlation and perform multi-modal combination without differentiation, which can lead to confusing and inefficient fusion. Instead, we disentan...
Publié dans: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024) vom: 14., Seite 1699-1709 |
---|---|
Auteur principal: | |
Autres auteurs: | , , , |
Format: | Article en ligne |
Langue: | English |
Publié: |
2024
|
Accès à la collection: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society |
Sujets: | Journal Article |
Accès en ligne |
Volltext |