Complete 3D Relationships Extraction Modality Alignment Network for 3D Dense Captioning
3D dense captioning aims to semantically describe each object detected in a 3D scene, which plays a significant role in 3D scene understanding. Previous works lack a complete definition of 3D spatial relationships and directly integrate visual and language modalities, thus ignoring the discrepan...
Bibliographic Details
Published in: | IEEE transactions on visualization and computer graphics. - 1996. - 30(2024), 8, 1 July, pages 4867-4880
|
Main author: |
Mao, Aihua
(Author) |
Other authors: |
Yang, Zhi,
Chen, Wanxin,
Yi, Ran,
Liu, Yong-Jin |
Format: | Online article
|
Language: | English |
Published: |
2024
|
Access to the parent work: | IEEE transactions on visualization and computer graphics
|
Subjects: | Journal Article |