Complete 3D Relationships Extraction Modality Alignment Network for 3D Dense Captioning

3D dense captioning aims to semantically describe each object detected in a 3D scene, which plays a significant role in 3D scene understanding. Previous works lack a complete definition of 3D spatial relationships and the directly integrate visual and language modalities, thus ignoring the discrepan...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on visualization and computer graphics. - 1996. - 30(2024), 8 vom: 01. Juli, Seite 4867-4880
1. Verfasser:	Mao, Aihua (VerfasserIn)
Weitere Verfasser:	Yang, Zhi, Chen, Wanxin, Yi, Ran, Liu, Yong-Jin
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2024
Zugriff auf das übergeordnete Werk:	IEEE transactions on visualization and computer graphics
Schlagworte:	Journal Article

Online verfügbar	Volltext