MsSVT++ : Mixed-Scale Sparse Voxel Transformer With Center Voting for 3D Object Detection

Accurate 3D object detection in large-scale outdoor scenes, characterized by considerable variations in object scales, necessitates features rich in both long-range and fine-grained information. While recent detectors have utilized window-based transformers to model long-range dependencies, they ten...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), 5 vom: 03. Apr., Seite 3736-3752
1. Verfasser: Li, Jianan (VerfasserIn)
Weitere Verfasser: Dong, Shaocong, Ding, Lihe, Xu, Tingfa
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article