Divide and Conquer : Improving Multi-Camera 3D Perception With 2D Semantic-Depth Priors and Input-Dependent Queries

3D perception tasks, such as 3D object detection and Bird's-Eye-View (BEV) segmentation using multi-camera images, have drawn significant attention recently. Despite the fact that accurately estimating both semantic and 3D scene layouts are crucial for this task, existing techniques often negle...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024) vom: 18., Seite 897-909
Auteur principal: Song, Qi (Auteur)
Autres auteurs: Hu, Qingyong, Zhang, Chi, Chen, Yongquan, Huang, Rui
Format: Article en ligne
Langue:English
Publié: 2024
Accès à la collection:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Sujets:Journal Article