PonderV2 : Improved 3D Representation with A Universal Pre-training Paradigm

In contrast to numerous NLP and 2D vision foundational models, training a 3D foundational model poses considerably greater challenges. This is primarily due to the inherent data variability and diversity of downstream tasks. In this paper, we introduce a novel universal 3D pre-training framework des...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 18. Apr.
1. Verfasser: Zhu, Haoyi (VerfasserIn)
Weitere Verfasser: Yang, Honghui, Wu, Xiaoyang, Huang, Di, Zhang, Sha, He, Xianglong, Zhao, Hengshuang, Shen, Chunhua, Qiao, Yu, He, Tong, Ouyang, Wanli
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2025
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article