PonderV2 : Improved 3D Representation with A Universal Pre-training Paradigm
In contrast to numerous NLP and 2D vision foundational models, training a 3D foundational model poses considerably greater challenges. This is primarily due to the inherent data variability and diversity of downstream tasks. In this paper, we introduce a novel universal 3D pre-training framework des...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 18. Apr.
|
1. Verfasser: |
Zhu, Haoyi
(VerfasserIn) |
Weitere Verfasser: |
Yang, Honghui,
Wu, Xiaoyang,
Huang, Di,
Zhang, Sha,
He, Xianglong,
Zhao, Hengshuang,
Shen, Chunhua,
Qiao, Yu,
He, Tong,
Ouyang, Wanli |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2025
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article |