Multi-View Large Reconstruction Model via Geometry-Aware Positional Encoding and Attention

Despite recent advancements in the Large Reconstruction Model (LRM) demonstrating impressive results, when extending its input from single image to multiple images, it exhibits inefficiencies, subpar geometric and texture quality, as well as slower convergence speed than expected. It is attributed t...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on visualization and computer graphics. - 1996. - PP(2025) vom: 23. Mai
1. Verfasser: Li, Mengfei (VerfasserIn)
Weitere Verfasser: Long, Xiaoxiao, Liang, Yixun, Li, Weiyu, Liu, Yuan, Li, Peng, Luo, Wenhan, Wang, Wenping, Guo, Yike
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2025
Zugriff auf das übergeordnete Werk:IEEE transactions on visualization and computer graphics
Schlagworte:Journal Article