Learning Reliable Gradients From Undersampled Circular Light Field for 3D Reconstruction

The paper presents a 3D reconstruction algorithm from an undersampled circular light ﬁeld (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be ca...

Description complète

Détails bibliographiques
Publié dans:	IEEE transactions on visualization and computer graphics. - 1996. - 29(2023), 12 vom: 26. Dez., Seite 5194-5207
Auteur principal:	Song, Zhengxi (Auteur)
Autres auteurs:	Wang, Xue, Zhu, Hao, Zhou, Guoqing, Wang, Qing
Format:	Article en ligne
Langue:	English
Publié:	2023
Accès à la collection:	IEEE transactions on visualization and computer graphics
Sujets:	Journal Article

Description
Résumé:	The paper presents a 3D reconstruction algorithm from an undersampled circular light ﬁeld (LF). With an ultra-dense angular sampling rate, every scene point captured by a circular LF corresponds to a smooth trajectory in the circular epipolar plane volume (CEPV). Thus per-pixel disparities can be calculated by retrieving the local gradients of the CEPV-trajectories. However, the continuous curve will be broken up into discrete segments in an undersampled circular LF, which leads to a noticeable deterioration of the 3D reconstruction accuracy. We observe that the coherent structure is still embedded in the discrete segments. With less noise and ambiguity, the scene points can be reconstructed using gradients from reliable epipolar plane image (EPI) regions. By analyzing the geometric characteristics of the coherent structure in the CEPV, both the trajectory itself and its gradients could be modeled as 3D predictable series. Thus a mask-guided CNN+LSTM network is proposed to learn the mapping from the CEPV with a lower angular sampling rate to the gradients under a higher angular sampling rate. To segment the reliable regions, the reliable-mask-based loss that assesses the difference between learned gradients and ground truth gradients is added to the loss function. We construct a synthetic circular LF dataset with ground truth for depth and foreground/background segmentation to train the network. Moreover, a real-scene circular LF dataset is collected for performance evaluation. Experimental results on both public and self-constructed datasets demonstrate the superiority of the proposed method over existing state-of-the-art methods
Description:	Date Revised 22.11.2023 published: Print-Electronic Citation Status PubMed-not-MEDLINE
ISSN:	1941-0506
DOI:	10.1109/TVCG.2022.3206207