3D Reconstruction of Indoor Scenes: a Neural Radiance Field Approach Supervised by Depth Priors

The target of this thesis is the geometric reconstruction of indoor scenes. We exploit an approach based on Neural Radiance Fields (NeRF) and able to learn an implicit representation of the scene geometry, starting from RGB images only. In particular, the geometry is learnt employing the Signed Distance Function (SDF) which estimates the closest surface for every point in the volume enclosing the scene. Starting from NeuS, an already existing method, we implement some supervision strategies to help the networks during the training phase. One approach performs a depth supervision based on the scene point cloud estimated by COLMAP, a Structure-from-Motion algorithm already used for camera pose estimation. The point cloud, obtained from features matched and triangulated over many views, adds sparse geometrical constraints to the geometry learnt by the model, thus increasing the reconstruction accuracy of complex structures. Then, we propose a novel depth supervision based on depth maps, in order to focus the NeRF learning on areas close to the surface. The last improvement is a color compensation strategy, to handle images acquired with variable exposure and white balancing settings. This leads to a more stable convergence and it helps the geometry estimation as well. Overall, the resulting method can produce accurate and colorful reconstructions of indoor environments. We test our method on indoor scenes, showing the effects of our implementations. In addition, we investigate the importance of testing on scenes acquired explicitly for NeRF based reconstruction, discussing the most important requirements to meet in the case of custom dataset acquisitions.

3D Reconstruction of Indoor Scenes: a Neural Radiance Field Approach Supervised by Depth Priors

LINCETTO, FEDERICO

2021/2022

Abstract

The target of this thesis is the geometric reconstruction of indoor scenes. We exploit an approach based on Neural Radiance Fields (NeRF) and able to learn an implicit representation of the scene geometry, starting from RGB images only. In particular, the geometry is learnt employing the Signed Distance Function (SDF) which estimates the closest surface for every point in the volume enclosing the scene. Starting from NeuS, an already existing method, we implement some supervision strategies to help the networks during the training phase. One approach performs a depth supervision based on the scene point cloud estimated by COLMAP, a Structure-from-Motion algorithm already used for camera pose estimation. The point cloud, obtained from features matched and triangulated over many views, adds sparse geometrical constraints to the geometry learnt by the model, thus increasing the reconstruction accuracy of complex structures. Then, we propose a novel depth supervision based on depth maps, in order to focus the NeRF learning on areas close to the surface. The last improvement is a color compensation strategy, to handle images acquired with variable exposure and white balancing settings. This leads to a more stable convergence and it helps the geometry estimation as well. Overall, the resulting method can produce accurate and colorful reconstructions of indoor environments. We test our method on indoor scenes, showing the effects of our implementations. In addition, we investigate the importance of testing on scenes acquired explicitly for NeRF based reconstruction, discussing the most important requirements to meet in the case of custom dataset acquisitions.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Ingegneria dell'Informazione - DEI
			
	Corso di studio
	
				ICT FOR INTERNET AND MULTIMEDIA - INGEGNERIA PER LE COMUNICAZIONI MULTIMEDIALI E INTERNET Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2021
			
	Titolo inglese
	
				3D Reconstruction of Indoor Scenes: a Neural Radiance Field Approach Supervised by Depth Priors
			
	Abstract in italiano
	
				The target of this thesis is the geometric reconstruction of indoor scenes. We exploit an approach based on Neural Radiance Fields (NeRF) and able to learn an implicit representation of the scene geometry, starting from RGB images only. In particular, the geometry is learnt employing the Signed Distance Function (SDF) which estimates the closest surface for every point in the volume enclosing the scene.
Starting from NeuS, an already existing method, we implement some supervision strategies to help the networks during the training phase. One approach performs a depth supervision based on the scene point cloud estimated by COLMAP, a Structure-from-Motion algorithm already used for camera pose estimation. The point cloud, obtained from features matched and triangulated over many views, adds sparse geometrical constraints to the geometry learnt by the model, thus increasing the reconstruction accuracy of complex structures. Then, we propose a novel depth supervision based on depth maps, in order to focus the NeRF learning on areas close to the surface. The last improvement is a color compensation strategy, to handle images acquired with variable exposure and white balancing settings. This leads to a more stable convergence and it helps the geometry estimation as well. Overall, the resulting method can produce accurate and colorful reconstructions of indoor environments.
We test our method on indoor scenes, showing the effects of our implementations. In addition, we investigate the importance of testing on scenes acquired explicitly for NeRF based reconstruction, discussing the most important requirements to meet in the case of custom dataset acquisitions.
			
	Parola chiave
	
				3D Reconstruction
NeRF
Indoor Scenes
Deep Learning
Computer Vision
			
	Relatore
	
				ZANUTTIGH, PIETRO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Lincetto_Federico.pdf embargo fino al 04/12/2025 Dimensione 30.34 MB Formato Adobe PDF	30.34 MB	Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/40294