This thesis addresses the challenge of enhancing downstream model performance in semantic segmentation tasks by tackling the prevalent issue of incomplete or sparse 3D data in real-world scenarios. Our work focuses on semantic completion and rendering of 3D scenes as a crucial preceding data augmentation step. The approach utilizes advanced machine learning techniques, particularly generative models and neural rendering, to synthesize missing scene information, thereby improving the data's representation. By providing enriched 3D representations, this methodology aims to increase the accuracy of downstream tasks, including object detection. We demonstrate the effectiveness of this strategy by showcasing various performance metrics and its potential to bridge the gap between ideal synthetic environments and noisy real-world sensor data.
Semantic Completion and Rendering of 3D Scenes to Improve Downstream Model Performance
CANEL, ALESSANDRO
2024/2025
Abstract
This thesis addresses the challenge of enhancing downstream model performance in semantic segmentation tasks by tackling the prevalent issue of incomplete or sparse 3D data in real-world scenarios. Our work focuses on semantic completion and rendering of 3D scenes as a crucial preceding data augmentation step. The approach utilizes advanced machine learning techniques, particularly generative models and neural rendering, to synthesize missing scene information, thereby improving the data's representation. By providing enriched 3D representations, this methodology aims to increase the accuracy of downstream tasks, including object detection. We demonstrate the effectiveness of this strategy by showcasing various performance metrics and its potential to bridge the gap between ideal synthetic environments and noisy real-world sensor data.| File | Dimensione | Formato | |
|---|---|---|---|
|
Tesi_Canel_Alessandro.pdf
accesso aperto
Dimensione
9.63 MB
Formato
Adobe PDF
|
9.63 MB | Adobe PDF | Visualizza/Apri |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/102083