Real-time Semantic Segmentation on Resource-Constrained Robotic Applications

This thesis tackles the challenge of implementing semantic segmentation, a data-intensive and computationally demanding process, in resource-constrained scenarios such as edge devices. The study centers on a dataset collected by Stiga from a lawn mower robot operating in a garden environment. The unique challenge for algorithms lies in the varying outdoor lighting and weather conditions, particularly due to the scarcity of data related to extreme weather conditions, which leads to the model’s performance degradation. The primary objective of this thesis is to achieve a target of 10 fps while maintaining satisfactory performance. The architectures of U-Net and DeepLabV3 were adapted, with the computationally intensive encoder components replaced by efficient blocks from MobileNetV2 and MobileNetV3. These blocks, pre-trained on ImageNet, can generate a hierarchy of high-level features that the decoder uses to produce an accurate segmentation map. The challenge posed by varying weather conditions is mitigated to some extent through data augmentation. To validate the effectiveness of the models designed in this study, five additional datasets, similar to the garden environment, were employed. The results reveal that training the models on both the encoder and decoder components led to enhanced performance. Specifically, the DeepLabV3 model with a MobileNetV2, encoder of depth 4 achieved superior performance, with a MeanIOU of 95.5% and a frame rate of 15.84 fps on the Stiga dataset. Furthermore, these models also demonstrated superior performance when applied to other datasets similar to the Stiga dataset.

Real-time Semantic Segmentation on Resource-Constrained Robotic Applications

DASARAJU, ABHISHEK VARMA

2023/2024

Abstract

This thesis tackles the challenge of implementing semantic segmentation, a data-intensive and computationally demanding process, in resource-constrained scenarios such as edge devices. The study centers on a dataset collected by Stiga from a lawn mower robot operating in a garden environment. The unique challenge for algorithms lies in the varying outdoor lighting and weather conditions, particularly due to the scarcity of data related to extreme weather conditions, which leads to the model’s performance degradation. The primary objective of this thesis is to achieve a target of 10 fps while maintaining satisfactory performance. The architectures of U-Net and DeepLabV3 were adapted, with the computationally intensive encoder components replaced by efficient blocks from MobileNetV2 and MobileNetV3. These blocks, pre-trained on ImageNet, can generate a hierarchy of high-level features that the decoder uses to produce an accurate segmentation map. The challenge posed by varying weather conditions is mitigated to some extent through data augmentation. To validate the effectiveness of the models designed in this study, five additional datasets, similar to the garden environment, were employed. The results reveal that training the models on both the encoder and decoder components led to enhanced performance. Specifically, the DeepLabV3 model with a MobileNetV2, encoder of depth 4 achieved superior performance, with a MeanIOU of 95.5% and a frame rate of 15.84 fps on the Stiga dataset. Furthermore, these models also demonstrated superior performance when applied to other datasets similar to the Stiga dataset.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
			Dipartimento di Matematica "Tullio Levi-Civita" - DM
		
	Corso di studio
	
			DATA SCIENCE Laurea Magistrale (D.M. 270/2004)
		
	Anno Accademico
	
			2023
		
	Titolo inglese
	
			Real-time Semantic Segmentation on Resource-Constrained Robotic Applications
		
	Parola chiave
	
			Computer Vision
Image Segmentation
Robotic Applications
AI
		
	Relatore
	
			BALLAN, LAMBERTO
		
	Appare nelle tipologie:
	
			Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Dasaraju_AbhishekVarma.pdf accesso riservato Dimensione 4.46 MB Formato Adobe PDF	4.46 MB	Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/62025