Implementing a Computer Vision system able to recognize outdoor scenarios presents inherent difficulties related to the very high variability of environmental factors that make the task challenging and even poorly explored. The use case presented in this thesis is intended to focus on these types of scenarios, even more specifically with those that can be visited by lawn mowing robots, to compare and analyze the effectiveness and computational cost of various Computer Vision models specialized in the task of instance segmentation. In order to do this, both Convolutional Neural Network (CNN) and Vision Transformer (ViT) will be tested on multiple datasets representing outdoor environments with a series of experiments designed to test the capabilities and the limitations of these models.
Implementing a Computer Vision system able to recognize outdoor scenarios presents inherent difficulties related to the very high variability of environmental factors that make the task challenging and even poorly explored. The use case presented in this thesis is intended to focus on these types of scenarios, even more specifically with those that can be visited by lawn mowing robots, to compare and analyze the effectiveness and computational cost of various Computer Vision models specialized in the task of instance segmentation. In order to do this, both Convolutional Neural Network (CNN) and Vision Transformer (ViT) will be tested on multiple datasets representing outdoor environments with a series of experiments designed to test the capabilities and the limitations of these models.
A study of the effectiveness and computational cost of computer vision models for outdoor segmentation
BARI, FRANCESCO
2022/2023
Abstract
Implementing a Computer Vision system able to recognize outdoor scenarios presents inherent difficulties related to the very high variability of environmental factors that make the task challenging and even poorly explored. The use case presented in this thesis is intended to focus on these types of scenarios, even more specifically with those that can be visited by lawn mowing robots, to compare and analyze the effectiveness and computational cost of various Computer Vision models specialized in the task of instance segmentation. In order to do this, both Convolutional Neural Network (CNN) and Vision Transformer (ViT) will be tested on multiple datasets representing outdoor environments with a series of experiments designed to test the capabilities and the limitations of these models.File | Dimensione | Formato | |
---|---|---|---|
Bari_Francesco.pdf
accesso riservato
Dimensione
18.43 MB
Formato
Adobe PDF
|
18.43 MB | Adobe PDF |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/52321