In recent years, semantic segmentation has emerged as a critical task in computer vision, vital for applications ranging from autonomous driving to medical image analysis. This task involves labeling each pixel in an image with semantic categories such as vehicles, pedestrians, and road signs. However, training deep learning models for semantic segmentation requires substantial amounts of labeled data, which is expensive and time-consuming to acquire. To address this challenge, data augmentation techniques are applied to artificially expand the training set and enhance model performance. Recent studies have highlighted the efficacy of augmenting data in feature space, where extracted features better capture semantic variations compared to raw input data. This approach leads to more realistic augmented data, thereby improving model robustness and generalization. Moreover, GAN-based methods have emerged as promising tools for data augmentation by generating synthetic data that closely resemble real-world data. This thesis investigates GAN-based methods for augmenting data in the feature space for semantic segmentation tasks. The proposed methodology involves extracting features using the encoder of a segmentation model, generating augmented features similar to real features using a GAN or similar generative model, and subsequently training the segmentation model decoder using both real and generated features. Experimental evaluations focus on assessing the effectiveness of this approach in enhancing the accuracy and generalization capabilities of DeepLabV3 segmentation model across Pascal VOC dataset.

In recent years, semantic segmentation has emerged as a critical task in computer vision, vital for applications ranging from autonomous driving to medical image analysis. This task involves labeling each pixel in an image with semantic categories such as vehicles, pedestrians, and road signs. However, training deep learning models for semantic segmentation requires substantial amounts of labeled data, which is expensive and time-consuming to acquire. To address this challenge, data augmentation techniques are applied to artificially expand the training set and enhance model performance. Recent studies have highlighted the efficacy of augmenting data in feature space, where extracted features better capture semantic variations compared to raw input data. This approach leads to more realistic augmented data, thereby improving model robustness and generalization. Moreover, GAN-based methods have emerged as promising tools for data augmentation by generating synthetic data that closely resemble real-world data. This thesis investigates GAN-based methods for augmenting data in the feature space for semantic segmentation tasks. The proposed methodology involves extracting features using the encoder of a segmentation model, generating augmented features similar to real features using a GAN or similar generative model, and subsequently training the segmentation model decoder using both real and generated features. Experimental evaluations focus on assessing the effectiveness of this approach in enhancing the accuracy and generalization capabilities of DeepLabV3 segmentation model across Pascal VOC dataset.

Data Augmentation through Generative Models

TOFFANIN, MATTIA
2023/2024

Abstract

In recent years, semantic segmentation has emerged as a critical task in computer vision, vital for applications ranging from autonomous driving to medical image analysis. This task involves labeling each pixel in an image with semantic categories such as vehicles, pedestrians, and road signs. However, training deep learning models for semantic segmentation requires substantial amounts of labeled data, which is expensive and time-consuming to acquire. To address this challenge, data augmentation techniques are applied to artificially expand the training set and enhance model performance. Recent studies have highlighted the efficacy of augmenting data in feature space, where extracted features better capture semantic variations compared to raw input data. This approach leads to more realistic augmented data, thereby improving model robustness and generalization. Moreover, GAN-based methods have emerged as promising tools for data augmentation by generating synthetic data that closely resemble real-world data. This thesis investigates GAN-based methods for augmenting data in the feature space for semantic segmentation tasks. The proposed methodology involves extracting features using the encoder of a segmentation model, generating augmented features similar to real features using a GAN or similar generative model, and subsequently training the segmentation model decoder using both real and generated features. Experimental evaluations focus on assessing the effectiveness of this approach in enhancing the accuracy and generalization capabilities of DeepLabV3 segmentation model across Pascal VOC dataset.
2023
Data Augmentation through Generative Models
In recent years, semantic segmentation has emerged as a critical task in computer vision, vital for applications ranging from autonomous driving to medical image analysis. This task involves labeling each pixel in an image with semantic categories such as vehicles, pedestrians, and road signs. However, training deep learning models for semantic segmentation requires substantial amounts of labeled data, which is expensive and time-consuming to acquire. To address this challenge, data augmentation techniques are applied to artificially expand the training set and enhance model performance. Recent studies have highlighted the efficacy of augmenting data in feature space, where extracted features better capture semantic variations compared to raw input data. This approach leads to more realistic augmented data, thereby improving model robustness and generalization. Moreover, GAN-based methods have emerged as promising tools for data augmentation by generating synthetic data that closely resemble real-world data. This thesis investigates GAN-based methods for augmenting data in the feature space for semantic segmentation tasks. The proposed methodology involves extracting features using the encoder of a segmentation model, generating augmented features similar to real features using a GAN or similar generative model, and subsequently training the segmentation model decoder using both real and generated features. Experimental evaluations focus on assessing the effectiveness of this approach in enhancing the accuracy and generalization capabilities of DeepLabV3 segmentation model across Pascal VOC dataset.
Data Augmentation
Generative Models
Image Segmentation
File in questo prodotto:
File Dimensione Formato  
Toffanin_Mattia.pdf

accesso aperto

Dimensione 4.19 MB
Formato Adobe PDF
4.19 MB Adobe PDF Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/76996