Data segmentation represents a challenging task in modern machine learning. In many real-world applications, data are naturally structured as graphs and labeled data are often expensive to obtain, motivating the use of graph-based semi-supervised learning techniques. Existing methods inspired by fluid dynamics and materials science minimize a graph-based Ginzburg–Landau functional, but require expensive graph Laplacian eigendecompositions and repeated projections onto the unit simplex, making them impractical for very large datasets. A recent work proposed a new penalty-based reformulation, combined with an efficient eigendecomposition and projection-free optimization scheme, based on a tailored variant of the classic Frank-Wolfe method. In this thesis we propose block-coordinate variants of the Frank-Wolfe scheme to reduce the computational burden, while achieving a similar convergence rate. Additionally, we analyze an inference-based approach to the problem. We test the proposed methods on artificial and real-world networks, showing their effectiveness.

Projection-free methods for data segmentation problems

SALVETTI, LAURA
2025/2026

Abstract

Data segmentation represents a challenging task in modern machine learning. In many real-world applications, data are naturally structured as graphs and labeled data are often expensive to obtain, motivating the use of graph-based semi-supervised learning techniques. Existing methods inspired by fluid dynamics and materials science minimize a graph-based Ginzburg–Landau functional, but require expensive graph Laplacian eigendecompositions and repeated projections onto the unit simplex, making them impractical for very large datasets. A recent work proposed a new penalty-based reformulation, combined with an efficient eigendecomposition and projection-free optimization scheme, based on a tailored variant of the classic Frank-Wolfe method. In this thesis we propose block-coordinate variants of the Frank-Wolfe scheme to reduce the computational burden, while achieving a similar convergence rate. Additionally, we analyze an inference-based approach to the problem. We test the proposed methods on artificial and real-world networks, showing their effectiveness.
2025
Projection-free methods for data segmentation problems
data segmentation
projection-free
Frank-Wolfe
semi supervised
File in questo prodotto:
File Dimensione Formato  
tesi_salvetti.pdf

accesso aperto

Dimensione 1.16 MB
Formato Adobe PDF
1.16 MB Adobe PDF Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/108129