Self-supervised learning for colonoscopy: a study on polyp counting and model generalization

All endoscopy procedures are now guided by a video stream, which allows for a less invasive operation, reduced risks, and faster recovery. However, the endoscopy datasets available today represent only a very small fraction of all colonoscopies performed. For this reason, there has been a growing need to leverage all available endoscopy data, including unannotated data, which is relatively easier to collect and significantly cheaper than data manually annotated by expert endoscopists. The recent success of self-supervised pre-training strategies has not gone unnoticed in the medical computer vision community, which has quickly adopted them to address the problem of data and annotation scarcity. However, unlike self-supervised models on natural images—which can draw on hundreds of millions, if not billions, of examples—in colonoscopy the numbers usually reach only hundreds of thousands, at best a few million. This naturally raises an important question: are these general-purpose models robust enough to be applied to tasks different from those illustrated in the pre-training data? Can they also generalize to new datasets? The aim of this thesis is to analyze and compare this type of models on the task of polyp counting, which requires the ability to determine the number of polyps encountered throughout the video. Beyond this comparative study, this thesis will explore how representation learning with different objectives can affect downstream tasks.

Self-supervised learning for colonoscopy: a study on polyp counting and model generalization

BONGIOVANNI, ENRICA

2024/2025

Abstract

All endoscopy procedures are now guided by a video stream, which allows for a less invasive operation, reduced risks, and faster recovery. However, the endoscopy datasets available today represent only a very small fraction of all colonoscopies performed. For this reason, there has been a growing need to leverage all available endoscopy data, including unannotated data, which is relatively easier to collect and significantly cheaper than data manually annotated by expert endoscopists. The recent success of self-supervised pre-training strategies has not gone unnoticed in the medical computer vision community, which has quickly adopted them to address the problem of data and annotation scarcity. However, unlike self-supervised models on natural images—which can draw on hundreds of millions, if not billions, of examples—in colonoscopy the numbers usually reach only hundreds of thousands, at best a few million. This naturally raises an important question: are these general-purpose models robust enough to be applied to tasks different from those illustrated in the pre-training data? Can they also generalize to new datasets? The aim of this thesis is to analyze and compare this type of models on the task of polyp counting, which requires the ability to determine the number of polyps encountered throughout the video. Beyond this comparative study, this thesis will explore how representation learning with different objectives can affect downstream tasks.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Matematica "Tullio Levi-Civita" - DM
			
	Corso di studio
	
				DATA SCIENCE  Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2024
			
	Titolo inglese
	
				Self-supervised learning for colonoscopy: a study on polyp counting and model generalization
			
	Parola chiave
	
				colonoscopy
polyp counting
ss learning
			
	Relatore
	
				BALLAN, LAMBERTO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Bongiovanni_Enrica.pdf accesso aperto Dimensione 2.85 MB Formato Adobe PDF Visualizza/Apri	2.85 MB	Adobe PDF	Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/102099