Comparative Evaluation of Object Detection Models in Colonoscopy

Colonoscopy is one of the most widely used and effective methods for the prevention and early detection of colorectal cancer, as it allows the identification and removal of precancerous polyps. To support endoscopists during the procedure, Computer-Aided Detection (CADe) systems based on deep learning have increasingly been adopted and usually rely on object detection models that, in real-time, locate and indicate the position of polyps within the image. From a computer vision perspective, the literature focuses on the development of object detectors based on the YOLO family, which are designed to operate in real-time and are therefore suitable for intraoperative use. However, if we move to the post-procedure phase, a period during which the video can be analyzed using more powerful and complex systems, the objective shifts from speed to accuracy. In this thesis, a benchmark is proposed for the evaluation of object detectors that prioritizes accuracy rather than inference speed. Object detectors based on both the YOLO family (YOLOv7, YOLOv11) and transformer-based models (RT-DETR) are compared through a unified and reproducible pipeline. Overall, this thesis provides a comprehensive and clinically relevant assessment of modern object detection models for polyp detection, offering insights into their strengths, limitations, and the factors influencing their robustness in realistic colonoscopy scenarios.

Comparative Evaluation of Object Detection Models in Colonoscopy

PRENDIN, LAURA

2024/2025

Abstract

Colonoscopy is one of the most widely used and effective methods for the prevention and early detection of colorectal cancer, as it allows the identification and removal of precancerous polyps. To support endoscopists during the procedure, Computer-Aided Detection (CADe) systems based on deep learning have increasingly been adopted and usually rely on object detection models that, in real-time, locate and indicate the position of polyps within the image. From a computer vision perspective, the literature focuses on the development of object detectors based on the YOLO family, which are designed to operate in real-time and are therefore suitable for intraoperative use. However, if we move to the post-procedure phase, a period during which the video can be analyzed using more powerful and complex systems, the objective shifts from speed to accuracy. In this thesis, a benchmark is proposed for the evaluation of object detectors that prioritizes accuracy rather than inference speed. Object detectors based on both the YOLO family (YOLOv7, YOLOv11) and transformer-based models (RT-DETR) are compared through a unified and reproducible pipeline. Overall, this thesis provides a comprehensive and clinically relevant assessment of modern object detection models for polyp detection, offering insights into their strengths, limitations, and the factors influencing their robustness in realistic colonoscopy scenarios.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Matematica "Tullio Levi-Civita" - DM
			
	Corso di studio
	
				DATA SCIENCE  Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2024
			
	Titolo inglese
	
				Comparative Evaluation of Object Detection Models in Colonoscopy
			
	Parola chiave
	
				Colonoscopy
Colorectal Cancer
CAD
Object Detection
			
	Relatore
	
				BALLAN, LAMBERTO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Prendin_Laura.pdf accesso aperto Dimensione 3.24 MB Formato Adobe PDF Visualizza/Apri	3.24 MB	Adobe PDF	Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/102131