The rise of generative models has intensified the challenge of detecting manipulated images, as these tools create increasingly convincing forgeries, making multimedia forensics a critical field. This thesis provides a complete analysis of state-of-the-art methods for detecting image manipulations. It includes a survey of existing approaches, replication of key experiments to address unclear or underreported findings, and an examination of common limitations and failure modes. Traditionally, image manipulation detection has been treated as a binary classification problem (real vs. fake), which restricts the ability to identify and localize diverse manipulation types. To overcome the limitations of binary classification, this work enhances an already existing detector by developing a unified multiclass framework (real vs. synthetic vs. tampered). This framework not only categorizes the image authenticity but also integrates a segmentation branch for pixel-level localization of tampered regions. The results demonstrate that: (1) replicating prior studies highlights vulnerabilities to preprocessing decisions and dataset inconsistencies, (2) the multiclass approach enables a more granular diagnosis of image authenticity and, when coupled with a dedicated segmentation branch, achieves state-of-the-art localization performance on its primary benchmark, (3) despite these advancements, significant challenges in cross-dataset generalization and robustness to diverse manipulation types persist. The primary contributions of this work include a critical review of the literature, a detailed analysis of failure modes, the development and evaluation of a multiclass detection framework, and a discussion of unresolved challenges and potential future research directions.

From Detection to Localization: Advances in Image Forgery Analysis with AI

BRIGO, MARCO
2024/2025

Abstract

The rise of generative models has intensified the challenge of detecting manipulated images, as these tools create increasingly convincing forgeries, making multimedia forensics a critical field. This thesis provides a complete analysis of state-of-the-art methods for detecting image manipulations. It includes a survey of existing approaches, replication of key experiments to address unclear or underreported findings, and an examination of common limitations and failure modes. Traditionally, image manipulation detection has been treated as a binary classification problem (real vs. fake), which restricts the ability to identify and localize diverse manipulation types. To overcome the limitations of binary classification, this work enhances an already existing detector by developing a unified multiclass framework (real vs. synthetic vs. tampered). This framework not only categorizes the image authenticity but also integrates a segmentation branch for pixel-level localization of tampered regions. The results demonstrate that: (1) replicating prior studies highlights vulnerabilities to preprocessing decisions and dataset inconsistencies, (2) the multiclass approach enables a more granular diagnosis of image authenticity and, when coupled with a dedicated segmentation branch, achieves state-of-the-art localization performance on its primary benchmark, (3) despite these advancements, significant challenges in cross-dataset generalization and robustness to diverse manipulation types persist. The primary contributions of this work include a critical review of the literature, a detailed analysis of failure modes, the development and evaluation of a multiclass detection framework, and a discussion of unresolved challenges and potential future research directions.
2024
From Detection to Localization: Advances in Image Forgery Analysis with AI
AI image detection
Image forgery
Forgery localization
File in questo prodotto:
File Dimensione Formato  
THESIS_MSC_MARCOBRIGO.pdf

accesso aperto

Dimensione 2.72 MB
Formato Adobe PDF
2.72 MB Adobe PDF Visualizza/Apri

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/102082