Unraveling Retrosynthetic Pathways through Template-Free Generative Models

Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.

Unraveling Retrosynthetic Pathways through Template-Free Generative Models

ROMAIN, CHELSIE ADELLE RENESSA

2023/2024

Abstract

Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Matematica "Tullio Levi-Civita" - DM
			
	Corso di studio
	
				DATA SCIENCE Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2023
			
	Titolo inglese
	
				Unraveling Retrosynthetic Pathways through Template-Free Generative Models
			
	Abstract in italiano
	
				Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.
			
	Parola chiave
	
				Retrosynthesis
Generative Modelling
Machine Learning
			
	Relatore
	
				SPERDUTI, ALESSANDRO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Romain C Unraveling Retrosynthetic Pathways through Template-Free Generative Models.pdf Accesso riservato Dimensione 832.97 kB Formato Adobe PDF	832.97 kB	Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/80901