Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.

Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.

Unraveling Retrosynthetic Pathways through Template-Free Generative Models

ROMAIN, CHELSIE ADELLE RENESSA
2023/2024

Abstract

Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.
2023
Unraveling Retrosynthetic Pathways through Template-Free Generative Models
Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.
Retrosynthesis
Generative Modelling
Machine Learning
File in questo prodotto:
File Dimensione Formato  
Romain C Unraveling Retrosynthetic Pathways through Template-Free Generative Models.pdf

accesso riservato

Dimensione 832.97 kB
Formato Adobe PDF
832.97 kB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/80901