Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.
Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.
Unraveling Retrosynthetic Pathways through Template-Free Generative Models
ROMAIN, CHELSIE ADELLE RENESSA
2023/2024
Abstract
Retrosynthesis, the process of identifying reaction pathways to synthesize target molecules, is a critical task in computational chemistry. This work discusses various approaches to framing and solving the retrosynthetic problem using deep neural networks, distinguishing between single-step and multi-step methods, template-based and template-free models, and graph-based versus sequence-based representations. This project uses a generative model to generate the reaction pathways where each molecule is encoded in an embedding space. Three encoder architectures—Normalizing Flows (NF), Variational Autoencoders (VAE), and Autoencoders (AE)—are evaluated for their ability to encode molecular structures effectively. The generative model employed, Insert-Fill-Halt (IFH), generates retrosynthetic routes. The study also discusses appropriate evaluation metrics: reconstruction accuracy for encoders and tree edit distance, clustering diversity, and the proportion of solved routes for generative performance. The results reveal the strengths and limitations of the VAE and AE in reconstructive accuracy and concluding remarks emphasize directions for future works.File | Dimensione | Formato | |
---|---|---|---|
Romain C Unraveling Retrosynthetic Pathways through Template-Free Generative Models.pdf
accesso riservato
Dimensione
832.97 kB
Formato
Adobe PDF
|
832.97 kB | Adobe PDF |
The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License
https://hdl.handle.net/20.500.12608/80901