De Novo Molecular Design by Combining Deep Autoencoder Recurrent Neural Networks with Generative Topographic Mapping | Zendy

Boris Sattarov | Zendy; Igor I. Baskin | Zendy; Dragos Horvath | Zendy; Gilles Marcou | Zendy; Esben Jannik Bjerrum | Zendy; Alexandre Varnek | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

De Novo Molecular Design by Combining Deep Autoencoder Recurrent Neural Networks with Generative Topographic Mapping

Author(s) -

Boris Sattarov,

Igor I. Baskin,

Dragos Horvath,

Gilles Marcou,

Esben Jannik Bjerrum,

Alexandre Varnek

Publication year - 2019

Publication title -

journal of chemical information and modeling

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.24

H-Index - 160

eISSN - 1549-960X

pISSN - 1549-9596

DOI - 10.1021/acs.jcim.8b00751

Subject(s) - autoencoder , generative grammar , artificial intelligence , artificial neural network , computer science , generative model , deep neural networks , deep learning , recurrent neural network , pattern recognition (psychology)

Here we show that Generative Topographic Mapping (GTM) can be used to explore the latent space of the SMILES-based autoencoders and generate focused molecular libraries of interest. We have built a sequence-to-sequence neural network with Bidirectional Long Short-Term Memory layers and trained it on the SMILES strings from ChEMBL23. Very high reconstruction rates of the test set molecules were achieved (>98%), which are comparable to the ones reported in related publications. Using GTM, we have visualized the autoencoder latent space on the two-dimensional topographic map. Targeted map zones can be used for generating novel molecular structures by sampling associated latent space points and decoding them to SMILES. The sampling method based on a genetic algorithm was introduced to optimize compound properties "on the fly". The generated focused molecular libraries were shown to contain original and a priori feasible compounds which, pending actual synthesis and testing, showed encouraging behavior in independent structure-based affinity estimation procedures (pharmacophore matching, docking).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research