z-logo
open-access-imgOpen Access
Generating stable molecules using imitation and reinforcement learning
Author(s) -
Søren Ager Meldgaard,
Jonas Köhler,
Henrik Lund Mortensen,
Mads-Peter V. Christiansen,
Frank Noé,
Bjørk Hammer
Publication year - 2021
Publication title -
machine learning science and technology
Language(s) - English
Resource type - Journals
ISSN - 2632-2153
DOI - 10.1088/2632-2153/ac3eb4
Subject(s) - chemical space , reinforcement learning , computer science , stability (learning theory) , representation (politics) , graph , artificial intelligence , set (abstract data type) , cartesian coordinate system , quantum chemical , machine learning , molecule , theoretical computer science , mathematics , chemistry , drug discovery , biochemistry , geometry , politics , political science , law , programming language , organic chemistry
Chemical space is routinely explored by machine learning methods to discover interesting molecules, before time-consuming experimental synthesizing is attempted. However, these methods often rely on a graph representation, ignoring 3D information necessary for determining the stability of the molecules. We propose a reinforcement learning (RL) approach for generating molecules in Cartesian coordinates allowing for quantum chemical prediction of the stability. To improve sample-efficiency we learn basic chemical rules from imitation learning (IL) on the GDB-11 database to create an initial model applicable for all stoichiometries. We then deploy multiple copies of the model conditioned on a specific stoichiometry in a RL setting. The models correctly identify low energy molecules in the database and produce novel isomers not found in the training set. Finally, we apply the model to larger molecules to show how RL further refines the IL model in domains far from the training data.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom