z-logo
open-access-imgOpen Access
Dataset Construction to Explore Chemical Space with 3D Geometry and Deep Learning
Author(s) -
Jianing Lu,
Song Xia,
Jieyu Lü,
Yingkai Zhang
Publication year - 2021
Publication title -
journal of chemical information and modeling
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.24
H-Index - 160
eISSN - 1549-960X
pISSN - 1549-9596
DOI - 10.1021/acs.jcim.1c00007
Subject(s) - force field (fiction) , energy minimization , computer science , deep learning , space (punctuation) , basis (linear algebra) , field (mathematics) , chemical space , work (physics) , molecule , quality (philosophy) , crystal (programming language) , artificial intelligence , fragment (logic) , algorithm , computational chemistry , physics , geometry , chemistry , mathematics , thermodynamics , quantum mechanics , pure mathematics , programming language , drug discovery , operating system , biochemistry
A dataset is the basis of deep learning model development, and the success of deep learning models heavily relies on the quality and size of the dataset. In this work, we present a new data preparation protocol and build a large fragment-based dataset Frag20, which consists of optimized 3D geometries and calculated molecular properties from Merck molecular force field (MMFF) and DFT at the B3LYP/6-31G* level of theory for more than half a million molecules composed of H, B, C, O, N, F, P, S, Cl, and Br with no larger than 20 heavy atoms. Based on the new dataset, we develop robust molecular energy prediction models using a simplified PhysNet architecture for both DFT-optimized and MMFF-optimized geometries, which achieve better than or close to chemical accuracy (1 kcal/mol) on multiple test sets, including CSD20 and Plati20 based on experimental crystal structures.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom