Analyzing Learned Molecular Representations for Property Prediction | Zendy

Kevin Yang | Zendy; Kyle Swanson | Zendy; Wengong Jin | Zendy; Connor W. Coley | Zendy; Philipp Eiden | Zendy; Hua Gao | Zendy; Angel Guzmán-Pérez | Zendy; Timothy Hopper | Zendy; Brian Kelley | Zendy; Miriam Mathea | Zendy; Andrew Palmer | Zendy; Volker Settels | Zendy; Tommi Jaakkola | Zendy; Klavs F. Jensen | Zendy; Regina Barzilay | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Analyzing Learned Molecular Representations for Property Prediction

Author(s) -

Kevin Yang,

Kyle Swanson,

Wengong Jin,

Connor W. Coley,

Philipp Eiden,

Hua Gao,

Angel Guzmán-Pérez,

Timothy Hopper,

Brian Kelley,

Miriam Mathea,

Andrew Palmer,

Volker Settels,

Tommi Jaakkola,

Klavs F. Jensen,

Regina Barzilay

Publication year - 2019

Publication title -

journal of chemical information and modeling

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.24

H-Index - 160

eISSN - 1549-960X

pISSN - 1549-9596

DOI - 10.1021/acs.jcim.9b00237

Subject(s) - computer science , graph , molecular graph , benchmark (surveying) , artificial intelligence , chemical space , convolutional neural network , workflow , machine learning , property (philosophy) , artificial neural network , data mining , construct (python library) , representation (politics) , theoretical computer science , drug discovery , chemistry , database , biochemistry , philosophy , geodesy , epistemology , politics , political science , law , programming language , geography

Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structure of the molecule. However, recent literature has yet to clearly determine which of these two methods is superior when generalizing to new chemical space. Furthermore, prior research has rarely examined these new models in industry research settings in comparison to existing employed models. In this paper, we benchmark models extensively on 19 public and 16 proprietary industrial data sets spanning a wide variety of chemical end points. In addition, we introduce a graph convolutional model that consistently matches or outperforms models using fixed molecular descriptors as well as previous graph neural architectures on both public and proprietary data sets. Our empirical findings indicate that while approaches based on these representations have yet to reach the level of experimental reproducibility, our proposed model nevertheless offers significant improvements over models currently used in industrial workflows.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research