Uncertainty-Informed Deep Transfer Learning of Perfluoroalkyl and Polyfluoroalkyl Substance Toxicity

Author(s) - Jeremy Feinstein, Ganesh Sivaraman, Kurt Picel, Brian Peters, Álvaro Vázquez-Mayagoitia, Arvind Ramanathan, Margaret MacDonell, Ian Foster, Eugene Yan

Open Access

Publication year - 2021

Publication title - Journal of Chemical Information and Modeling

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.24

H-Index - 160

eISSN - 1549-960X

pISSN - 1549-9596

DOI - 10.1021/acs.jcim.1c01204

Subject(s) - toxicity, transfer of learning, computer science, acute toxicity, deep learning, artificial intelligence, convolutional neural network, machine learning, chemistry, organic chemistry

Perfluoroalkyl and polyfluoroalkyl substances (PFAS) pose a significant hazard because of their widespread industrial uses, environmental persistence, and bioaccumulation. The U.S. Environmental Protection Agency recently updated its growing, increasingly diverse inventory of PFAS, which now includes 8163 chemicals. However, with the exception of a handful of well-studied examples, little is known about their human toxicity potential because of the substantial resources required for in vivo toxicity experiments. We address the cost of in vivo experiments by evaluating multiple machine learning (ML) methods, including random forests, deep neural networks (DNN), graph convolutional networks, and Gaussian processes, for predicting the acute toxicity (e.g., median lethal dose, or LD50) of PFAS compounds. To address the scarcity of toxicity information for PFAS, publicly available datasets of oral rat LD50 for all organic compounds are aggregated and used to develop state-of-the-art ML source models for transfer learning. A total of 519 fluorinated compounds with known toxicity, each containing two or more C-F bonds, are used to transfer knowledge to ensembles of the best-performing source model, the DNN, yielding target models for the PFAS domain that also provide uncertainty estimates. This study predicts toxicity for PFAS with a defined chemical structure. To further inform prediction confidence, the transfer-learned model is embedded within a SelectiveNet architecture, which allows the model to identify regions where it predicts with greater confidence and to abstain from predictions with high uncertainty, using a calibrated cutoff rate.
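The idea of pairing ensemble predictions with an abstention rule can be illustrated with a small sketch. This is not the authors' implementation: SelectiveNet learns its selection behaviour jointly with the predictor, whereas the code below substitutes a simpler post-hoc rule that abstains whenever the spread across ensemble members exceeds a cutoff chosen to retain a target fraction of a calibration set. The function names, the toy prediction values, and the 60% coverage target are all hypothetical and chosen only for illustration.

```python
import math
from statistics import mean, pstdev


def ensemble_predict(member_predictions):
    """Combine ensemble outputs into a (mean, spread) pair per compound.

    member_predictions holds one inner list per ensemble member, each with
    one predicted toxicity value per compound (same compound order).
    """
    per_compound = list(zip(*member_predictions))
    means = [mean(values) for values in per_compound]
    spreads = [pstdev(values) for values in per_compound]
    return means, spreads


def calibrate_threshold(calibration_spreads, target_coverage):
    """Pick the spread cutoff that retains about target_coverage of the
    calibration compounds (a post-hoc stand-in for a learned selector)."""
    if not calibration_spreads:
        raise ValueError("calibration_spreads must not be empty")
    if not 0.0 < target_coverage <= 1.0:
        raise ValueError("target_coverage must be in (0, 1]")
    ordered = sorted(calibration_spreads)
    keep = max(1, math.ceil(target_coverage * len(ordered)))
    return ordered[keep - 1]


def selective_predict(means, spreads, threshold):
    """Return the ensemble mean, or None (abstain) when the spread is too large."""
    return [m if s <= threshold else None for m, s in zip(means, spreads)]


# Hypothetical outputs of three ensemble members for three compounds.
toy_predictions = [
    [2.0, 3.0, 1.0],
    [2.5, 3.0, 4.0],
    [1.5, 3.0, 2.5],
]
toy_means, toy_spreads = ensemble_predict(toy_predictions)
toy_threshold = calibrate_threshold(toy_spreads, target_coverage=0.6)
print(selective_predict(toy_means, toy_spreads, toy_threshold))
```

In this toy run the third compound, on which the members disagree most, is the one the rule declines to predict. In practice the cutoff would be calibrated on held-out compounds rather than on the same data being predicted.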
