z-logo
open-access-imgOpen Access
Neural networks to learn protein sequence–function relationships from deep mutational scanning data
Author(s) -
Sam Gelman,
Sarah A. Fahlberg,
Pete Heinzelman,
Philip A. Romero,
Anthony Gitter
Publication year - 2021
Publication title -
proceedings of the national academy of sciences
Language(s) - English
Resource type - Journals
eISSN - 1091-6490
pISSN - 0027-8424
DOI - 10.1073/pnas.2104878118
Subject(s) - sequence (biology) , artificial neural network , protein function , function (biology) , deep learning , artificial intelligence , computer science , computational biology , biology , genetics , gene
Significance Understanding the relationship between protein sequence and function is necessary to design new and useful proteins with applications in bioenergy, medicine, and agriculture. The mapping from sequence to function is tremendously complex because it involves thousands of molecular interactions that are coupled over multiple lengths and timescales. We show that neural networks can learn the sequence–function mapping from large protein datasets. Neural networks are appealing for this task because they can learn complicated relationships from data, make few assumptions about the nature of the sequence–function relationship, and can learn general rules that apply across the length of the protein sequence. We demonstrate that learned models can be applied to design new proteins with properties that exceed natural sequences.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here