z-logo
Premium
Structure‐based design of combinatorial mutagenesis libraries
Author(s) -
Verma Deeptak,
Grigoryan Gevorg,
BaileyKellogg Chris
Publication year - 2015
Publication title -
protein science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.353
H-Index - 175
eISSN - 1469-896X
pISSN - 0961-8368
DOI - 10.1002/pro.2642
Subject(s) - sequence space , protein design , thermostability , computer science , mutagenesis , computational biology , combinatorial optimization , protein engineering , protein structure , theoretical computer science , algorithm , chemistry , mathematics , biology , mutation , genetics , biochemistry , pure mathematics , banach space , gene , enzyme
The development of protein variants with improved properties (thermostability, binding affinity, catalytic activity, etc.) has greatly benefited from the application of high‐throughput screens evaluating large, diverse combinatorial libraries. At the same time, since only a very limited portion of sequence space can be experimentally constructed and tested, an attractive possibility is to use computational protein design to focus libraries on a productive portion of the space. We present a general‐purpose method, called “Structure‐based Optimization of Combinatorial Mutagenesis ” (SOCoM ), which can optimize arbitrarily large combinatorial mutagenesis libraries directly based on structural energies of their constituents. SOCoM chooses both positions and substitutions, employing a combinatorial optimization framework based on library‐averaged energy potentials in order to avoid explicitly modeling every variant in every possible library. In case study applications to green fluorescent protein, β‐lactamase, and lipase A, SOCoM optimizes relatively small, focused libraries whose variants achieve energies comparable to or better than previous library design efforts, as well as larger libraries (previously not designable by structure‐based methods) whose variants cover greater diversity while still maintaining substantially better energies than would be achieved by representative random library approaches. By allowing the creation of large‐scale combinatorial libraries based on structural calculations, SOCoM promises to increase the scope of applicability of computational protein design and improve the hit rate of discovering beneficial variants. While designs presented here focus on variant stability (predicted by total energy), SOCoM can readily incorporate other structure‐based assessments, such as the energy gap between alternative conformational or bound states.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here