Premium
Navigating within thiamine diphosphate‐dependent decarboxylases: Sequences, structures, functional positions, and binding sites
Author(s) -
Buchholz Patrick C. F.,
Ferrario Valerio,
Pohl Martina,
Gardossi Lucia,
Pleiss Jürgen
Publication year - 2019
Publication title -
proteins: structure, function, and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.25706
Subject(s) - thiamine , protein superfamily , sequence (biology) , sequence alignment , structural alignment , pairwise comparison , structural similarity , computational biology , multiple sequence alignment , enzyme , protein family , biology , stereochemistry , genetics , chemistry , peptide sequence , biochemistry , computer science , artificial intelligence , gene
Thiamine diphosphate‐dependent decarboxylases catalyze both cleavage and formation of CC bonds in various reactions, which have been assigned to different homologous sequence families. This work compares 53 ThDP‐dependent decarboxylases with known crystal structures. Both sequence and structural information were analyzed synergistically and data were analyzed for global and local properties by means of statistical approaches (principle component analysis and principal coordinate analysis) enabling complexity reduction. The different results obtained both locally and globally, that is, individual positions compared with the overall protein sequence or structure, revealed challenges in the assignment of separated homologous families. The methods applied herein support the comparison of enzyme families and the identification of functionally relevant positions. The findings for the family of ThDP‐dependent decarboxylases underline that global sequence identity alone is not sufficient to distinguish enzyme function. Instead, local sequence similarity, defined by comparisons of structurally equivalent positions, allows for a better navigation within several groups of homologous enzymes. The differentiation between homologous sequences is further enhanced by taking structural information into account, such as BioGPS analysis of the active site properties or pairwise structural superimpositions. The methods applied herein are expected to be transferrable to other enzyme families, to facilitate family assignments for homologous protein sequences.