z-logo
Premium
The Short‐chain Dehydrogenase/Reductase Engineering Database (SDRED): A classification and analysis system for a highly diverse enzyme family
Author(s) -
Gräff Maike,
Buchholz Patrick C.F.,
Stockinger Peter,
Bommarius Bettina,
Bommarius Andreas S.,
Pleiss Jürgen
Publication year - 2019
Publication title -
proteins: structure, function, and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.25666
Subject(s) - sequence motif , sequence logo , sequence alignment , sequence (biology) , structural classification of proteins database , computational biology , sequence database , protein family , numbering , peptide sequence , structural alignment , structural motif , protein sequencing , consensus sequence , biology , multiple sequence alignment , genetics , computer science , protein structure , algorithm , biochemistry , dna , gene
The Short-chain Dehydrogenases/Reductases Engineering Database (SDRED) covers one of the largest known protein families (168 150 proteins). Assignment to the superfamilies of Classical and Extended SDRs was achieved by global sequence similarity and by identification of family-specific sequence motifs. Two standard numbering schemes were established for Classical and Extended SDRs that allow for the determination of conserved amino acid residues, such as cofactor specificity determining positions or superfamily specific sequence motifs. The comprehensive sequence dataset of the SDRED facilitates the refinement of family-specific sequence motifs. The glycine-rich motifs for Classical and Extended SDRs were refined to improve the precision of superfamily classification. In each superfamily, the majority of sequences formed a tightly connected sequence network and belonged to a large homologous family. Despite their different sequence motifs and their different sequence length, the two sequence networks of Classical and Extended SDRs are not separate, but connected by edges at a threshold of 40% sequence similarity, indicating that all SDRs belong to a large, connected network. The SDRED is accessible at https://sdred.biocatnet.de/.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here