Overcoming sequence misalignments with weighted structural superposition

Khazanov Nickolay A.; Damm-Ganamet Kelly L.; Quang Daniel X.; Carlson Heather A.

Publication year - 2012

Publication title - Proteins: Structure, Function, and Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.699

H-Index - 191

eISSN - 1097-0134

pISSN - 0887-3585

DOI - 10.1002/prot.24134

Subject(s) - alignment-free sequence analysis, sequence (biology), sequence alignment, superposition principle, structural alignment, computer science, algorithm, sequence logo, protein superfamily, multiple sequence alignment, computational biology, mathematics, peptide sequence, biology, genetics, mathematical analysis, gene

An appropriate structural superposition identifies similarities and differences between homologous proteins that are not evident from sequence alignments alone. We have coupled our Gaussian‐weighted RMSD (wRMSD) tool with a sequence aligner and a seed extension (SE) algorithm to create a robust technique for overlaying structures and aligning sequences of homologous proteins (HwRMSD). HwRMSD overcomes errors in the initial sequence alignment that would normally propagate into a standard RMSD overlay. SE can then generate a corrected sequence alignment from the improved structural superposition obtained by wRMSD. HwRMSD's robust performance and its superiority over standard RMSD are demonstrated over a range of homologous proteins. The improved overlays yield corrected sequence alignments that agree well with HOMSTRAD. Finally, HwRMSD is compared to established structural alignment methods: FATCAT, secondary‐structure matching, combinatorial extension, and DaliLite. Most methods are comparable at placing residue pairs within 2 Å, but HwRMSD places many more residue pairs within 1 Å, providing a clear advantage. Such high accuracy is essential in drug design, where small distances can have a large impact on computational predictions. This level of accuracy is also needed to correct sequence alignments in an automated fashion, especially for omics‐scale analysis. HwRMSD can align homologs with low sequence identity and large conformational differences, cases where both sequence‐based and structure‐based methods may fail. The HwRMSD pipeline overcomes the dependency of structural overlays on the initial sequence pairing and removes the need to determine the best sequence‐alignment method, substitution matrix, and gap parameters for each unique pair of homologs. Proteins 2012. © 2012 Wiley Periodicals, Inc.
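To illustrate why Gaussian weighting can make an overlay less sensitive to mispaired or divergent residues, the following is a minimal sketch and not the authors' implementation. It assumes the two structures are already superposed and paired residue by residue, that each pair receives a weight of the form exp(-d²/c), and that the scaling constant `c` (set to 5.0 Å² here purely for illustration) and the toy coordinates are hypothetical. It also counts residue pairs within the 1 Å and 2 Å cutoffs mentioned in the abstract.

```python
import math

def pair_distances(coords_a, coords_b):
    """Distance (in Angstroms) between each paired atom of two overlaid structures."""
    return [math.dist(a, b) for a, b in zip(coords_a, coords_b)]

def gaussian_weights(distances, c=5.0):
    """Assumed weight form exp(-d^2 / c); c is an illustrative scaling constant."""
    return [math.exp(-(d * d) / c) for d in distances]

def standard_rmsd(distances):
    """Unweighted RMSD: every pair contributes equally."""
    return math.sqrt(sum(d * d for d in distances) / len(distances))

def weighted_rmsd(distances, weights):
    """Weighted RMSD: distant pairs get small weights and contribute little."""
    total = sum(weights)
    return math.sqrt(sum(w * d * d for w, d in zip(weights, distances)) / total)

def count_within(distances, cutoff):
    """Number of residue pairs placed within the given cutoff."""
    return sum(1 for d in distances if d <= cutoff)

# Made-up C-alpha positions: three well-matched pairs and one badly mispaired residue.
coords_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
coords_b = [(0.3, 0.0, 0.0), (3.8, 0.8, 0.0), (7.6, 0.0, 1.5), (11.4, 6.0, 0.0)]

distances = pair_distances(coords_a, coords_b)
weights = gaussian_weights(distances)

print("standard RMSD:", round(standard_rmsd(distances), 3))
print("weighted RMSD:", round(weighted_rmsd(distances, weights), 3))
print("pairs within 1 A:", count_within(distances, 1.0))
print("pairs within 2 A:", count_within(distances, 2.0))
```

In this toy case the single mispaired residue dominates the standard RMSD but receives a near-zero weight, so it barely affects the weighted value. A full weighted superposition would additionally re-fit the rotation and translation with these weights and iterate until the overlay stops changing, which this sketch omits.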