Initial Cluster Analysis | Zendy

Stephen F. Altschul | Zendy; Andrew F. Neuwald | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Initial Cluster Analysis

Author(s) -

Stephen F. Altschul,

Andrew F. Neuwald

Publication year - 2017

Publication title -

journal of computational biology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.585

H-Index - 95

eISSN - 1557-8666

pISSN - 1066-5277

DOI - 10.1089/cmb.2017.0050

Subject(s) - gtpase , guanine nucleotide exchange factor , sequence (biology) , cluster (spacecraft) , computational biology , string (physics) , biology , cluster analysis , elongation factor , computer science , genetics , physics , rna , artificial intelligence , ribosome , gene , programming language , quantum mechanics

We study a simple abstract problem motivated by a variety of applications in protein sequence analysis. Consider a string of 0s and 1s of length L, and containing D 1s. If we believe that some or all of the 1s may be clustered near the start of the sequence, which subset is the most significantly so clustered, and how significant is this clustering? We approach this question using the minimum description length principle and illustrate its application by analyzing residues that distinguish translational initiation and elongation factor guanosine triphosphatases (GTPases) from other P-loop GTPases. Within a structure of yeast elongation factor 1[Formula: see text], these residues form a significant cluster centered on a region implicated in guanine nucleotide exchange. Various biomedical questions may be cast as the abstract problem considered here.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research