Natural similarity measures between position frequency matrices with an application to clustering



Open Access


Author(s) - Utz Johann Pape, Sven Rahmann, Martin Vingron

Publication year - 2008

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btm610

Subject(s) - cluster analysis, similarity (geometry), position (finance), computer science, natural (archaeology), artificial intelligence, pattern recognition (psychology), data mining, biology, business, paleontology, finance, image (mathematics)

Transcription factors (TFs) play a key role in gene regulation by binding to target sequences. In silico prediction of potential binding of a TF to a binding site is a well-studied problem in computational biology. The binding sites for one TF are represented by a position frequency matrix (PFM). The discovery of new PFMs requires the comparison to known PFMs to avoid redundancies. In general, two PFMs are similar if they occur at overlapping positions under a null model. Still, most existing methods compute similarity according to probabilistic distances of the PFMs. Here we propose a natural similarity measure based on the asymptotic covariance between the number of PFM hits incorporating both strands. Furthermore, we introduce a second measure based on the same idea to cluster a set of the Jaspar PFMs.
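
To make the idea of hit-count covariance concrete, the following is a minimal sketch and not the authors' method: the abstract describes an asymptotic covariance, whereas this example only estimates the covariance empirically by simulating sequences under a uniform i.i.d. null model. The function names, the pseudocount, the score threshold, and the toy consensus-based PFMs are all assumptions made for illustration.

```python
import math
import random

ALPHABET = "ACGT"
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def consensus_pfm(consensus, n_sites=10):
    """Hypothetical toy PFM: every one of n_sites sites matches the consensus."""
    return [{b: (n_sites if b == c else 0) for b in ALPHABET} for c in consensus]


def pfm_to_log_odds(pfm, background=0.25, pseudocount=1.0):
    """Turn a PFM (list of per-position count dicts) into log-odds scores."""
    pwm = []
    for column in pfm:
        total = sum(column[b] for b in ALPHABET) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in ALPHABET
        })
    return pwm


def reverse_complement(seq):
    return "".join(COMPLEMENT[b] for b in reversed(seq))


def count_hits(pwm, seq, threshold):
    """Count windows on either strand whose score reaches the threshold."""
    width = len(pwm)
    hits = 0
    for strand in (seq, reverse_complement(seq)):
        for start in range(len(strand) - width + 1):
            score = sum(pwm[i][strand[start + i]] for i in range(width))
            if score >= threshold:
                hits += 1
    return hits


def empirical_hit_covariance(pwm_a, thr_a, pwm_b, thr_b,
                             seq_len=200, n_seqs=300, seed=0):
    """Sample covariance of the two hit counts over random null sequences.

    Assumption: the null model is uniform and i.i.d. over A, C, G, T.
    """
    rng = random.Random(seed)
    counts_a, counts_b = [], []
    for _ in range(n_seqs):
        seq = "".join(rng.choice(ALPHABET) for _ in range(seq_len))
        counts_a.append(count_hits(pwm_a, seq, thr_a))
        counts_b.append(count_hits(pwm_b, seq, thr_b))
    mean_a = sum(counts_a) / n_seqs
    mean_b = sum(counts_b) / n_seqs
    return sum((a - mean_a) * (b - mean_b)
               for a, b in zip(counts_a, counts_b)) / (n_seqs - 1)


# Two toy motifs that can overlap each other when shifted by one position.
pwm_a = pfm_to_log_odds(consensus_pfm("TATA"))
pwm_b = pfm_to_log_odds(consensus_pfm("ATAT"))
print(empirical_hit_covariance(pwm_a, 6.0, pwm_b, 6.0))
```

With the toy settings above, a threshold of 6.0 admits only exact consensus matches. A simulation like this converges slowly and depends on the chosen sequence length, which is one reason an analytic, asymptotic treatment of the kind the abstract describes is attractive.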
