Adaptive seeds tame genomic sequence comparison | Zendy

Szymon M. Kiełbasa | Zendy; Raymond Wan | Zendy; Kengo Sato | Zendy; Paul Horton | Zendy; Martin C. Frith | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Adaptive seeds tame genomic sequence comparison

Author(s) -

Szymon M. Kiełbasa,

Raymond Wan,

Kengo Sato,

Paul Horton,

Martin C. Frith

Publication year - 2011

Publication title -

genome research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.556

H-Index - 297

eISSN - 1549-5469

pISSN - 1088-9051

DOI - 10.1101/gr.113985.110

Subject(s) - biology , sequence (biology) , quadratic growth , composition (language) , computational biology , dna sequencing , algorithm , genetics , dna , computer science , linguistics , philosophy

The main way of analyzing biological sequences is by comparing and aligning them to each other. It remains difficult, however, to compare modern multi-billionbase DNA data sets. The difficulty is caused by the nonuniform (oligo)nucleotide composition of these sequences, rather than their size per se. To solve this problem, we modified the standard seed-and-extend approach (e.g., BLAST) to use adaptive seeds. Adaptive seeds are matches that are chosen based on their rareness, instead of using fixed-length matches. This method guarantees that the number of matches, and thus the running time, increases linearly, instead of quadratically, with sequence length. LAST, our open source implementation of adaptive seeds, enables fast and sensitive comparison of large sequences with arbitrarily nonuniform composition.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research