GTRAC: fast retrieval from compressed collections of genomic variants

Kedar Tatwawadi; Mikel Hernáez; Idoia Ochoa; Tsachy Weissman

Open Access

Publication year - 2016

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btw437

Subject(s) - computer science, random access, data compression, genome, rendering (computer graphics), data mining, database, computational biology, information retrieval, algorithm, artificial intelligence, biology, genetics, gene, operating system

The dramatic decrease in the cost of sequencing has resulted in the generation of huge amounts of genomic data, as evidenced by projects such as the UK10K and the Million Veteran Project, with the number of sequenced genomes on the order of 10K to 1M. Because genomic sequences of individuals from the same species are highly redundant, most medical research deals with the variants in the sequences relative to a reference sequence, rather than with the complete genomic sequences. Consequently, millions of genomes represented as variants are stored in databases. These databases are constantly updated and queried to extract information such as the common variants among individuals or groups of individuals. Previous algorithms for compressing this type of database lack efficient random access capabilities, which makes querying the database for particular variants and/or individuals extremely inefficient, to the point where compression is often relinquished altogether.
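To make the kinds of queries mentioned above concrete, here is a minimal Python sketch of an uncompressed variant collection. It is not the GTRAC algorithm and involves no compression. It only illustrates storing genomes as variants against a reference and asking for one individual's variants, the carriers of one variant, and the variants common to a group. All names (`build_index`, `variants_of`, `carriers_of`, `common_variants`, the sample labels and the variant values) are hypothetical and chosen for illustration.

```python
from typing import Dict, List, Set, Tuple

# Assumption: a variant is (position, reference allele, alternate allele),
# expressed relative to a shared reference sequence.
Variant = Tuple[int, str, str]


def build_index(genomes: Dict[str, List[Variant]]) -> Tuple[List[Variant], Dict[str, Set[int]]]:
    """Assign each distinct variant an id and store each genome as a set of ids."""
    variants = sorted({v for vs in genomes.values() for v in vs})
    variant_id = {v: i for i, v in enumerate(variants)}
    rows = {name: {variant_id[v] for v in vs} for name, vs in genomes.items()}
    return variants, rows


def variants_of(rows: Dict[str, Set[int]], variants: List[Variant], name: str) -> List[Variant]:
    """Query by individual: all variants carried by one genome."""
    return [variants[i] for i in sorted(rows[name])]


def carriers_of(rows: Dict[str, Set[int]], variants: List[Variant], variant: Variant) -> List[str]:
    """Query by variant: all individuals that carry it."""
    i = variants.index(variant)
    return sorted(name for name, ids in rows.items() if i in ids)


def common_variants(rows: Dict[str, Set[int]], variants: List[Variant], names: List[str]) -> List[Variant]:
    """Variants shared by every individual in a (non-empty) group."""
    shared = set.intersection(*(rows[n] for n in names))
    return [variants[i] for i in sorted(shared)]


# Hypothetical toy data, not taken from any real project.
genomes = {
    "sample_A": [(101, "A", "G"), (250, "C", "T")],
    "sample_B": [(101, "A", "G"), (400, "G", "GA")],
    "sample_C": [(250, "C", "T"), (400, "G", "GA")],
}
variants, rows = build_index(genomes)

print(variants_of(rows, variants, "sample_A"))
print(carriers_of(rows, variants, (101, "A", "G")))
print(common_variants(rows, variants, ["sample_A", "sample_B"]))
```

In this uncompressed form every query is a direct lookup. The difficulty the abstract describes arises once the collection is compressed, because a compressor without random access must decode large parts of the data before any single row or column can be read.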
