ERGC: an efficient referential genome compression algorithm

Open Access

Author(s) - Subrata Saha, Sanguthevar Rajasekaran

Publication year - 2015

Publication title - Bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.599

H-Index - 390

eISSN - 1367-4811

pISSN - 1367-4803

DOI - 10.1093/bioinformatics/btv399

Subject(s) - bottleneck, computer science, exploit, data compression, algorithm, implementation, reference genome, data mining, dna sequencing, genome, information bottleneck method, compression (physics), artificial intelligence, biology, cluster analysis, gene, genetics, materials science, computer security, programming language, composite material, embedded system

Genome sequencing has become faster and more affordable, and the number of available complete genomic sequences is increasing rapidly. As a result, the cost of storing, processing, analyzing and transmitting these data is becoming a bottleneck for research and future medical applications, and the need for efficient data compression and data reduction techniques for biological sequencing data is growing by the day. Although a number of standard data compression algorithms exist, they are not efficient at compressing biological data, because these generic algorithms do not exploit inherent properties of sequencing data. Exploiting the statistical and information-theoretic properties of genomic sequences requires specialized compression algorithms. Five different next-generation sequencing data compression problems have been identified and studied in the literature. We propose a novel algorithm for one of these problems, known as reference-based genome compression.
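The abstract names reference-based genome compression without describing it. As a rough illustration of the general idea only, the sketch below encodes a target sequence as a list of matches against a reference plus literal bases where the two differ. It is not the ERGC algorithm: the greedy k-mer matching strategy, the function names (`build_index`, `compress`, `decompress`) and the operation format are all assumptions chosen for the example.

```python
from typing import Dict, List, Tuple, Union

# Hypothetical operation format (an assumption for this sketch):
#   ("M", ref_pos, length) copies `length` bases from the reference at ref_pos
#   ("L", text)            stores bases that could not be matched
Op = Union[Tuple[str, int, int], Tuple[str, str]]


def build_index(reference: str, k: int) -> Dict[str, int]:
    """Map each k-mer to the first position where it occurs in the reference."""
    index: Dict[str, int] = {}
    for i in range(len(reference) - k + 1):
        index.setdefault(reference[i:i + k], i)
    return index


def compress(target: str, reference: str, k: int = 4) -> List[Op]:
    """Greedily encode the target as matches against the reference plus literals."""
    index = build_index(reference, k)
    ops: List[Op] = []
    literal: List[str] = []
    i = 0
    while i < len(target):
        pos = index.get(target[i:i + k])
        if pos is None:
            # No k-mer seed here: keep the base as a literal and move on.
            literal.append(target[i])
            i += 1
            continue
        # Extend the seed match base by base for as long as both sequences agree.
        length = k
        while (i + length < len(target)
               and pos + length < len(reference)
               and target[i + length] == reference[pos + length]):
            length += 1
        if literal:
            ops.append(("L", "".join(literal)))
            literal = []
        ops.append(("M", pos, length))
        i += length
    if literal:
        ops.append(("L", "".join(literal)))
    return ops


def decompress(ops: List[Op], reference: str) -> str:
    """Rebuild the target from the operations and the same reference."""
    parts: List[str] = []
    for op in ops:
        if op[0] == "M":
            _, pos, length = op
            parts.append(reference[pos:pos + length])
        else:
            parts.append(op[1])
    return "".join(parts)


reference = "ACGTACGTTTGACCA"
target = "ACGTACGATTGACCA"  # differs from the reference by one substitution
ops = compress(target, reference)
assert decompress(ops, reference) == target
```

When the target is highly similar to the reference, most of it collapses into a few match operations, which is where the savings over a generic compressor would come from. A practical tool would also need to entropy-code the operations and handle much longer sequences, which this sketch does not attempt.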
