Crumble: reference free lossy compression of sequence quality values
Author(s) -
James Bonfield,
Shane McCarthy,
Richard Durbin
Publication year - 2018
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/bty608
Subject(s) - computer science , file size , database , lossy compression , file format , quality (philosophy) , set (abstract data type) , base (topology) , reduction (mathematics) , compression (physics) , sequence (biology) , data mining , operating system , mathematics , programming language , biology , mathematical analysis , philosophy , materials science , geometry , epistemology , composite material , genetics
The bulk of space taken up by NGS sequencing CRAM files consists of per-base quality values. Most of these are unnecessary for variant calling, offering an opportunity for space saving.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom