RecoverY: k-mer-based read classification for Y-chromosome-specific sequencing and assembly
Author(s) -
Samarth Rangavittal,
Robert S. Harris,
Monika Čechová,
Marta Tomaszkiewicz,
Rayan Chikhi,
Kateryna D. Makova,
Paul Medvedev
Publication year - 2017
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btx771
Subject(s) - contig , computer science , sequence assembly , robustness (evolution) , ploidy , chromosome , process (computing) , computational biology , k mer , genome , biology , data mining , genetics , gene , gene expression , transcriptome , operating system
The haploid mammalian Y chromosome is usually under-represented in genome assemblies due to high repeat content and low depth due to its haploid nature. One strategy to ameliorate the low coverage of Y sequences is to experimentally enrich Y-specific material before assembly. As the enrichment process is imperfect, algorithms are needed to identify putative Y-specific reads prior to downstream assembly. A strategy that uses k-mer abundances to identify such reads was used to assemble the gorilla Y. However, the strategy required the manual setting of key parameters, a time-consuming process leading to sub-optimal assemblies.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom