Utilizing mapping targets of sequences underrepresented in the reference assembly to reduce false positive alignments | Zendy

Karen H. Miga | Zendy; Christopher Eisenhart | Zendy; W. James Kent | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Utilizing mapping targets of sequences underrepresented in the reference assembly to reduce false positive alignments

Author(s) -

Karen H. Miga,

Christopher Eisenhart,

W. James Kent

Publication year - 2015

Publication title -

nucleic acids research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.008

H-Index - 537

eISSN - 1362-4954

pISSN - 0305-1048

DOI - 10.1093/nar/gkv671

Subject(s) - biology , genome , computational biology , human genome , set (abstract data type) , centromere , genetics , reference genome , evolutionary biology , computer science , gene , chromosome , programming language

The human reference assembly remains incomplete due to the underrepresentation of repeat-rich sequences that are found within centromeric regions and acrocentric short arms. Although these sequences are marginally represented in the assembly, they are often fully represented in whole-genome short-read datasets and contribute to inappropriate alignments and high read-depth signals that localize to a small number of assembled homologous regions. As a consequence, these regions often provide artifactual peak calls that confound hypothesis testing and large-scale genomic studies. To address this problem, we have constructed mapping targets that represent roughly 8% of the human genome generally omitted from the human reference assembly. By integrating these data into standard mapping and peak-calling pipelines we demonstrate a 10-fold reduction in signals in regions common to the blacklisted region and identify a comprehensive set of regions that exhibit mapping sensitivity with the presence of the repeat-rich targets.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research