A complete reference genome improves analysis of human genetic variation
Author(s) -
Sergey Aganezov,
Stephanie M. Yan,
Daniela C. Soto,
Melanie Kirsche,
Samantha Zarate,
Pavel Avdeyev,
Dylan J. Taylor,
Kishwar Shafin,
Alaina Shumate,
Chunlin Xiao,
Justin Wagner,
Jennifer McDaniel,
Nathan D. Olson,
Michael Sauria,
Mitchell R. Vollger,
Arang Rhie,
Melissa Meredith,
Skylar Martin,
Joyce Lee,
Sergey Koren,
Jeffrey Rosenfeld,
Benedict Paten,
Ryan M. Layer,
Chen-Shan Chin,
Fritz J. Sedlazeck,
Nancy F. Hansen,
Danny E. Miller,
Adam M. Phillippy,
Karen H. Miga,
Rajiv C. McCoy,
Megan Y. Dennis,
Justin M. Zook,
Michael C. Schatz
Publication year - 2022
Publication title -
science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 12.556
H-Index - 1186
eISSN - 1095-9203
pISSN - 0036-8075
DOI - 10.1126/science.abl3533
Subject(s) - reference genome , human genome , false positive paradox , genome , biology , computational biology , structural variation , genetics , spurious relationship , population , computer science , gene , artificial intelligence , medicine , machine learning , environmental health
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction of false positives in 269 medically relevant genes by up to a factor of 12. Because of these improvements in variant discovery coupled with population and functional genomic resources, T2T-CHM13 is positioned to replace GRCh38 as the prevailing reference for human genetics.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom