Characterizing the Major Structural Variant Alleles of the Human Genome
Author(s) -
Peter A. Audano,
Arvis Sulovari,
Tina A. Graves-Lindsay,
Stuart Cantsilieris,
Melanie Sorensen,
AnneMarie E. Welch,
Max L. Dougherty,
Bradley J. Nelson,
Ankeeta Shah,
Susan K. Dutcher,
Wesley C. Warren,
Vincent Magrini,
Sean McGrath,
Yang Li,
Richard K. Wilson,
Evan E. Eichler
Publication year - 2019
Publication title -
cell
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 26.304
H-Index - 776
eISSN - 1097-4172
pISSN - 0092-8674
DOI - 10.1016/j.cell.2018.12.019
Subject(s) - biology , genetics , genome , human genome , 1000 genomes project , tandem repeat , genotyping , computational biology , structural variation , genome project , reference genome , annotation , gene , single nucleotide polymorphism , genotype
In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom