Ancestry adjustment improves genome-wide estimates of regional intolerance
Author(s) -
Tristan J. Hayeck,
Nicholas Stong,
Evan H. Baugh,
Ryan S. Dhindsa,
Tychele N. Turner,
Ayan Malakar,
Timothy L. Mosbruger,
Grace Tzun-Wen Shaw,
Yuncheng Duan,
Iuliana IonitaLaza,
David B. Goldstein,
Andrew S. Allen
Publication year - 2022
Publication title -
genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.792
H-Index - 246
eISSN - 1943-2631
pISSN - 0016-6731
DOI - 10.1093/genetics/iyac050
Subject(s) - biology , selection (genetic algorithm) , genetics , genome , evolutionary biology , leverage (statistics) , population , negative selection , prioritization , computational biology , gene , machine learning , computer science , demography , management science , sociology , economics
Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom