z-logo
open-access-imgOpen Access
Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
Author(s) -
Alexander Klassmann,
Mathieu Gautier
Publication year - 2022
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0262024
Subject(s) - haplotype , population , selection (genetic algorithm) , statistic , statistics , multiple comparisons problem , biology , computational biology , computer science , genetics , artificial intelligence , mathematics , genotype , demography , sociology , gene
Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here