Estimating the Number of Subpopulations (K) in Structured Populations
Author(s) -
Robert Verity,
Richard A. Nichols
Publication year - 2016
Publication title -
genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.792
H-Index - 246
eISSN - 1943-2631
pISSN - 0016-6731
DOI - 10.1534/genetics.115.180992
Subject(s) - heuristics , estimator , inference , heuristic , range (aeronautics) , set (abstract data type) , statistics , suite , population , data set , algorithm , mathematics , computer science , biology , mathematical optimization , artificial intelligence , materials science , demography , archaeology , sociology , composite material , history , programming language
A key quantity in the analysis of structured populations is the parameter K, which describes the number of subpopulations that make up the total population. Inference of K ideally proceeds via the model evidence, which is equivalent to the likelihood of the model. However, the evidence in favor of a particular value of K cannot usually be computed exactly, and instead programs such as Structure make use of heuristic estimators to approximate this quantity. We show-using simulated data sets small enough that the true evidence can be computed exactly-that these heuristics often fail to estimate the true evidence and that this can lead to incorrect conclusions about K Our proposed solution is to use thermodynamic integration (TI) to estimate the model evidence. After outlining the TI methodology we demonstrate the effectiveness of this approach, using a range of simulated data sets. We find that TI can be used to obtain estimates of the model evidence that are more accurate and precise than those based on heuristics. Furthermore, estimates of K based on these values are found to be more reliable than those based on a suite of model comparison statistics. Finally, we test our solution in a reanalysis of a white-footed mouse data set. The TI methodology is implemented for models both with and without admixture in the software MavericK1.0.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom