z-logo
open-access-imgOpen Access
Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
Author(s) -
Kris V. Parag,
Oliver G. Pybus,
Chieh-Hsi Wu
Publication year - 2021
Publication title -
systematic biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 7.128
H-Index - 182
eISSN - 1076-836X
pISSN - 1063-5157
DOI - 10.1093/sysbio/syab037
Subject(s) - coalescent theory , prior probability , population , population size , inference , skyline , piecewise , smoothing , econometrics , bayesian probability , statistics , mathematics , computer science , biology , data mining , artificial intelligence , phylogenetics , mathematical analysis , biochemistry , demography , sociology , gene
In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, $\Omega$, to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using $\Omega$ we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose $\Omega$ as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.]

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here