Decomposing the Site Frequency Spectrum: The Impact of Tree Topology on Neutrality Tests
Author(s) -
Luca Ferretti,
Alice Ledda,
Thomas Wiehe,
Guillaume Achaz,
Sebastián E. RamosOnsins
Publication year - 2017
Publication title -
genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.792
H-Index - 246
eISSN - 1943-2631
pISSN - 0016-6731
DOI - 10.1534/genetics.116.188763
Subject(s) - coalescent theory , tree (set theory) , estimator , statistics , population , neutral theory of molecular evolution , neutrality , mathematics , biology , statistical physics , phylogenetic tree , combinatorics , physics , genetics , demography , epistemology , philosophy , sociology , gene
We investigate the dependence of the site frequency spectrum on the topological structure of genealogical trees. We show that basic population genetic statistics, for instance, estimators of θ or neutrality tests such as Tajima's D , can be decomposed into components of waiting times between coalescent events and of tree topology. Our results clarify the relative impact of the two components on these statistics. We provide a rigorous interpretation of positive or negative values of an important class of neutrality tests in terms of the underlying tree shape. In particular, we show that values of Tajima's D and Fay and Wu's H depend in a direct way on a peculiar measure of tree balance, which is mostly determined by the root balance of the tree. We present a new test for selection in the same class as Fay and Wu's H and discuss its interpretation and power. Finally, we determine the trees corresponding to extreme expected values of these neutrality tests and present formulas for these extreme values as a function of sample size and number of segregating sites.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom