Tajima's D and Site-Specific Nucleotide Frequency in a Population during an Infectious Disease Outbreak
Author(s) -
Ryosuke Omori,
Jian Wu
Publication year - 2017
Publication title -
siam journal on applied mathematics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.954
H-Index - 99
eISSN - 1095-712X
pISSN - 0036-1399
DOI - 10.1137/17m1114946
Subject(s) - biology , population , mutation rate , epidemic model , genetic diversity , outbreak , nucleotide diversity , statistics , evolutionary biology , statistical physics , genetics , demography , mathematics , physics , genotype , virology , haplotype , sociology , gene
Tajima's D measures the difference between two estimates of genetic diversity in a given set of nucleic acid sequences. Here we show how Tajima's D can be calculated/estimated by developing an inductive algorithm for calculating the site-specific nucleotide frequencies from a standard multistrain susceptible-infective-removed (SIR) model (both deterministic and stochastic). We show that these frequencies are fully determined by the mutation rate and the initial condition of the frequencies. We prove that the sign of Tajima's D is independent of the disease population dynamics and that the negative sign does not imply an expansion of the infected population in the deterministic model. Using individual-based simulations, we also show that dependence of Tajima's D on the disease transmission and evolution dynamics is a result of the stochasticity of those dynamics. The same is true for the dependence related to genetic diversity of a pathogen.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom