Hypothesis Testing and Power Calculations for Taxonomic-Based Human Microbiome Data
Author(s) -
Patricio S. La Rosa,
J. Paul Brooks,
Elena Deych,
Edward L. Boone,
David Edwards,
Qin Wang,
Erica Sodergren,
George M. Weinstock,
William D. Shan
Publication year - 2012
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0052078
Subject(s) - bootstrapping (finance) , microbiome , sample size determination , parametric statistics , computer science , statistical hypothesis testing , resampling , statistics , permutation (music) , data mining , computational biology , biology , bioinformatics , mathematics , econometrics , artificial intelligence , physics , acoustics
This paper presents new biostatistical methods for the analysis of microbiome data based on a fully parametric approach using all the data. The Dirichlet-multinomial distribution allows the analyst to calculate power and sample sizes for experimental design, perform tests of hypotheses (e.g., compare microbiomes across groups), and to estimate parameters describing microbiome properties. The use of a fully parametric model for these data has the benefit over alternative non-parametric approaches such as bootstrapping and permutation testing, in that this model is able to retain more information contained in the data. This paper details the statistical approaches for several tests of hypothesis and power/sample size calculations, and applies them for illustration to taxonomic abundance distribution and rank abundance distribution data using HMP Jumpstart data on 24 subjects for saliva, subgingival, and supragingival samples. Software for running these analyses is available.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom