Premium
A GLM‐based latent variable ordination method for microbiome samples
Author(s) -
B. Sohn Michael,
Li Hongzhe
Publication year - 2018
Publication title -
biometrics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.298
H-Index - 130
eISSN - 1541-0420
pISSN - 0006-341X
DOI - 10.1111/biom.12775
Subject(s) - ordination , overdispersion , microbiome , covariate , latent variable , statistics , multivariate statistics , cluster analysis , computer science , dispersion (optics) , poisson distribution , econometrics , mathematics , count data , biology , bioinformatics , physics , optics
Summary Distance‐based ordination methods, such as principal coordinates analysis (PCoA), are widely used in the analysis of microbiome data. However, these methods are prone to pose a potential risk of misinterpretation about the compositional difference in samples across different populations if there is a difference in dispersion effects. Accounting for high sparsity and overdispersion of microbiome data, we propose a G LM‐based O rdination M ethod for M icrobiome S amples (GOMMS) in this article. This method uses a zero‐inflated quasi–Poisson (ZIQP) latent factor model. An EM algorithm based on the quasi‐likelihood is developed to estimate parameters. It performs comparatively to the distance‐based approach when dispersion effects are negligible and consistently better when dispersion effects are strong, where the distance‐based approach sometimes yields undesirable results. The estimated latent factors from GOMMS can be used to associate the microbiome community with covariates or outcomes using the standard multivariate tests, which can be investigated in future confirmatory experiments. We illustrate the method in simulations and an analysis of microbiome samples from nasopharynx and oropharynx.