Prediction analysis for microbiome sequencing data | Zendy

Wang Tao | Zendy; Yang Can | Zendy; Zhao Hongyu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Prediction analysis for microbiome sequencing data

Author(s) -

Wang Tao,

Yang Can,

Zhao Hongyu

Publication year - 2019

Publication title -

biometrics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.298

H-Index - 130

eISSN - 1541-0420

pISSN - 0006-341X

DOI - 10.1111/biom.13061

Subject(s) - microbiome , metagenomics , covariate , computer science , human microbiome , regression , data mining , regression analysis , expectation–maximization algorithm , machine learning , statistics , maximum likelihood , mathematics , biology , bioinformatics , biochemistry , gene

One goal of human microbiome studies is to relate host traits with human microbiome compositions. The analysis of microbial community sequencing data presents great statistical challenges, especially when the samples have different library sizes and the data are overdispersed with many zeros. To address these challenges, we introduce a new statistical framework, called predictive analysis in metagenomics via inverse regression (PAMIR), to analyze microbiome sequencing data. Within this framework, an inverse regression model is developed for overdispersed microbiota counts given the trait, and then a prediction rule is constructed by taking advantage of the dimension‐reduction structure in the model. An efficient Monte Carlo expectation‐maximization algorithm is proposed for maximum likelihood estimation. The method is further generalized to accommodate other types of covariates. We demonstrate the advantages of PAMIR through simulations and two real data examples.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research