z-logo
open-access-imgOpen Access
A big data pipeline: Identifying dynamic gene regulatory networks from time-course Gene Expression Omnibus data with applications to influenza infection
Author(s) -
Michelle Carey,
Juan Ramirez,
Shuang Wu,
Hulin Wu
Publication year - 2018
Publication title -
statistical methods in medical research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.952
H-Index - 85
eISSN - 1477-0334
pISSN - 0962-2802
DOI - 10.1177/0962280217746719
Subject(s) - gene regulatory network , pipeline (software) , computational biology , computer science , gene expression , regulation of gene expression , gene , data mining , biology , genetics , programming language
A biological host response to an external stimulus or intervention such as a disease or infection is a dynamic process, which is regulated by an intricate network of many genes and their products. Understanding the dynamics of this gene regulatory network allows us to infer the mechanisms involved in a host response to an external stimulus, and hence aids the discovery of biomarkers of phenotype and biological function. In this article, we propose a modeling/analysis pipeline for dynamic gene expression data, called Pipeline4DGEData, which consists of a series of statistical modeling techniques to construct dynamic gene regulatory networks from the large volumes of high-dimensional time-course gene expression data that are freely available in the Gene Expression Omnibus repository. This pipeline has a consistent and scalable structure that allows it to simultaneously analyze a large number of time-course gene expression data sets, and then integrate the results across different studies. We apply the proposed pipeline to influenza infection data from nine studies and demonstrate that interesting biological findings can be discovered with its implementation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here