Open Access
Identification of conserved transcriptome features between humans and Drosophila in the aging brain utilizing machine learning on combined data from the NIH Sequence Read Archive
Author(s) -
J. L. Webb,
Simon M. Moe,
Andrew Bolstad,
Elizabeth McNeill
Publication year - 2021
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0255085
Subject(s) - transcriptome , drosophila melanogaster , biology , conserved sequence , drosophila (subgenus) , computational biology , evolutionary biology , genetics , neurodegeneration , gene , identification (biology) , gene expression , disease , peptide sequence , ecology , medicine , pathology
Aging is universal, yet characterizing the molecular changes that occur in aging which lead to an increased risk for neurological disease remains a challenging problem. Aging affects the prefrontal cortex (PFC), which governs executive function, learning, and memory. Previous sequencing studies have demonstrated that aging alters gene expression in the PFC, however the extent to which these changes are conserved across species and are meaningful in neurodegeneration is unknown. Identifying conserved, age-related genetic and morphological changes in the brain allows application of the wealth of tools available to study underlying mechanisms in model organisms such as Drosophila melanogaster . RNA sequencing data from human PFC and fly heads were analyzed to determine conserved transcriptome signatures of age. Our analysis revealed that expression of 50 conserved genes can accurately determine age in Drosophila ( R 2 = 0.85) and humans ( R 2 = 0.46). These transcriptome signatures were also able to classify Drosophila into three age groups with a mean accuracy of 88% and classify human samples with a mean accuracy of 69%. Overall, this work identifies 50 highly conserved aging-associated genetic changes in the brain that can be further studied in model organisms and demonstrates a novel approach to uncovering genetic changes conserved across species from multi-study public databases.