
Machine learning-based estimation of cognitive performance using regional brain MRI markers: the Northern Manhattan Study
Author(s) -
Michelle R. Caunca,
Lily Wang,
Ying Kuen Cheung,
Noam Alperin,
Sang Lee,
Mitchell S.V. Elkind,
Ralph L. Sacco,
Clinton B. Wright,
Tatjana Rundek
Publication year - 2020
Publication title -
brain imaging and behavior
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.239
H-Index - 50
eISSN - 1931-7565
pISSN - 1931-7557
DOI - 10.1007/s11682-020-00325-3
Subject(s) - random forest , elastic net regularization , artificial intelligence , machine learning , regression , mean squared error , support vector machine , neuroimaging , statistics , cross validation , computer science , regression analysis , cognition , pattern recognition (psychology) , mathematics , psychology , psychiatry , neuroscience
High dimensional neuroimaging datasets and machine learning have been used to estimate and predict domain-specific cognition, but comparisons with simpler models composed of easy-to-measure variables are limited. Regularization methods in particular may help identify regions-of-interest related to domain-specific cognition. Using data from the Northern Manhattan Study, a cohort study of mostly Hispanic older adults, we compared three models estimating domain-specific cognitive performance: sociodemographics and APOE ε4 allele status (basic model), the basic model and MRI markers, and a model with only MRI markers. We used several machine learning methods to fit our regression models: elastic net, support vector regression, random forest, and principal components regression. Model performance was assessed with the RMSE, MAE, and R 2 statistics using 5-fold cross-validation. To assess whether prediction models with imaging biomarkers were more predictive than prediction models built with randomly generated biomarkers, we refit the elastic net models using 1000 datasets with random biomarkers and compared the distribution of the RMSE and R 2 in models using these random biomarkers to the RMSE and R 2 from observed models. Basic models explained ~ 31-38% of the variance in domain-specific cognition. Addition of MRI markers did not improve estimation. However, elastic net models with only MRI markers performed significantly better than random MRI markers (one-sided P < .05) and yielded regions-of-interest consistent with previous literature and others not previously explored. Therefore, structural brain MRI markers may be more useful for etiological than predictive modeling.