
Glycemic-aware metrics and oversampling techniques for predicting blood glucose levels using machine learning
Author(s) -
Michael Mayo,
Lynne Chepulis,
Ryan Paul
Publication year - 2019
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0225613
Subject(s) - artificial intelligence , computer science , support vector machine , machine learning , artificial pancreas , preprocessor , regression analysis , multilayer perceptron , hypoglycemia , glycemic , linear regression , regression , statistics , diabetes mellitus , insulin , medicine , mathematics , artificial neural network , type 1 diabetes , endocrinology
Techniques using machine learning for short term blood glucose level prediction in patients with Type 1 Diabetes are investigated. This problem is significant for the development of effective artificial pancreas technology so accurate alerts (e.g. hypoglycemia alarms) and other forecasts can be generated. It is shown that two factors must be considered when selecting the best machine learning technique for blood glucose level regression: (i) the regression model performance metrics being used to select the model, and (ii) the preprocessing techniques required to account for the imbalanced time spent by patients in different portions of the glycemic range. Using standard benchmark data, it is demonstrated that different regression model/preprocessing technique combinations exhibit different accuracies depending on the glycemic subrange under consideration. Therefore technique selection depends on the type of alert required. Specific findings are that a linear Support Vector Regression-based model, trained with normal as well as polynomial features, is best for blood glucose level forecasting in the normal and hyperglycemic ranges while a Multilayer Perceptron trained on oversampled data is ideal for predictions in the hypoglycemic range.