Premium
Disease Detection Analytics: A Simple Linear Convex Programming Algorithm for Breast Cancer and Diabetes Incidence Decisions
Author(s) -
Mukhopadhyay Somnath,
Samaddar Subhashish,
Solis Adriano O.,
Roy Asim
Publication year - 2021
Publication title -
decision sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.238
H-Index - 108
eISSN - 1540-5915
pISSN - 0011-7315
DOI - 10.1111/deci.12348
Subject(s) - computer science , decision tree , artificial intelligence , classifier (uml) , machine learning , naive bayes classifier , support vector machine , logistic regression , bayes classifier , random forest , pattern recognition (psychology) , algorithm , data mining
In the last couple of decades, data analytics‐based pattern classification methods for disease detection have gained much traction in healthcare research and applications. The current study builds linear programming (LP) models for detecting disease incidence. We propose sequential steps of a convex programming algorithm to construct decision boundary functions to classify patterns in disease detection data. We compare the performance of our LP‐based classifier with others (neural network, decision tree, k ‐nearest‐neighbor, logistic regression, naïve‐Bayes, and support‐vector‐machine) on four datasets: two different ones for breast cancer, and one each for diabetes and diabetic retinopathy. Statistical tests reveal that the LP classifier did significantly better than the other methods in five out of eight false‐positive and false‐negative test cases. There is not a statistically significant difference in performance in the remaining three tests between the LP classifier and the best alternative method. Most importantly, the LP classifier has significantly superior performance in both diabetes detection and diabetic retinopathy data. The success of the proposed LP classifier results from avoiding “modeling noise” and “memorization of training data.” We recommend that our proposed LP classifier be among the set of classifiers for use in disease detection analytics.