A Comparison of Artificial Intelligence and Human Diabetic Retinal Image Interpretation in an Urban Health System
Author(s) -
Nikita Mokhashi,
Julia Grachevskaya,
Lorrie Cheng,
Daohai Yu,
Xiaoning Lu,
Yi Zhang,
Jeffrey Henderer
Publication year - 2021
Publication title -
Journal of Diabetes Science and Technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.039
H-Index - 75
eISSN - 1932-3107
pISSN - 1932-2968
DOI - 10.1177/1932296821999370
Subject(s) - medicine, McNemar's test, diabetic retinopathy, optometry, eye examination, retinal, ophthalmology, diabetes mellitus, visual acuity, statistics, mathematics, endocrinology
Background: Artificial intelligence (AI) diabetic retinopathy (DR) software has the potential to decrease time spent by clinicians on image interpretation and expand the scope of DR screening. We performed a retrospective review to compare Eyenuk’s EyeArt software (Woodland Hills, CA) to Temple Ophthalmology optometry grading using the International Classification of Diabetic Retinopathy scale.
Methods: Two hundred and sixty consecutive diabetic patients from the Temple Faculty Practice Internal Medicine clinic underwent 2-field retinal imaging. Classifications of the images by the software and the optometrist were analyzed using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and McNemar’s test. Ungradable images were analyzed to identify relationships with HbA1c, age, and ethnicity. Disagreements and a sample of 20% of agreements were adjudicated by a retina specialist.
Results: On patient-level comparison, sensitivity for the software was 100%, while specificity was 77.78%. PPV was 19.15%, and NPV was 100%. All 38 disagreements between the software and the optometrist occurred when the optometrist classified a patient’s images as non-referable while the software classified them as referable. Of these disagreements, the retina specialist agreed with the optometrist 57.9% of the time (22/38). Of the agreements, the retina specialist agreed with both the program and the optometrist 96.7% of the time (28/29). The number of ungradable photos differed significantly between older patients (≥60 years) and younger patients (<60 years) (p=0.003).
Conclusions: The AI program showed high sensitivity with acceptable specificity for a screening algorithm. The high NPV indicates that the software is unlikely to miss DR, while the low PPV indicates that it may refer patients unnecessarily.
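The reported percentages can be reproduced from a patient-level 2x2 table. The sketch below is illustrative only: the counts (9 true positives, 38 false positives, 0 false negatives, 133 true negatives) are not given in the abstract. They are back-calculated from the reported PPV, specificity, and the 38 disagreements, treating the optometrist's grade as the reference standard. The abstract also does not say which form of McNemar's test was used, so the exact binomial version shown here is an assumption, and the function names are hypothetical.

```python
from math import comb


def screening_metrics(tp, fp, fn, tn):
    """Standard screening metrics from a 2x2 table (reference = optometrist)."""
    return {
        "sensitivity": tp / (tp + fn),  # referable cases the software flagged
        "specificity": tn / (tn + fp),  # non-referable cases the software cleared
        "ppv": tp / (tp + fp),          # software referrals that were truly referable
        "npv": tn / (tn + fn),          # software non-referrals that were truly non-referable
    }


def mcnemar_exact_p(b, c):
    """Two-sided exact (binomial) McNemar p-value from the two discordant counts."""
    n = b + c
    if n == 0:
        return 1.0  # no discordant pairs, so no evidence of disagreement
    k = min(b, c)
    # Probability of a split at least this lopsided under Binomial(n, 0.5).
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# ASSUMED counts, inferred from the reported percentages (not stated in the source).
TP, FP, FN, TN = 9, 38, 0, 133

metrics = screening_metrics(TP, FP, FN, TN)
# Discordant pairs: software referable / optometrist non-referable (FP), and the reverse (FN).
p_value = mcnemar_exact_p(FP, FN)

for name, value in metrics.items():
    print(f"{name}: {value * 100:.2f}%")
print(f"McNemar exact p-value: {p_value:.2e}")
```

Under these assumed counts, every disagreement falls in one direction (software referable, optometrist non-referable), which is why sensitivity and NPV are both 100% while PPV is low.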