A Comparison of Artificial Intelligence and Human Diabetic Retinal Image Interpretation in an Urban Health System | Zendy

Nikita Mokhashi | Zendy; Julia Grachevskaya | Zendy; Lorrie Cheng | Zendy; Daohai Yu | Zendy; Xiaoning Lu | Zendy; Yi Zhang | Zendy; Jeffrey Henderer | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A Comparison of Artificial Intelligence and Human Diabetic Retinal Image Interpretation in an Urban Health System

Author(s) -

Nikita Mokhashi,

Julia Grachevskaya,

Lorrie Cheng,

Daohai Yu,

Xiaoning Lu,

Yi Zhang,

Jeffrey Henderer

Publication year - 2021

Publication title -

journal of diabetes science and technology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.039

H-Index - 75

eISSN - 1932-3107

pISSN - 1932-2968

DOI - 10.1177/1932296821999370

Subject(s) - medicine , mcnemar's test , diabetic retinopathy , optometry , eye examination , retinal , ophthalmology , diabetes mellitus , visual acuity , statistics , mathematics , endocrinology

Artificial intelligence (AI) diabetic retinopathy (DR) software has the potential to decrease time spent by clinicians on image interpretation and expand the scope of DR screening. We performed a retrospective review to compare Eyenuk’s EyeArt software (Woodland Hills, CA) to Temple Ophthalmology optometry grading using the International Classification of Diabetic Retinopathy scale.Methods: Two hundred and sixty consecutive diabetic patients from the Temple Faculty Practice Internal Medicine clinic underwent 2-field retinal imaging. Classifications of the images by the software and optometrist were analyzed using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and McNemar’s test. Ungradable images were analyzed to identify relationships with HbA1c, age, and ethnicity. Disagreements and a sample of 20% of agreements were adjudicated by a retina specialist.Results: On patient level comparison, sensitivity for the software was 100%, while specificity was 77.78%. PPV was 19.15%, and NPV was 100%. The 38 disagreements between software and optometrist occurred when the optometrist classified a patient’s images as non-referable while the software classified them as referable. Of these disagreements, a retina specialist agreed with the optometrist 57.9% the time (22/38). Of the agreements, the retina specialist agreed with both the program and the optometrist 96.7% of the time (28/29). There was a significant difference in numbers of ungradable photos in older patients (≥60) vs younger patients (<60) (p=0.003).Conclusions: The AI program showed high sensitivity with acceptable specificity for a screening algorithm. The high NPV indicates that the software is unlikely to miss DR but may refer patients unnecessarily.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research