
Data augmentation in dermatology image recognition using machine learning
Author(s) -
Aggarwal st Lt. Pushkar
Publication year - 2019
Publication title -
skin research and technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.521
H-Index - 69
eISSN - 1600-0846
pISSN - 0909-752X
DOI - 10.1111/srt.12726
Subject(s) - rosacea , acne , atopic dermatitis , medicine , dermatology , psoriasis , impetigo , area under curve , area under the curve , artificial intelligence , computer science , pharmacokinetics
Background Each year in the United States, over 80 million people are affected by acne, atopic dermatitis, rosacea, psoriasis, and impetigo. Artificial intelligence and machine learning could prove to be a good tool for assisting in the diagnosis of dermatological conditions. The objective of this study was to evaluate the use of data augmentation in machine learning image recognition of five dermatological disease manifestations—acne, atopic dermatitis, impetigo, psoriasis, and rosacea. Materials and Methods Open‐source dermatological images were gathered and used to retrain TensorFlow Inception version‐3. Retraining was done twice—once with and once without data augmentation. Both models were tested with the same images, and R software was used to perform statistical analysis. Results The average of each of the statistical measures (sensitivity, specificity, PPV, NPN, MCC, and F1 Score) increased when data augmentation was added to the model. In particular, the average Matthews correlation coefficient increased by 7.7%. Each of the five dermatological manifestations had an increase in area under the curve (AUC) after data augmentation with the average increase in AUC of 0.132 and a standard deviation of 0.033. Atopic dermatitis had the highest increase in AUC of 0.18. With data augmentation, the lowest AUC was 0.87 for psoriasis and the highest was 0.97 for acne, indicating that the model performs well. Conclusion With a deep learning‐based approach, it is possible to differentiate dermatological images with appreciable MCC, F1 score, and AUC. Further, data augmentation can be used to increase the model's accuracy by a significant amount.