
Optimization of deep learning methods for visualization of tumor heterogeneity and brain tumor grading through digital pathology
Author(s) -
An Hoai Truong,
Viktoriia Sharmanska,
Clara LimbackStanic,
Matthew GrechSollars
Publication year - 2020
Publication title -
neuro-oncology advances
Language(s) - English
Resource type - Journals
ISSN - 2632-2498
DOI - 10.1093/noajnl/vdaa110
Subject(s) - computer science , overfitting , visualization , artificial intelligence , grading (engineering) , convolutional neural network , pattern recognition (psychology) , digital pathology , artificial neural network , machine learning , civil engineering , engineering
Background Variations in prognosis and treatment options for gliomas are dependent on tumor grading. When tissue is available for analysis, grade is established based on histological criteria. However, histopathological diagnosis is not always reliable or straight-forward due to tumor heterogeneity, sampling error, and subjectivity, and hence there is great interobserver variability in readings.Methods We trained convolutional neural network models to classify digital whole-slide histopathology images from The Cancer Genome Atlas. We tested a number of optimization parameters.Results Data augmentation did not improve model training, while a smaller batch size helped to prevent overfitting and led to improved model performance. There was no significant difference in performance between a modular 2-class model and a single 3-class model system. The best models trained achieved a mean accuracy of 73% in classifying glioblastoma from other grades and 53% between WHO grade II and III gliomas. A visualization method was developed to convey the model output in a clinically relevant manner by overlaying color-coded predictions over the original whole-slide image.Conclusions Our developed visualization method reflects the clinical decision-making process by highlighting the intratumor heterogeneity and may be used in a clinical setting to aid diagnosis. Explainable artificial intelligence techniques may allow further evaluation of the model and underline areas for improvements such as biases. Due to intratumor heterogeneity, data annotation for training was imprecise, and hence performance was lower than expected. The models may be further improved by employing advanced data augmentation strategies and using more precise semiautomatic or manually labeled training data.