Premium
Reliability analysis framework for computer‐assisted medical decision systems
Author(s) -
Habas Piotr A.,
Zurada Jacek M.,
Elmaghraby Adel S.,
Tourassi Georgia D.
Publication year - 2007
Publication title -
medical physics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.473
H-Index - 180
eISSN - 2473-4209
pISSN - 0094-2405
DOI - 10.1118/1.2432409
Subject(s) - reliability (semiconductor) , cad , computer science , data mining , receiver operating characteristic , support vector machine , measure (data warehouse) , artificial neural network , artificial intelligence , set (abstract data type) , machine learning , pattern recognition (psychology) , power (physics) , quantum mechanics , engineering drawing , engineering , programming language , physics
We present a technique that enhances computer‐assisted decision (CAD) systems with the ability to assess the reliability of each individual decision they make. Reliability assessment is achieved by measuring the accuracy of a CAD system with known cases similar to the one in question. The proposed technique analyzes the feature space neighborhood of the query case to dynamically select an input‐dependent set of known cases relevant to the query. This set is used to assess the local (query‐specific) accuracy of the CAD system. The estimated local accuracy is utilized as a reliability measure of the CAD response to the query case. The underlying hypothesis of the study is that CAD decisions with higher reliability are more accurate. The above hypothesis was tested using a mammographic database of 1337 regions of interest (ROIs) with biopsy‐proven ground truth (681 with masses, 656 with normal parenchyma). Three types of decision models, ( i ) a back‐propagation neural network (BPNN), ( i i ) a generalized regression neural network (GRNN), and ( i i i ) a support vector machine (SVM), were developed to detect masses based on eight morphological features automatically extracted from each ROI. The performance of all decision models was evaluated using the Receiver Operating Characteristic (ROC) analysis. The study showed that the proposed reliability measure is a strong predictor of the CAD system's case‐specific accuracy. Specifically, the ROC area index for CAD predictions with high reliability was significantly better than for those with low reliability values. This result was consistent across all decision models investigated in the study. The proposed case‐specific reliability analysis technique could be used to alert the CAD user when an opinion that is unlikely to be reliable is offered. The technique can be easily deployed in the clinical environment because it is applicable with a wide range of classifiers regardless of their structure and it requires neither additional training nor building multiple decision models to assess the case‐specific CAD accuracy.