Premium
Assessment of performance and reproducibility of applying a content‐based image retrieval scheme for classification of breast lesions
Author(s) -
Gundreddy Rohith Reddy,
Tan Maxine,
Qiu Yuchen,
Cheng Samuel,
Liu Hong,
Zheng Bin
Publication year - 2015
Publication title -
medical physics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.473
H-Index - 180
eISSN - 2473-4209
pISSN - 0094-2405
DOI - 10.1118/1.4922681
Subject(s) - region of interest , reproducibility , artificial intelligence , computer science , pixel , pattern recognition (psychology) , preprocessor , computer aided diagnosis , feature (linguistics) , cad , content based image retrieval , image retrieval , mathematics , image (mathematics) , statistics , linguistics , philosophy , engineering drawing , engineering
Purpose: To develop a new computer‐aided diagnosis (CAD) scheme using a content‐based image retrieval (CBIR) approach for classification between the malignant and benign breast lesions depicted on the digital mammograms and assess CAD performance and reproducibility. Methods: An image dataset including 820 regions of interest (ROIs) was used. Among them, 431 ROIs depict malignant lesions and 389 depict benign lesions. After applying an image preprocessing process to define the lesion center, two image features were computed from each ROI. The first feature is an average pixel value of a mapped region generated using a watershed algorithm. The second feature is an average pixel value difference between a ROI's center region and the rest of the image. A two‐step CBIR approach uses these two features sequentially to search for ten most similar reference ROIs for each queried ROI. A similarity based classification score was then computed to predict the likelihood of the queried ROI depicting a malignant lesion. To assess the reproducibility of the CAD scheme, we selected another independent testing dataset of 100 ROIs. For each ROI in the testing dataset, we added four randomly queried lesion center pixels and examined the variation of the classification scores. Results: The area under the ROC curve (AUC) = 0.962 ± 0.006 was obtained when applying a leave‐one‐out validation method to 820 ROIs. Using the independent testing dataset, the initial AUC value was 0.832 ± 0.040, and using the median classification score of each ROI with five queried seeds, AUC value increased to 0.878 ± 0.035. Conclusions: The authors demonstrated that (1) a simple and efficient CBIR scheme using two lesion density distribution related features achieved high performance in classifying breast lesions without actual lesion segmentation and (2) similar to the conventional CAD schemes using global optimization approaches, improving reproducibility is also one of the challenges in developing CAD schemes using a CBIR based regional optimization approach.