z-logo
open-access-imgOpen Access
Efficient and Interpretable Otoscopic Image Classification via Distilled CNN with Adaptive Channel Attention
Author(s) -
Zaka Ur Rehman,
Mohammad Faizal Ahmad Fauzi,
Farhaur Iman Lokman,
Meriem Touhami,
Lokman Saim
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3597769
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Accurate classification of otoscopic ear images is crucial for early diagnosis of ear pathologies such as Chronic Otitis Media, Earwax Plug, and Myringosclerosis. In this study, we propose a novel deep learning framework that employs a knowledge distillation strategy, wherein a high-capacity pre-trained teacher model (Vision Transformer or ResNet101) transfers learned representations to a lightweight student CNN model (EfficientNet-B0). The student network is further enhanced through the integration of an Adaptive Channel Attention (ACA) module, which selectively emphasizes informative features via channel-wise recalibration. The multi-scale feature distillation from the teacher improves generalization while the ACA block boosts sensitivity to clinically relevant regions. We validated our method on a publicly available otoscopic image dataset comprising four balanced classes. Our approach achieved an overall accuracy of 98.75%, with class-wise AUC values of 1.000 (CSOM), 1.000 (Earwax Plug), 0.994 (Myringosclerosis), and 0.992 (Normal), and a micro-average AUC of 0.997. Additionally, Grad-CAM analysis confirmed the model’s focus on diagnostically meaningful areas, supporting the interpretability of the predictions. These results demonstrate the effectiveness of our distillation-based ACA-enhanced architecture in otoscopic image classification, with potential to assist clinical decision-making in primary care and telemedicine applications.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom