z-logo
open-access-imgOpen Access
RAt-CapsNet: A Deep Learning Network Utilizing Attention and Regional Information for Abnormality Detection in Wireless Capsule Endoscopy
Author(s) -
Md. Jahin Alam,
Rifat Bin Rashid,
Shaikh Anowarul Fattah,
Mohammad Saquib
Publication year - 2022
Publication title -
ieee journal of translational engineering in health and medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.653
H-Index - 24
ISSN - 2168-2372
DOI - 10.1109/jtehm.2022.3198819
Subject(s) - bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , signal processing and analysis , robotics and control systems , general topics for engineers
Background: The emergence of wireless capsule endoscopy (WCE) has presented a viable non-invasive mean of identifying gastrointestinal diseases in the field of clinical gastroenterology. However, to overcome its extended time of manual inspection, a computer aided automatic detection system is getting vast popularity. In this case, major challenges are low resolution and lack of regional context in images extracted from WCE videos. Methods: For tackling these challenges, in this paper a convolution neural network (CNN) based architecture, namely RAt-CapsNet, is proposed that reliably employs regional information and attention mechanism to classify abnormalities from WCE video data. The proposed RAt-CapsNet consists of two major pipelines: Compression Pipeline and Regional Correlative Pipeline. In the compression pipeline, an encoder module is designed using a Volumetric Attention Mechanism which provides 3D enhancement to feature maps using spatial domain condensation as well as channel-wise filtering for preserving relevant structural information of images. On the other hand, the regional correlative pipeline consists of Pyramid Feature Extractor which operates on image driven feature vectors to generalize and propagate local relationships of pixels from WCE abnormalities with respect to the normal healthy surrounding. The feature vectors generated by the pipelines are then accumulated to formulate a classification standpoint. Results: Promising computational accuracy of mean 98.51% in binary class and over 95.65% in multi-class are obtained through extensive experimentation on a highly unbalanced public dataset with over 47 thousand labelled. Conclusion: This outcome in turn supports the efficacy of the proposed methodology as a noteworthy WCE abnormality detection as well as diagnostic system.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom