Analyzing the Resilience of Convolutional Neural Networks Implemented on GPUs | Zendy

Khalid Adam | Zendy; Izzeldin I. Mohd | Zendy; Yusuf Ibrahim | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Analyzing the Resilience of Convolutional Neural Networks Implemented on GPUs

Author(s) -

Khalid Adam,

Izzeldin I. Mohd,

Yusuf Ibrahim

Publication year - 2021

Publication title -

international journal of electrical and computer engineering systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.141

H-Index - 4

eISSN - 1847-7003

pISSN - 1847-6996

DOI - 10.32985/ijeces.12.2.4

Subject(s) - computer science , convolutional neural network , compiler , kernel (algebra) , fault injection , soft error , resilience (materials science) , reliability (semiconductor) , parallel computing , latency (audio) , overhead (engineering) , computer engineering , embedded system , artificial intelligence , operating system , software , telecommunications , power (physics) , physics , mathematics , combinatorics , quantum mechanics , electronic engineering , engineering , thermodynamics

There have been an extensive use of Convolutional Neural Networks (CNNs)in healthcare applications. Presently, GPUs are the most prominent and dominated DNN accelerators to increase the execution speed of CNN algorithms to improve their performance as well as the Latency. However, GPUs are prone to soft errors. These errors can impact the behaviors of the GPU dramatically. Thus, the generated fault may corruptdata values or logic operations and cause errors, such as Silent Data Corruption. unfortunately, soft errors propagate from the physical level(microarchitecture) to the application level (CNN model). This paper analyzes the reliability of the AlexNet model based on two metrics: (1) critical kernel vulnerability (CKV) used to identify the malfunction andlight- malfunction errors in each kernel, and (2) critical layer vulnerability (CLV) used to track the malfunction and light-malfunction errors through layers. To achieve this, we injected the AlexNet which was popularly used in healthcare applications on NVIDIA’s GPU, using theSASSIFI fault injector as the major evaluator tool. The experiments demonstrate through the average error percentage that caused malfunctionof the models has been reduced from 3.7% to 0.383% by hardening only the vulnerable part with the overhead only 0.2923%. This is a high improvement in the model reliability for healthcare applications.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore