z-logo
open-access-imgOpen Access
DeepSense
Author(s) -
Loc N. Huynh,
Rajesh Krishna Balan,
Youngki Lee
Publication year - 2016
Publication title -
singapore management university institutional knowledge (ink) (singapore management university)
Language(s) - English
Resource type - Conference proceedings
ISBN - 978-1-4503-4326-8
DOI - 10.1145/2935643.2935650
Subject(s) - computer science , deep learning , convolutional neural network , scalability , artificial intelligence , variety (cybernetics) , class (philosophy) , vectorization (mathematics) , computer architecture , machine learning , object detection , computer engineering , parallel computing , pattern recognition (psychology) , database
Recently, a branch of machine learning algorithms called deep learning gained huge attention to boost up accuracy of a variety of sensing applications. However, execution of deep learning algorithm such as convolutional neural network on mobile processor is non-trivial due to intensive computational requirements. In this paper, we present our early design of DeepSense - a mobile GPU-based deep convolutional neural network (CNN) framework. For its design, we first explored the differences between server-class and mobile-class GPUs, and studied effectiveness of various optimization strategies such as branch divergence elimination and memory vectorization. Our results show that DeepSense is able to execute a variety of CNN models for image recognition, object detection and face recognition in soft real time with no or marginal accuracy tradeoffs. Experiments also show that our framework is scalable across multiple devices with different GPU architectures (e.g. Adreno and Mali).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom