DeepSense | Zendy

Loc N. Huynh | Zendy; Rajesh Krishna Balan | Zendy; Youngki Lee | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

DeepSense

Author(s) -

Loc N. Huynh,

Rajesh Krishna Balan,

Youngki Lee

Publication year - 2016

Publication title -

singapore management university institutional knowledge (ink) (singapore management university)

Language(s) - English

Resource type - Conference proceedings

ISBN - 978-1-4503-4326-8

DOI - 10.1145/2935643.2935650

Subject(s) - computer science , deep learning , convolutional neural network , scalability , artificial intelligence , variety (cybernetics) , class (philosophy) , vectorization (mathematics) , computer architecture , machine learning , object detection , computer engineering , parallel computing , pattern recognition (psychology) , database

Recently, a branch of machine learning algorithms called deep learning gained huge attention to boost up accuracy of a variety of sensing applications. However, execution of deep learning algorithm such as convolutional neural network on mobile processor is non-trivial due to intensive computational requirements. In this paper, we present our early design of DeepSense - a mobile GPU-based deep convolutional neural network (CNN) framework. For its design, we first explored the differences between server-class and mobile-class GPUs, and studied effectiveness of various optimization strategies such as branch divergence elimination and memory vectorization. Our results show that DeepSense is able to execute a variety of CNN models for image recognition, object detection and face recognition in soft real time with no or marginal accuracy tradeoffs. Experiments also show that our framework is scalable across multiple devices with different GPU architectures (e.g. Adreno and Mali).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research