z-logo
open-access-imgOpen Access
Implementation of stereo matching using a high level compiler for parallel computing acceleration
Author(s) -
Jinglin Zhang,
Jean François Nezan,
Jean-Gabriel Cousin,
Erwan Raffin
Publication year - 2012
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2425836.2425892
Subject(s) - computer science , compiler , cuda , parallel computing , general purpose computing on graphics processing units , symmetric multiprocessor system , supercomputer , computer architecture , software , computational science , graphics , operating system
Heterogeneous computing systems increase the performance of parallel computing in many domains of general purpose computing with CPU, GPU and other accelerators. With Hardware developments, the software developments like Compute Unified Device Architecture (CUDA) and Open Computing Language (OpenCL) try to offer a simple and visual framework for parallel computing. But it turns out to be more difficult than programming on CPU platform for optimization of performance. For one kind of parallel computing application, there are different configurations and parameters for various hardware platforms. In this paper, we apply the Hybrid Multi-cores Parallel Programming (HMPP) to automatically generate tunable code for GPU platform and show the results of implementation of Stereo Matching with detailed comparison with C code version and manual CUDA version. The experimental results show that default and optimized HMPP have approximately the same performance and the better quality of disparity map compared with CUDA implementation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom