Project Name: CUDA Computing
This project is an appraisal of a computing environment based on Nvidia ARM and Intel for COTS and research applications. We will benchmark COTS (commercial off the shelf) applications including a few that have been optimized and certified for Nvidia Maximus (Quadro and Tesla CUDA). We will benchmark scientific and research applications on ARM, Quadro, and CUDA and attempt to fine tune the conversion of applications from running under x86 to ARM computing environment.
Furthermore, Semi-global matching (SGM) is one of the best ways for doing stereo matching in computer vision currently. We would like to have our own experience of applying CUDA to speed up SGM, and the comparison could also be drawn with belief propagation stereo on CUDA.
First of all, we need to know what is a CUDA Computing?
Compute Unified Device Architecture(CUDA) is a parallel computing platform and programming model invented by NVIDIA. It enables dramatic increases in computing performance by harnessing the power of the graphics processing unit (GPU).

The diagram shows how an application that normally runs in the CPU of a PC is ported over to the GPU.
At the center of this parallel computing model is CUDA (Compute United Device Architecture) which is NVIDIA’s parallel computing hardware and programming model. CUDA provides developer tools that have made parallel programming easy and accessible.