A very simple example of transferring data from the device (GPU) to the host (CPU) using CUDA. If you have any questions, contact me on Twitter @cudaeducation or comment on the YouTube video. Download code: CUDA Transfer Data from Host to […]
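For a rough idea of what a device-to-host transfer involves, here is a minimal sketch. It is not the downloadable code from the post. The kernel `fill_with_index` and the helper `fill_on_device_and_copy_back` are hypothetical names made up for illustration, and the sketch assumes `n` is greater than zero and that a CUDA-capable GPU is present.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread writes its own global index into the array,
// so there is something on the device worth copying back.
__global__ void fill_with_index(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i] = i;
    }
}

// Hypothetical helper: fills an n-element array on the GPU, then copies it
// back to the CPU. Assumes n > 0. Returns an empty vector if a CUDA call fails.
std::vector<int> fill_on_device_and_copy_back(int n)
{
    std::vector<int> host(n);
    int *device = nullptr;
    size_t bytes = static_cast<size_t>(n) * sizeof(int);

    if (cudaMalloc(&device, bytes) != cudaSuccess) {
        return {};
    }

    int threads = 256;
    int blocks = (n + threads - 1) / threads;  // round up so every element is covered
    fill_with_index<<<blocks, threads>>>(device, n);

    // Device (GPU) -> host (CPU). On the default stream this call waits for
    // the kernel above to finish before copying.
    cudaError_t err = cudaMemcpy(host.data(), device, bytes, cudaMemcpyDeviceToHost);
    cudaFree(device);

    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return {};
    }
    return host;
}
```

Calling `fill_on_device_and_copy_back(1000)` should hand back a vector whose element `i` holds the value `i`. The direction of the copy is set entirely by the last argument to `cudaMemcpy`; swapping in `cudaMemcpyHostToDevice` (and swapping the two pointers) moves data the other way.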
Category: Tutorial
GPU vs CPU Programming | CUDA Programming Introduction | nVidia CUDA Overview | YouTube Video Walkthrough
A quick comparison of GPU programming vs. CPU programming and the battles one has to face when programming in parallel.
CUDA Dynamic Parallelism using Visual Studio 2017 on a Windows based machine
UPDATE: This video and content applies to CUDA Toolkit 9. I have since moved to CUDA Toolkit 10 and I didn’t have any of these issues. So you are combing through the internet trying to find a way to successfully […]
CUDA Programming Example | 20 Million Array Addition | CUDA Tutorial | CUDA Array Addition [UPDATED]
The following code takes an array of 20 million integers and adds all the numbers together to get a final answer. I have heavily commented the code for your convenience. CREDIT: Professional CUDA C Programming by John […]
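As a rough illustration of how a sum like this can be done on the GPU, here is a minimal sketch. It is one common approach (a shared-memory reduction per block, combined with `atomicAdd`), not necessarily the approach used in the post or in the credited book. The names `block_sum`, `sum_on_gpu`, and `kThreads` are hypothetical, the input values are assumed to be non-negative, and error checking is left out to keep it short.

```cuda
#include <vector>
#include <cuda_runtime.h>

constexpr int kThreads = 256;  // threads per block; must be a power of two

// Hypothetical kernel: each block reduces its share of the input in shared
// memory, then one thread per block adds the block's result to the total.
__global__ void block_sum(const int *in, unsigned long long *total, size_t n)
{
    __shared__ unsigned long long partial[kThreads];

    // Grid-stride loop: each thread accumulates several elements locally,
    // so a fixed-size grid can cover an array of any length.
    unsigned long long local = 0;
    size_t stride = static_cast<size_t>(blockDim.x) * gridDim.x;
    for (size_t i = static_cast<size_t>(blockIdx.x) * blockDim.x + threadIdx.x;
         i < n; i += stride) {
        local += in[i];  // assumes non-negative values
    }
    partial[threadIdx.x] = local;
    __syncthreads();

    // Tree reduction within the block: halve the active threads each step.
    for (unsigned int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (threadIdx.x < s) {
            partial[threadIdx.x] += partial[threadIdx.x + s];
        }
        __syncthreads();
    }

    if (threadIdx.x == 0) {
        atomicAdd(total, partial[0]);
    }
}

// Hypothetical host wrapper: copies the data to the GPU, launches the kernel,
// and copies the single result back. Error checking omitted for brevity.
unsigned long long sum_on_gpu(const std::vector<int> &values)
{
    size_t n = values.size();
    if (n == 0) {
        return 0;
    }

    int *d_in = nullptr;
    unsigned long long *d_total = nullptr;
    unsigned long long total = 0;

    cudaMalloc(&d_in, n * sizeof(int));
    cudaMalloc(&d_total, sizeof(unsigned long long));
    cudaMemcpy(d_in, values.data(), n * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemset(d_total, 0, sizeof(unsigned long long));

    int blocks = 1024;  // fixed grid; the grid-stride loop handles the rest
    block_sum<<<blocks, kThreads>>>(d_in, d_total, n);

    cudaMemcpy(&total, d_total, sizeof(total), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_total);
    return total;
}
```

The total is accumulated as `unsigned long long` rather than `int` because the sum of 20 million integers can easily exceed what a 32-bit `int` holds. A quick sanity check is to pass in 20 million ones and confirm the result is 20,000,000.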
Thread Hierarchies in CUDA | Sorting out the mess
GPU = Graphics Processing Unit (e.g. Nvidia GeForce GTX 1050 Ti). CPU = Central Processing Unit (e.g. Intel Core i7). When programming your graphics processor, there are a lot more moving parts than when programming on the CPU. Coders go […]