Learn how to use CUDA Pinned Memory to make your applications run faster. You can’t process if you don’t have the data! Pinned memory is used give your GPU data faster so it can keep busy processing. This is a […]
programming with a smile
When I try to run CUDA code that takes a long time to process on the GPU, I would always get an error such as the following: Error: C:/kernel.cu:170, code: 4, reason: unspecified launch failure After spending many sleepless nights […]
NVIDIA CEO explains Dynamic Parallelism
A video walkthrough of how to go about installing TensorFlow + TensorBoard + Keras + Anaconda Python on a Windows based machine. The video walkthrough is 19+ minutes long.
Learn how to use cooperative groups to make your parallel processing code more organized and manageable. The video walkthrough is 32+ minutes long and includes example source code.
A detailed video walkthrough (16+ min.) of how to start programming in NVIDIA CUDA.