A very basic video walkthrough of how to use Nsight Systems to help optimize your application. Nsight Systems is software from NVIDIA and is mainly intended to work with NVIDIA graphics cards and the CUDA programming language. This […]
Tag: cuda programming
CUDA Graphs Tutorial | How to launch CUDA Graph by Stream Capture & Explicit API Method | Video Walkthrough (57+ min.) | Includes Source Code
Learn how to use CUDA Graphs to make your application run faster and more efficiently. This video walkthrough shows you how to create CUDA Graphs by the Stream Capture Method and the Explicit API method. It also includes source code.
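For readers who want a feel for the Stream Capture Method before watching, here is a minimal sketch. It is not the source code from the video. The kernel `addOne` and the helper `runCapturedGraph` are made-up names for illustration, error checking is left out for brevity, and `cudaGraphInstantiateWithFlags` assumes CUDA 11.4 or newer.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration only: adds 1.0f to every element.
__global__ void addOne(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

// Records three addOne launches into a graph via stream capture, replays the
// graph `replays` times, and returns element 0 (the buffer starts at zero).
float runCapturedGraph(int replays) {
    const int n = 1024;
    float *d = nullptr;
    cudaMalloc(&d, n * sizeof(float));
    cudaMemset(d, 0, n * sizeof(float));

    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Work issued to the stream between Begin and End is recorded, not run.
    cudaGraph_t graph;
    cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal);
    for (int k = 0; k < 3; ++k)
        addOne<<<(n + 255) / 256, 256, 0, stream>>>(d, n);
    cudaStreamEndCapture(stream, &graph);

    // Instantiate once, then launch the whole graph with a single call.
    cudaGraphExec_t exec;
    cudaGraphInstantiateWithFlags(&exec, graph, 0);
    for (int r = 0; r < replays; ++r)
        cudaGraphLaunch(exec, stream);
    cudaStreamSynchronize(stream);

    float first = 0.0f;
    cudaMemcpy(&first, d, sizeof(float), cudaMemcpyDeviceToHost);

    cudaGraphExecDestroy(exec);
    cudaGraphDestroy(graph);
    cudaStreamDestroy(stream);
    cudaFree(d);
    return first;
}

int main() {
    printf("first element after 2 replays: %.1f\n", runCapturedGraph(2));
    return 0;
}
```

Each replay runs the three captured launches, so the value grows by 3 per replay. The Explicit API method builds the same graph by hand with calls such as `cudaGraphCreate` and `cudaGraphAddKernelNode`. It is not shown here.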
How to Start Programming your NVIDIA graphics card using CUDA | CUDA Tutorial | GeForce Programming | CUDA Programming
A quick overview of how to program your NVIDIA graphics card using the CUDA programming language. Covers CUDA Toolkit 9 and CUDA Toolkit 10. To get started, visit cudaeducation.com/howtoprogramcuda. Teaching & consulting sessions available: cudaeducation@gmail.com. Next: CUDA Dynamic […]
Video Walkthrough (21+ min.) of using CUDA Pinned Memory | cudaMallocHost | Make your applications run faster
Learn how to use CUDA Pinned Memory to make your applications run faster. You can’t process if you don’t have the data! Pinned memory is used to give your GPU data faster so it can keep busy processing. This is a […]
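As a rough sketch of the idea (not the code from the video), the example below allocates page-locked host buffers with `cudaMallocHost` and moves data through them with `cudaMemcpyAsync`. The helper name `pinnedRoundTrip` is made up for illustration, and error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Sends n floats host -> device -> host through pinned buffers and reports
// whether the data survived the round trip.
bool pinnedRoundTrip(int n) {
    const size_t bytes = n * sizeof(float);
    float *hIn = nullptr, *hOut = nullptr, *d = nullptr;

    // Page-locked (pinned) host memory instead of malloc/new.
    cudaMallocHost(&hIn, bytes);
    cudaMallocHost(&hOut, bytes);
    cudaMalloc(&d, bytes);

    for (int i = 0; i < n; ++i) {
        hIn[i] = 0.5f * i;
        hOut[i] = -1.0f;
    }

    cudaStream_t stream;
    cudaStreamCreate(&stream);

    // Copies in the same stream run in order; pinned host memory lets them
    // be truly asynchronous with respect to the CPU.
    cudaMemcpyAsync(d, hIn, bytes, cudaMemcpyHostToDevice, stream);
    cudaMemcpyAsync(hOut, d, bytes, cudaMemcpyDeviceToHost, stream);
    cudaStreamSynchronize(stream);

    bool ok = true;
    for (int i = 0; i < n; ++i)
        if (hOut[i] != hIn[i]) ok = false;

    cudaStreamDestroy(stream);
    cudaFree(d);
    cudaFreeHost(hIn);   // pinned memory is released with cudaFreeHost
    cudaFreeHost(hOut);
    return ok;
}

int main() {
    printf("round trip %s\n", pinnedRoundTrip(1 << 20) ? "ok" : "failed");
    return 0;
}
```

Pinned memory is a limited resource, so it is usually reserved for buffers that are transferred often.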
Transfer Data from Device to Host using CUDA | CUDA Education | cudaMalloc | CUDA Tutorial
A very simple example of transferring data from the device (GPU) to the host (CPU) using CUDA. If you have any questions, contact me on twitter @cudaeducation or comment on the YouTube video. Download code: CUDA Transfer Data from Host to […]
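This is not the downloadable code, but a device-to-host transfer can be sketched along these lines. The kernel `fillDoubled` and the helper `sumFromDevice` are hypothetical names used only here, and error checking is omitted.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: writes 2*i into element i.
__global__ void fillDoubled(int *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = 2 * i;
}

// Produces n values on the device, copies them to the host, and returns
// their sum computed on the CPU.
long long sumFromDevice(int n) {
    int *d = nullptr;
    cudaMalloc(&d, n * sizeof(int));                 // device (GPU) buffer
    fillDoubled<<<(n + 127) / 128, 128>>>(d, n);

    std::vector<int> h(n);                           // host (CPU) buffer
    // Blocking copy; it waits for the kernel above to finish first.
    cudaMemcpy(h.data(), d, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(d);

    long long sum = 0;
    for (int v : h) sum += v;
    return sum;
}

int main() {
    printf("sum = %lld\n", sumFromDevice(256));
    return 0;
}
```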
GPU vs CPU Programming | CUDA Programming Introduction | NVIDIA CUDA Overview | YouTube Video Walkthrough
A quick comparison of GPU programming vs. CPU programming and the battles one has to face when programming in parallel.
NVIDIA DeepStream SDK | Video Analytics Application Development | CUDA TensorRT
Use the NVIDIA DeepStream SDK to identify objects and their attributes, such as a car’s color, type and make, using artificial intelligence. It uses CUDA to pre-process the data and gains insights through TensorRT. It is available for NVIDIA Tesla and Jetson. DeepStream […]
CUDA GPU Occupancy Calculator
Just discovered the CUDA GPU Occupancy Calculator spreadsheet that will help you increase the occupancy ratio of your applications. Occupancy is defined as [active warps]/[maximum warps]. You will always want your device cores to be used to their fullest potential […]
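The same [active warps]/[maximum warps] ratio can also be estimated in code rather than in the spreadsheet. The sketch below uses the runtime call `cudaOccupancyMaxActiveBlocksPerMultiprocessor`. The kernel `scale` and the helper `theoreticalOccupancy` are made-up names, the block size is assumed to be a multiple of the warp size, and the result is a theoretical upper bound rather than a measurement.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: doubles every element.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Theoretical occupancy of `scale` for a given block size:
// [active warps per SM] / [maximum warps per SM].
float theoreticalOccupancy(int blockSize) {
    int device = 0;
    cudaGetDevice(&device);
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, device);

    // How many blocks of this kernel can be resident on one SM at once.
    int blocksPerSM = 0;
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(&blocksPerSM, scale,
                                                  blockSize, 0);

    int activeWarps = blocksPerSM * blockSize / prop.warpSize;
    int maxWarps = prop.maxThreadsPerMultiProcessor / prop.warpSize;
    return static_cast<float>(activeWarps) / maxWarps;
}

int main() {
    for (int blockSize = 64; blockSize <= 1024; blockSize *= 2)
        printf("block size %4d -> occupancy %.2f\n",
               blockSize, theoreticalOccupancy(blockSize));
    return 0;
}
```

The numbers depend on the GPU and on how many registers and how much shared memory the kernel uses, so they will differ from machine to machine.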
Thread Hierarchies in CUDA | Sorting out the mess
GPU = Graphics Processing Unit (ex. NVIDIA GeForce GTX 1050 Ti). CPU = Central Processing Unit (ex. Intel Core i7). When programming your graphics processor, there are a lot more moving parts compared to programming on the CPU. Coders go […]
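To make the hierarchy a little more concrete, here is a small sketch of how a one-dimensional grid of blocks maps threads to array elements. The names `recordCoords` and `hierarchyMatches` are hypothetical, error checking is omitted, and only the x dimension is used.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Each thread records which block it belongs to and its index in that block.
__global__ void recordCoords(int *blockOf, int *threadOf, int n) {
    // Global index = block number * threads per block + thread number.
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        blockOf[i] = blockIdx.x;
        threadOf[i] = threadIdx.x;
    }
}

// Launches blocks x threadsPerBlock threads and checks on the host that
// element i was written by block i / threadsPerBlock, thread i % threadsPerBlock.
bool hierarchyMatches(int blocks, int threadsPerBlock) {
    const int n = blocks * threadsPerBlock;
    int *dBlock = nullptr, *dThread = nullptr;
    cudaMalloc(&dBlock, n * sizeof(int));
    cudaMalloc(&dThread, n * sizeof(int));

    recordCoords<<<blocks, threadsPerBlock>>>(dBlock, dThread, n);

    std::vector<int> hBlock(n), hThread(n);
    cudaMemcpy(hBlock.data(), dBlock, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaMemcpy(hThread.data(), dThread, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dBlock);
    cudaFree(dThread);

    for (int i = 0; i < n; ++i)
        if (hBlock[i] != i / threadsPerBlock || hThread[i] != i % threadsPerBlock)
            return false;
    return true;
}

int main() {
    printf("4 blocks x 64 threads: %s\n",
           hierarchyMatches(4, 64) ? "mapping as expected" : "mismatch");
    return 0;
}
```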