If a .cu file calls CUDA Runtime API functions but contains no device code (no ‘__global__’ or ‘__device__’ functions and no kernel launches), rename it to .cpp and compile it with the host compiler. It will compile faster, which adds up in a large project. […]
Learn how to use CUDA Graphs to make your application run faster and more efficiently. This video walkthrough shows you how to create CUDA Graphs with both the stream capture method and the explicit API method. Source code is included.
DeepStream SDK 4.0 is out now: https://developer.nvidia.com/deepstream-sdk It runs on Tesla T4, V100, Jetson TX1, TX2, Nano, and AGX Xavier. Enjoy!
A 10-minute video discussion about using nvprof from the command prompt.
A 40-minute video discussion about CUDA Streams, including example code.
A 13-minute video discussion about useful CUDA debugging techniques, including example code.
A video walkthrough of natively installing NVIDIA DIGITS on the Ubuntu 18.04 LTS operating system. NVIDIA DIGITS can be used to create inference models for the Jetson Xavier Developer Kit.
If you have installed CUDA Toolkit 10 on your machine, you have access to several libraries that can make simple tasks like sorting numbers in an array really fast by using your CUDA-enabled NVIDIA GPU. Again, once CUDA Toolkit 10 […]
A video walkthrough of running a CUDA ray tracing example on an NVIDIA GPU on a Windows machine.