CUDACasts Episode #12: Programming GPUs using CUDA Python

Originally published at:

So far in the CUDA Python mini-series on CUDACasts, I introduced you to using the @vectorize decorator and CUDA libraries, two different methods for accelerating code using NVIDIA GPUs. In today’s CUDACast, I’ll be demonstrating how to use the NumbaPro compiler from Continuum Analytics to write CUDA Python code which runs on the GPU. In CUDACast…

Hello Mark,

I really enjoy the Cudacasts. I was wondering if the script you're using is available anywhere. Copying it from the video can be difficult. Particularly, when I try to run my version of your code, I get the error that the device strides and host strides don't match at the d_next.copy_to_host command.