I have a 4 channel packed BGRA memory on the host device.
Using NPP is it possible to copy from host to GPU device ?
Is there some reason you can’t just use cudaMemcpy?
NPP has a Data Exchange and Initialization module.
Was wondering if these APIs support host to device copying and vice versa.
Yes finally used cudaMemcpy.