I have a laptop with integrated Intel HD 530 graphics and an Nvidia GTX 1060 (Pascal) mobile GPU.
I’ve been running some of the CUDA sample programs and noticed that the performance nvprof reports is fairly bad for the hardware I am running. I compared the results with my other laptop, which has a GTX 960M, and the difference was night and day.
When I run my test program, I get error 10964 on Unified Memory Profiling. When I try the Visual Profiler, I get a different warning number, 8772, but essentially the same message. A screenshot can be seen here: http://i.imgur.com/Z0Ywmx5.png
I think this happens because the profiler is confusing my integrated and dedicated cards. I thought the solution was to set the environment variable “CUDA_VISIBLE_DEVICES” to 0 (which I think is correct, since the second PowerShell window shows device 0 as the GTX 1060).
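For reference, here is a minimal sketch of how I understand the variable has to reach the program: it needs to be in the environment of the process that actually runs the CUDA code, before the CUDA runtime initializes. The helper name `run_with_visible_devices` is just something I made up for illustration, and the commented `nvprof` command line with `my_test_program.exe` is a placeholder, not my real setup.

```python
import os
import subprocess
import sys


def run_with_visible_devices(command, devices="0"):
    """Run `command` with CUDA_VISIBLE_DEVICES set only for that child process."""
    env = os.environ.copy()                # start from the current environment
    env["CUDA_VISIBLE_DEVICES"] = devices  # has to be set before the CUDA runtime initializes
    return subprocess.run(command, env=env, capture_output=True, text=True)


if __name__ == "__main__":
    # Sanity check that needs no GPU: the child prints the value it actually sees.
    check = run_with_visible_devices(
        [sys.executable, "-c", "import os; print(os.environ.get('CUDA_VISIBLE_DEVICES'))"]
    )
    print(check.stdout.strip())

    # Hypothetical real use (placeholder program name):
    # run_with_visible_devices(["nvprof", "my_test_program.exe"])
```

If the child process does not print the value I set, then the variable is not reaching the program, which would explain it having no effect when set in a different window.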
Am I setting the environment variable correctly in Windows? Are there any other tips for fixing this so that CUDA runs on the GTX 1060 and gets profiled correctly? We are going to be using this system for computer vision tasks, so performance is vital.
Thank you for the help!