I apologize first that i cannot write English properly.
I wrote a Neural Network program that uses cudnn and cublas.
This program does rigth calculate on the GTX 1080(I have Got two Card) and on the GTX 960M.
My first 1080 Card is on the PClEx16 and second 1080 card is on the PCLEx1.
I disabled second card on the Device Manager.And Problem was solved.
Now I can Get true results
Every time gives me the rigth results both devices .
The program is running slower on the GTX 1080 and runnning faster on the GTX 960M.
when I run the program with NSigth I see slow functions working with cublas .
I tried to explain this is in the picture on the link below.
Same blas functions run slower on the 1080 .
The functions take the same values in the both devices and they call at the same number.
Only some cublas functions run slowly and my function.
I marked it in the picture.
How can i solve this problem,can this cause problems?