I apologize in advance for my English.
I wrote a neural network program that uses cuDNN and cuBLAS.
The program calculates correctly on both the GTX 1080 (I have two of these cards) and the GTX 960M.
SOLUTION:
My first 1080 card is in the PCIe x16 slot and the second 1080 card is in the PCIe x1 slot.
I disabled the second card in Device Manager, and the problem was solved.
Now I get correct results.
The program gives the right results every time on both devices.
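As an alternative to disabling the card in Device Manager, the program itself can be limited to one GPU through the CUDA_VISIBLE_DEVICES environment variable. The following is only a minimal sketch: the helper name restrict_to_gpu is hypothetical, and it is an assumption that index 0 is the card in the x16 slot (the enumeration order should be checked first, for example with nvidia-smi).

```cpp
#include <cassert>
#include <cstdlib>
#include <string>

// Hypothetical helper (not part of CUDA): hide every GPU except `index`
// from the CUDA runtime by setting CUDA_VISIBLE_DEVICES.
// It must run before the first CUDA, cuBLAS, or cuDNN call in the process,
// because the variable is only read when CUDA initializes.
inline bool restrict_to_gpu(int index) {
    assert(index >= 0);
    const std::string value = std::to_string(index);
#ifdef _WIN32
    // Windows CRT variant; returns 0 on success.
    return _putenv_s("CUDA_VISIBLE_DEVICES", value.c_str()) == 0;
#else
    // POSIX variant; the last argument (1) overwrites an existing value.
    return setenv("CUDA_VISIBLE_DEVICES", value.c_str(), 1) == 0;
#endif
}
```

Calling restrict_to_gpu(0) at the very start of main would leave only one device visible to cuBLAS and cuDNN, assuming that index really is the x16 card.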
However, the program runs slower on the GTX 1080 and faster on the GTX 960M.
When I run the program with Nsight, I see that the slow functions are the ones that use cuBLAS.
I tried to explain this in the picture at the link below.
The same BLAS functions run slower on the 1080.
The functions receive the same values on both devices and are called the same number of times.
Only some cuBLAS functions and my own function run slowly.
I marked it in the picture.
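To cross-check the profiler numbers on both machines, the suspicious calls could also be timed directly. This is a rough sketch of a generic timing helper: the name median_ms and its default parameters are hypothetical. When it is used for cuBLAS calls, the timed function is assumed to end with a device synchronization such as cudaDeviceSynchronize(), because GPU work is launched asynchronously.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <vector>

// Hypothetical helper: run `work` a few times untimed (warm-up), then time
// `iters` runs and return the median duration in milliseconds.
// For GPU code, `work` should end with a device synchronization, otherwise
// only the launch overhead is measured instead of the real run time.
template <typename F>
double median_ms(F&& work, int warmup = 3, int iters = 10) {
    assert(warmup >= 0 && iters > 0);
    for (int i = 0; i < warmup; ++i) work();  // warm-up runs, not timed

    std::vector<double> samples;
    samples.reserve(iters);
    for (int i = 0; i < iters; ++i) {
        const auto t0 = std::chrono::steady_clock::now();
        work();
        const auto t1 = std::chrono::steady_clock::now();
        samples.push_back(
            std::chrono::duration<double, std::milli>(t1 - t0).count());
    }
    // Partially sort so the middle element is the median
    // (the upper median when `iters` is even).
    std::nth_element(samples.begin(), samples.begin() + iters / 2,
                     samples.end());
    return samples[iters / 2];
}
```

Running the same call through this helper on the GTX 1080 and on the GTX 960M gives two numbers that can be compared directly. The warm-up runs keep one-time initialization costs out of the measurement.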
How can I solve this problem? Could it cause other problems?