I do not know what happened, but CUBLAS has stopped working.
Whenever I try to run simpleCUBLAS, it starts and then stops responding. The same happens with my own code: whenever the application reaches a CUBLAS function call, it stops working.
I have cublas.h included, and cublas.lib is added to the linker inputs.
When it reaches the point where a CUBLAS function should be called, it stops and there is a black console window for project_name.exe. If I try to close it, another window appears saying that the application was stopped and that I may break or continue.
For instance, when I run simpleCUBLAS the output is:
simpleCUBLAS is running…
_
and nothing happens for another few minutes?!
I have already reinstalled the driver, toolkit, and SDK (CUDA 3.1); it did not help.
…which takes about 3-4 minutes every time. Any ideas why?
I found that if the project contains no CUBLAS functions, there is no such 3-4 minute delay, but if it contains even a single CUBLAS function, the delay appears?!
Help us help you: what operating system, driver version, hardware, and host compiler are you using? It sounds a lot like driver-level recompilation of PTX is going on, which is why it is slow (especially if your host CPU isn’t very fast or you don’t have a lot of free memory). Are you running a Fermi GPU or something else?
I am betting it is the same problem discussed in this thread. Make sure that you are building all your code for sm_20 and that you haven’t got JIT compilation forced on.
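To gather the information asked for above, a small diagnostic along these lines might help. It is only a sketch, assuming the legacy CUBLAS API from cublas.h (`cublasInit` / `cublasShutdown`) and device 0. It prints the GPU's compute capability, shows whether the `CUDA_FORCE_PTX_JIT` environment variable is set, and times CUBLAS initialisation with one-second resolution. The delay may well show up on the first real CUBLAS routine rather than in `cublasInit`, so treat the timing as a rough indicator only.

```cuda
// Diagnostic sketch (not from the original thread): reports the compute
// capability of device 0, the CUDA_FORCE_PTX_JIT setting, and roughly how
// long CUBLAS initialisation takes.
#include <stdio.h>
#include <stdlib.h>
#include <time.h>
#include <cuda_runtime.h>
#include <cublas.h>   /* legacy CUBLAS API: cublasInit / cublasShutdown */

int main(void)
{
    /* Query device 0; a Fermi GPU reports compute capability 2.x. */
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceProperties failed: %s\n",
                cudaGetErrorString(err));
        return EXIT_FAILURE;
    }
    printf("Device 0: %s, compute capability %d.%d\n",
           prop.name, prop.major, prop.minor);

    /* If this variable is set to 1, the driver JIT-compiles PTX even when
       matching binary code is embedded in the executable. */
    const char *jit = getenv("CUDA_FORCE_PTX_JIT");
    printf("CUDA_FORCE_PTX_JIT: %s\n", jit ? jit : "(not set)");

    /* Wall-clock timing; coarse, but enough to spot a delay of minutes. */
    time_t start = time(NULL);
    cublasStatus status = cublasInit();
    double elapsed = difftime(time(NULL), start);
    if (status != CUBLAS_STATUS_SUCCESS) {
        fprintf(stderr, "cublasInit failed with status %d\n", (int)status);
        return EXIT_FAILURE;
    }
    printf("cublasInit took about %.0f s\n", elapsed);

    cublasShutdown();
    return EXIT_SUCCESS;
}
```

Saved under a hypothetical name such as check_cublas.cu, it could be built with something like `nvcc -arch=sm_20 check_cublas.cu -lcublas`, so that your own code is compiled for sm_20 as suggested above. Posting the output together with the OS, driver version, and host compiler would answer most of the questions asked.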