I've just installed CUDA 3.0 and want to test a simple program. I called cublas_sgemm to multiply two 2×2 matrices (B = A*A), but the routine appears to do nothing: the output matrix B is identical to the input matrix A. That's strange, because I previously ran the same test with CUBLAS 2.3 on Ubuntu 8.04 and it passed.
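For reference, here is a minimal CPU sketch of the result such a call should produce, assuming the column-major storage that BLAS-style routines use and no transposes. The helper names `ref_sgemm_nn` and `demo_square`, and the sample values, are made up for illustration and are not from the actual test program or from CUBLAS.

```c
#include <stddef.h>

/* Hypothetical reference helper (not part of CUBLAS):
   C = alpha * A * B + beta * C for n x n matrices stored
   column by column, with no transposes applied. */
void ref_sgemm_nn(int n, float alpha, const float *A, const float *B,
                  float beta, float *C)
{
    for (int j = 0; j < n; ++j) {
        for (int i = 0; i < n; ++i) {
            float acc = 0.0f;
            for (int k = 0; k < n; ++k)
                acc += A[i + (size_t)k * n] * B[k + (size_t)j * n];
            C[i + (size_t)j * n] = alpha * acc + beta * C[i + (size_t)j * n];
        }
    }
}

/* Fills out[0..3] with A*A for the assumed sample matrix
   A = [1 3; 2 4], stored column-major as {1, 2, 3, 4}.
   The output buffer is separate from the input buffer. */
void demo_square(float out[4])
{
    const float A[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    for (int i = 0; i < 4; ++i)
        out[i] = 0.0f;
    ref_sgemm_nn(2, 1.0f, A, A, 0.0f, out);
}
```

With these sample values, `demo_square` leaves `{7, 10, 15, 22}` in the output buffer, that is, B = [7 15; 10 22]. Comparing the GPU result against a check like this shows whether the output buffer was written at all or still holds the input values.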
I then wondered whether the version was the cause, so I tested CUDA 2.3 on RHEL 5.3. This time the program crashed with a segmentation fault…
Does anyone know the reason? I'm going insane… Thanks.