Strange behavior of TeslaC2050

kelson · November 18, 2010, 3:45pm

Hi,

I was using Quadro FX4800. Yesterday my"baby" reached my lab…Now I start enjoying it.

It is the Tesla C2050.

However there is a strange behavior I don’t understand.

I can’t get anything than 0 when run my cubals programs.

I notice that the program works well on the previous card.

cat /proc/driver/nvidia/version comand gives me

Nvidia x86_64 kernel module 260.19.12

gcc version 4.1.2

I’m trying to figure out what happen but can’t find anything.

Please any help will be wellcome.

The sample code of Hendrik Lensch…

#include<stdio.h>

#include<stdlib.h>

#include"cublas.h"

int main()

{

	float *h_a, *h_b,*h_c;

	float *d_a, *d_b,*d_c;

	float alpha=1.0f, beta = 0.0f;

	int N = 10, n2 = N*N;

	int nBytes = n2*sizeof(float),i;

	h_a = (float*)malloc(nBytes);

	h_b = (float*)malloc(nBytes);

	h_c = (float*)malloc(nBytes);

	for(int i = 0; i < n2; i++){

		h_a[i] = rand()/(float)RAND_MAX;

		h_b[i] = rand()/(float)RAND_MAX;

	}

	cublasInit();

	cublasAlloc(n2, sizeof(float),(void**)&d_a);

	cublasAlloc(n2, sizeof(float),(void**)&d_b);

	cublasAlloc(n2, sizeof(float),(void**)&d_c);

	

	cublasSetVector(n2,sizeof(float),h_a,1, d_a,1);

	cublasSetVector(n2,sizeof(float),h_b,1, d_b,1);

	cublasSgemm('n','n',N,N,N,alpha,d_a,N,d_b,N,beta,d_c,N);

	cublasGetVector(n2,sizeof(float),d_c,1,h_c,1);

	

	for(int i = 0; i < 10; i++){

		for(int j = 0; j < 10; j++){

			printf("% lf",h_c[i + j * N]);

		}

		printf("\n");

	}

	cublasShutdown();

	return 0;

}

kelson · November 18, 2010, 3:45pm

Hi,

I was using Quadro FX4800. Yesterday my"baby" reached my lab…Now I start enjoying it.

It is the Tesla C2050.

However there is a strange behavior I don’t understand.

I can’t get anything than 0 when run my cubals programs.

I notice that the program works well on the previous card.

cat /proc/driver/nvidia/version comand gives me

Nvidia x86_64 kernel module 260.19.12

gcc version 4.1.2

I’m trying to figure out what happen but can’t find anything.

Please any help will be wellcome.

The sample code of Hendrik Lensch…

#include<stdio.h>

#include<stdlib.h>

#include"cublas.h"

int main()

{

	float *h_a, *h_b,*h_c;

	float *d_a, *d_b,*d_c;

	float alpha=1.0f, beta = 0.0f;

	int N = 10, n2 = N*N;

	int nBytes = n2*sizeof(float),i;

	h_a = (float*)malloc(nBytes);

	h_b = (float*)malloc(nBytes);

	h_c = (float*)malloc(nBytes);

	for(int i = 0; i < n2; i++){

		h_a[i] = rand()/(float)RAND_MAX;

		h_b[i] = rand()/(float)RAND_MAX;

	}

	cublasInit();

	cublasAlloc(n2, sizeof(float),(void**)&d_a);

	cublasAlloc(n2, sizeof(float),(void**)&d_b);

	cublasAlloc(n2, sizeof(float),(void**)&d_c);

	

	cublasSetVector(n2,sizeof(float),h_a,1, d_a,1);

	cublasSetVector(n2,sizeof(float),h_b,1, d_b,1);

	cublasSgemm('n','n',N,N,N,alpha,d_a,N,d_b,N,beta,d_c,N);

	cublasGetVector(n2,sizeof(float),d_c,1,h_c,1);

	

	for(int i = 0; i < 10; i++){

		for(int j = 0; j < 10; j++){

			printf("% lf",h_c[i + j * N]);

		}

		printf("\n");

	}

	cublasShutdown();

	return 0;

}

avidday · November 18, 2010, 4:05pm

What version of cublas are you using?

avidday · November 18, 2010, 4:05pm

What version of cublas are you using?

kelson · November 18, 2010, 4:27pm

Hi sir Avidday

I’m using 3.1

kelson · November 18, 2010, 4:27pm

Hi sir Avidday

I’m using 3.1

Crankie · November 19, 2010, 6:07am

Hi,

You may include error checking at each of the CUBLAS API calls to see if something is failing :)

Crankie · November 19, 2010, 6:07am

Hi,

You may include error checking at each of the CUBLAS API calls to see if something is failing :)

LSChien · November 19, 2010, 6:09am

could you check error code returned by sgemm?

LSChien · November 19, 2010, 6:09am

could you check error code returned by sgemm?

Sarnath · November 19, 2010, 9:20am

-arch 2.0 or 2.1

Sarnath · November 19, 2010, 9:20am

-arch 2.0 or 2.1

kelson · November 19, 2010, 9:28am

Thank you for your reply.

Actually I added the error verification on the real code.

After dowloading and reinstalling the latest driver and recompiled the SDK codes

I got the expected results.

However I dont still understand why this happened.

If someone could tell me why that will be nice for me.

Thank you again

kelson · November 19, 2010, 9:28am

Thank you for your reply.

Actually I added the error verification on the real code.

After dowloading and reinstalling the latest driver and recompiled the SDK codes

I got the expected results.

However I dont still understand why this happened.

If someone could tell me why that will be nice for me.

Thank you again