Speed problem on 295 gtx cards

Mange · January 7, 2010, 1:47pm

Thanks again for your input. I made a very simple program to see if it crashed with that as well and to my surprise it did. I paste the code for this simple ( and totally useless ) program below. If I start one run in this code it finishes without any problem, but if I start a new run while the first one still is running the first one crashes with an unspecified launch failure. I would very grateful if someone could try this code on their computer (if they have more than one device for calculation) and launch at least 2 runs of the code at the same time. Perhaps Im doing something wrong in this simple code as well? Otherwise the problem should be somewhere else but in the code.

The reason for the massive loop is just that I want the execution to stay on the GPU for a while.

[codebox]#include

#include <assert.h>

using namespace std;

global void VecAdd(float a, float b, float c, float* kD) {

float d = 0;

for(int i=0; i<100000000; i++)

    d += a+b+c+ kD[8]+i;

kD[0]=d;

}

int main() {

float minarray[20] = {114,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20};

float *keeperDevice;

cudaError t1 = cudaMalloc((void**)&keeperDevice, 20*sizeof(float) );

assert(t1==cudaSuccess);

t1 = cudaMemcpy(keeperDevice, &minarray, 20*sizeof(float), cudaMemcpyHostToDevice);

assert(t1==cudaSuccess);

int cgd = 0;

int cgdc = 0;

t1 = cudaGetDevice(&cgd);

assert(t1==cudaSuccess);

t1 = cudaGetDeviceCount(&cgdc);

assert(t1==cudaSuccess);

cout << "cudaGetDevice: " << cgd << endl;

cout << "cudaGetDeviceCount: " << cgdc << endl;

VecAdd<<<128, 128>>>(5.0,6.0,7.0,keeperDevice);

t1 = cudaMemcpy(&minarray, keeperDevice, sizeof(float)*20, cudaMemcpyDeviceToHost);

assert(t1==cudaSuccess);

cout << “Done”;

}[/codebox]

Topic		Replies	Views
Failure with independent devices on independent processes Try it yourself! CUDA Programming and Performance	19	3556	March 10, 2011
Two 8800 GTX cards with Intel Core 2 Duo would this work? CUDA Programming and Performance	19	13156	October 2, 2007
Kernels launch - parallel or serial? CUDA Programming and Performance	16	7002	January 11, 2010
Using more than 1 CUDA card at a time. Physics simulations flat out flying on GPU CUDA Programming and Performance	12	12637	March 12, 2010
Different performance from different GPUs with Identical Code CUDA Programming and Performance	18	4473	April 11, 2012
multi gpu + exclusive mode + matlab, can't run two processes - kernel crashes CUDA Programming and Performance	39	9377	July 1, 2010
Is it possible to execute two kernels concurrently? CUDA Programming and Performance	18	6763	July 2, 2010
2 GTX295 SLI Nqueens project CUDA Programming and Performance	31	18000	February 18, 2009
CUDA Screen freeze with 1 graphics Card CUDA Programming and Performance	37	52105	June 17, 2011
Exclusive compute mode doesn't work with multiple GTX295's & 64-bit Linux CUDA Programming and Performance	2	2724	September 17, 2009

Speed problem on 295 gtx cards

Related topics