Noob needs advice

Zlon · February 7, 2015, 2:33pm

Hello! I’ve just started to learn CUDA for one very narrow task. Could you give me a piece of advice on whether CUDA will help me with my task? Generally speaking, I want to perform the following calculations:

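#include <cstddef>

const int N = 100000;  // 10^5 (in C/C++, 10^5 is XOR and evaluates to 15)

double FN(double prev, double a);  // update rule, defined elsewhere (not shown in this post)

// Fills x[0..N-1] with x[0] = 0, x[i] = FN(x[i-1], a).
// Each step needs the previous one, so this loop is sequential.
void fun2(double a, double *x)
{
    x[0] = 0;
    for (int i = 1; i < N; i++)
    {
        x[i] = FN(x[i-1], a);
    }
}

// Runs fun2 once per parameter a[iA]. Row iA of x (N doubles) receives the
// sequence for a[iA]; the rows do not depend on each other.
void fun1(const double *a, double *x)
{
    for (int iA = 0; iA < N; iA++)
    {
        fun2(a[iA], x + (std::size_t)iA * N);
    }
}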

What should I read? I see two nested kernels here, but the documentation I read provides no examples of such kernels. Secondly, I could not understand how many blocks and threads to use. And finally, I read that threads have very little memory, but my computation is very big. Is that a problem for CUDA?

Million thanks in advance.

njuffa · February 7, 2015, 3:15pm

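Applying a mathematical function to an array of data is a task commonly performed with CUDA. The size of such an array will be limited by the amount of memory on your graphics card. Taken literally, the 10^5 × 10^5 array of doubles in your second example would require about 80 GB, which is far more than a typical GPU provides, so you would have to process it in chunks or store only the values you actually need. For comparison, a 10^4 × 10^4 array of doubles needs about 800 MB and fits on a GPU with at least 1 GB of memory, which is something even low-end GPUs typically provide these days.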
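
A quick way to check such memory estimates is to compute them directly. The following is a minimal sketch, assuming 8-byte doubles; the helper name footprintBytes is made up for this illustration and is not part of CUDA.

```cpp
#include <cstdint>

// Hypothetical helper: bytes needed to store an n x n array of doubles.
// 64-bit arithmetic is used on purpose: n * n overflows a 32-bit int
// once n reaches 10^5.
std::uint64_t footprintBytes(std::uint64_t n)
{
    return n * n * sizeof(double);  // sizeof(double) is assumed to be 8
}
```

With this helper, footprintBytes(10000) gives 800,000,000 bytes (about 800 MB), while footprintBytes(100000) gives 80,000,000,000 bytes (about 80 GB).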

Have you had a chance to look at the many example programs that ship with CUDA? They cover a wide range from simple to advanced topics. You may also benefit from reading an introductory book, such as “CUDA by Example”.

I assume when you ask “whether CUDA will help me in my task” you are inquiring whether CUDA will accelerate the task compared to your current CPU version. The answer is, “it depends”. If you plan to simply copy the data from the CPU to the GPU, apply fun2() to it and copy the result back to the CPU, the answer is likely “no”, unless fun2() is very computationally intensive. The performance of such code would be limited by the speed of the copies across the PCIe interconnect that connects the CPU and the GPU. If the above code sketch is part of a larger computation that you plan to move to the GPU in its entirety, performing this computation on the GPU should help even if the kernel turns out to be bound by memory bandwidth, since GPUs generally provide higher memory bandwidth than CPUs.
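
The copy-versus-compute trade-off can be put into a rough back-of-the-envelope model. This is only a sketch under simplifying assumptions: copies and computation do not overlap, and the link sustains a constant bandwidth. The function names and every number you feed in are hypothetical; you would have to measure the CPU time, GPU time, and PCIe bandwidth on your own system.

```cpp
// Hypothetical model: seconds needed to move `bytes` across a link that
// sustains `bytesPerSecond` (for example, a measured PCIe bandwidth).
double copySeconds(double bytes, double bytesPerSecond)
{
    return bytes / bytesPerSecond;
}

// Offloading pays off in this simple model only if the GPU compute time
// plus the time for all host<->device copies is below the CPU time.
bool offloadPaysOff(double cpuSeconds, double gpuSeconds,
                    double copyBytes, double linkBytesPerSecond)
{
    return gpuSeconds + copySeconds(copyBytes, linkBytesPerSecond) < cpuSeconds;
}
```

For instance, with an assumed 10 GB/s link and 8 GB of total traffic, the copies alone take about 0.8 s, so a task that takes 1 s on the CPU only benefits if the kernel itself runs in well under 0.2 s.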
