help-adding multiple vectors, undetermined number and size

iseeplusplus · January 8, 2012, 6:58pm

I’m a beginner to parallel computing and openCL. I would like to figure out how I can write a kernel, or multiple kernels to add a variable amount of vectors of a variable size. I can’t find any examples which demonstrate the tricks necessary to accomplish this.

for(n=0; n < SmallNumber; ++n) {    

    for (n2=0; n2 < LargeNumber; ++n2) {

        A[n2]+=B[n][n2];

    }                                                               

}

I realize that you cannot pass a 2 dimensional vector to an openCL kernel, changed it to this.

int n,n2,n3,z,x=12,y=20000000;

int A[y];

int B[x][y];

int 1dB[x*y];

//initialize A...

//convert B to one dimension

for (n=0, z=0; n < x; ++n , z+=y) {         

    for (n2=z, n3=0; n2 < z+y; ++n2, ++n3) {

        1dB[n2]=B[n][n3];

    }

}

for (n=0, z=0; n < x; ++n, z+=y) {   

    for (n2=z, n3=0; n2 < z+y; ++n2, ++n3) {        

        A[n3]+=1dB[n2];

    }                   

}

So now I don’t have the problem with 2 dimensional vectors, but I think there are a lot of other issue I’ll need to address.

Anyone have any suggestions or examples? It seams like it should be a fairly simple process, but I’ve become kind of confused trying to figure this out.

__kernel void openCL_Kernel( __global  int *A,

                         __global  int **B,  

                         __global  int *C) 

{

int i=get_global_id(0);

int ii=get_global_id(1);

A[i]+=B[ii][i];

}

Other than the fact I cannot pass a 2 dimensional pointer, would this be equivalent assuming I define the work sizes appropriately?

edit: I just realized that I would break the openCL vector size limit if I tried to pass a single vector holding all the data.

Do you think this problem would be significantly easier using cuda?

Topic		Replies	Views
Problem with vector addition example The program doesn't work the way described CUDA Programming and Performance	0	702	October 4, 2011
Error in clEnqueueNDRangeKernel() CUDA Programming and Performance	1	6660	March 18, 2010
Problem with Vectors add Can't compute sum of two vectors CUDA Programming and Performance	4	1648	March 16, 2009
passing an array of char* to kernel function CUDA Programming and Performance	5	13163	November 7, 2010
2D Array using OpenCL Arrays vs Images & How-to CUDA Programming and Performance	12	52778	August 12, 2010
Maximum size for a vector addition program Using structs CUDA Programming and Performance	0	1048	March 28, 2010
Why does my streaming vector add fails? CUDA Programming and Performance	2	2729	August 26, 2011
OpenCL VectorAdd demo buffers confusion CUDA Programming and Performance	0	12605	May 18, 2009
Execute one ernel with different attributes at the same time CUDA Programming and Performance	7	6190	December 16, 2010
Help: adding two arrays (beginner) CUDA Programming and Performance	0	1480	March 20, 2010

help-adding multiple vectors, undetermined number and size

Related topics