CUDA array filtering kernel without a for loop

s.yousefi.radi · May 26, 2021, 8:04am

I have a large array A with size_A rows and 6 columns. I want to check the 4th element of each row and, if it is not zero, copy that row into another array B. Can I compute the index into B for each copied row without using a for loop? Please see the code below.

I would probably need to make b_ptr static somehow (similar to what we have in C), but I think that is not allowed in CUDA.

__global__ void filtering_kernel(float* A, int size_A, float* B, float* size_B)
{
    /*B and size_B are the outputs*/
    int b_ptr = 0;
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    if (x > size_A) return;
    for (int i = 0; i < size_A; i++)
    {
        if (A[x + 3] != 0)
        {
            B[b_ptr] = A[x + 0];
            B[b_ptr + 1] = A[x + 1];
            B[b_ptr + 2] = A[x + 2];
            B[b_ptr + 3] = A[x + 3];
            B[b_ptr + 4] = A[x + 4];
            B[b_ptr + 5] = A[x + 5];
            b_ptr += 6;
            *size_B = *size_B + 1;
        }
    }
}

Robert_Crovella · May 26, 2021, 2:01pm

cross posting here
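
A minimal sketch of one common approach, under stated assumptions: each thread handles exactly one row, and a matching row reserves its output slot with atomicAdd on a shared counter, which takes the place of the "static" b_ptr. The names row_count, kCols and the host helper filter_rows are hypothetical, not from the original post. The counter is an int here rather than the float* size_B from the question, because it is used as an index. Error checking is omitted for brevity.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int kCols = 6;  // columns per row, as in the question

// One thread per row, so there is no loop over rows inside the kernel.
// row_count must be zero before launch; afterwards it holds the number of rows copied.
__global__ void filtering_kernel(const float* A, int size_A, float* B, int* row_count)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= size_A) return;  // guard threads past the last row

    const float* src = A + row * kCols;  // row-major layout assumed
    if (src[3] != 0.0f)
    {
        // atomicAdd returns the previous value, so every matching row
        // gets a unique slot in B even when many threads match at once.
        int slot = atomicAdd(row_count, 1);
        float* dst = B + slot * kCols;
        // Fixed-length copy of the 6 columns of this row.
        for (int c = 0; c < kCols; ++c) dst[c] = src[c];
    }
}

// Hypothetical host helper: copies A to the device, runs the kernel,
// copies the kept rows back into B, and returns how many rows were kept.
int filter_rows(const std::vector<float>& A, int size_A, std::vector<float>& B)
{
    B.clear();
    if (size_A <= 0) return 0;

    size_t bytes = size_t(size_A) * kCols * sizeof(float);
    float* d_A = nullptr;
    float* d_B = nullptr;
    int* d_count = nullptr;
    cudaMalloc(&d_A, bytes);
    cudaMalloc(&d_B, bytes);  // worst case: every row passes the filter
    cudaMalloc(&d_count, sizeof(int));
    cudaMemcpy(d_A, A.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemset(d_count, 0, sizeof(int));

    int threads = 256;
    int blocks = (size_A + threads - 1) / threads;
    filtering_kernel<<<blocks, threads>>>(d_A, size_A, d_B, d_count);

    int count = 0;
    cudaMemcpy(&count, d_count, sizeof(int), cudaMemcpyDeviceToHost);
    B.resize(size_t(count) * kCols);
    cudaMemcpy(B.data(), d_B, B.size() * sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(d_A);
    cudaFree(d_B);
    cudaFree(d_count);
    return count;
}
```

With this scheme the rows in B do not keep their original order, since slots are handed out in whatever order the threads reach atomicAdd. If the order matters, a stream compaction routine such as thrust::copy_if, which is stable, is likely the better fit.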
