How to obtain the maximum value of a sequence using the reduction algorithm?

freshboy · May 5, 2019, 8:08am

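__global__ void absMaxval(float *dst, float *src, int size)
{
    // blocksizeMax must be a compile-time constant >= blockDim.x,
    // and blockDim.x must be a power of two for the halving loop below.
    __shared__ float cache[blocksizeMax];

    int g_id = (blockDim.x * blockIdx.x) + threadIdx.x;
    int l_id = threadIdx.x;

    // Threads past the end of the input load 0, the identity for a max of absolute values.
    cache[l_id] = (g_id < size) ? fabsf(src[g_id]) : 0.0f;
    __syncthreads();

    // Tree reduction in shared memory: halve the active range each step.
    for (unsigned int s = blockDim.x / 2; s > 0; s >>= 1)
    {
        if (l_id < s)
            cache[l_id] = fmaxf(cache[l_id], cache[l_id + s]);
        __syncthreads();
    }

    // Thread 0 writes this block's partial maximum; dst needs one slot per block.
    if (l_id == 0)
        dst[blockIdx.x] = cache[0];
}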

Above is my code. When the sequence is not small, it does not seem to be any faster than copying the sequence from the GPU to the CPU and finding the maximum there. I do not know why. Please help.

saulocpp · May 6, 2019, 8:22am

Search for Thrust's extrema algorithms, in particular max_element.
It is ready to use.
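
As a rough sketch of that suggestion (not code from this thread), something like the following should find the largest absolute value with thrust::max_element. The AbsLess comparator and the absMaxThrust helper are hypothetical names made up for illustration, and the helper assumes a non-empty input that starts out in host memory.

```cuda
#include <thrust/device_vector.h>
#include <thrust/extrema.h>
#include <cmath>
#include <vector>

// Hypothetical comparator: orders two floats by absolute value,
// so max_element returns the element with the largest |x|.
struct AbsLess
{
    __host__ __device__ bool operator()(float a, float b) const
    {
        return fabsf(a) < fabsf(b);
    }
};

// Hypothetical helper: returns max |x| over a host vector.
// Assumes the input is non-empty.
float absMaxThrust(const std::vector<float> &host)
{
    // Copy the input to the GPU.
    thrust::device_vector<float> d(host.begin(), host.end());

    // The search runs on the device and returns an iterator to the winner.
    thrust::device_vector<float>::iterator it =
        thrust::max_element(d.begin(), d.end(), AbsLess());

    // Dereferencing copies that single element back to the host.
    float v = *it;
    return std::fabs(v);
}
```

If the data already sits in device memory behind a raw pointer, as src does in the kernel above, wrapping it with thrust::device_pointer_cast should let the same call run without the extra copy.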
