thrust::minmax_element on GPU produces different results than on CPU

The GPU is a GeForce GTX 1050 Ti (compute capability 6.1), running CUDA 8.0 on 64-bit Windows 10.
The error is on the "min" side; I had a similar problem with thrust::reduce().
This happens with certain datasets of floats, coming from image luma values (it's an HDR app).

Just to verify, I wrote a very simple single-threaded kernel to perform the min-reduction; its result agrees with the CPU.

The floats are not huge (or tiny) values: the two min results are -6.61f vs. -6.049f, and the max is about 2.5f.

Has anyone else had a similar experience with Thrust?
Thanks in advance

Can you provide a self-contained reproducer code?

Hi Bob,

Thanks for the response.
A reproducer may not be practical, because I only see this behavior with certain data sets (coming from images), not all of them.
But here is a code snippet:

float minVal;
float maxVal;
// luminance is a float device pointer allocated with cudaMalloc() and
// already processed by a custom kernel
// length is the number of floats in the device buffer pointed to by luminance

thrust::pair<thrust::device_ptr<float>, thrust::device_ptr<float>> tuple;
tuple = thrust::minmax_element(thrust::device,
                               thrust::device_pointer_cast((float*)luminance),
                               thrust::device_pointer_cast((float*)luminance) + length);
CHECK_CUDA_ERROR(errCode, functionName, "thrust::minmax_element failed");
minVal = *(tuple.first);
maxVal = *(tuple.second);
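If it helps, a self-contained sketch along the lines you asked for could look like the following. The data here is synthetic (made-up values spanning roughly the same range as my luma data), so it may well not trigger the issue, which I only see with particular image data sets:

```cpp
#include <thrust/device_vector.h>
#include <thrust/extrema.h>
#include <thrust/execution_policy.h>
#include <algorithm>
#include <cstdio>
#include <vector>

int main() {
    // Synthetic stand-in for the luma buffer; the real failure needs image data.
    std::vector<float> host(1 << 20);
    for (size_t i = 0; i < host.size(); ++i)
        host[i] = -6.61f + 9.11f * (i % 997) / 997.0f;  // roughly [-6.61, 2.5]

    thrust::device_vector<float> dev = host;

    // GPU result via Thrust, CPU result via the standard library.
    auto gpu = thrust::minmax_element(thrust::device, dev.begin(), dev.end());
    auto cpu = std::minmax_element(host.begin(), host.end());

    std::printf("GPU min=%f max=%f\n", (float)*gpu.first, (float)*gpu.second);
    std::printf("CPU min=%f max=%f\n", *cpu.first, *cpu.second);
    return 0;
}
```

With the real image data, the GPU and CPU "min" lines are where I see the disagreement.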