clGetDeviceInfo() returns an unexpected reply for parameter CL_DEVICE_MAX_COMPUTE_UNITS

weliad · August 26, 2010, 8:04pm

Hi,

When I queried my device for CL_DEVICE_MAX_COMPUTE_UNITS, I got a reply I find strange. The returned value counts SMs rather than actual CUDA cores. My device (a GeForce 8600 GT) has 4 SMs, each built from 8 CUDA cores. I expected to get 32, but the reply was 4.

Is this the expected behavior? Can someone confirm?

Thanks,
– Liad Weinberger.

daemonized · August 27, 2010, 5:28am

That should be fine. By definition, a work-group executes on a single compute unit, and that corresponds to an SM in NVIDIA's architecture. __local memory is shared among all work-items of a work-group, and that matches the shared memory of an NVIDIA SM as well.
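If you still want the core count, you have to derive it yourself, since OpenCL only reports compute units. Here is a minimal sketch of that arithmetic, using the numbers from the question (4 SMs, 8 cores each). The helper name `estimate_scalar_cores` is hypothetical, and the cores-per-SM figure is an assumption you must supply per architecture; it does not come from clGetDeviceInfo().

```c
#include <stddef.h>

/*
 * Hypothetical helper, not part of OpenCL.
 *
 * compute_units:  the value returned by
 *     clGetDeviceInfo(device, CL_DEVICE_MAX_COMPUTE_UNITS,
 *                     sizeof(cl_uint), &compute_units, NULL);
 *     On NVIDIA hardware this is the number of SMs.
 * cores_per_unit: scalar cores per SM. OpenCL does not report this,
 *     so the caller has to know it for the specific GPU
 *     (8 for the device described in the question).
 */
static unsigned int estimate_scalar_cores(unsigned int compute_units,
                                          unsigned int cores_per_unit)
{
    return compute_units * cores_per_unit;
}
```

For the device in the question, `estimate_scalar_cores(4, 8)` gives 32, which is the number that was expected from the query.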

weliad · August 27, 2010, 6:07am

Got it! Thank you for the confirmation.
