Why does dead code in an OpenCL kernel influence the result on an Nvidia GTX 550 Ti? OpenCL

jxj · May 31, 2012, 9:29am

I am using Nvidia's OpenCL development software on a GTX 550 Ti graphics card and have run into a strange problem. (I am new to OpenCL.)

My kernel code is like this:

__kernel void kernel_name(…)
{
    size_t d = get_local_id(0);
    char abc[8];
    …
}

The “char abc[8]” declaration is never used (it is dead code) in my case. However, if I leave “char abc[8]” in the kernel, the result is completely wrong and the kernel takes much longer to run (2095712 ns). If I comment out “char abc[8]”, the result is correct and the kernel runs faster (697856 ns). Doesn't the kernel compiler eliminate dead code?
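To put the two timings in proportion, here is a minimal sketch of the arithmetic. Only the two nanosecond figures come from my measurements; the function name slowdown_ratio is hypothetical and just for illustration.

```c
#include <stdint.h>

/* Ratio of the slower kernel time to the faster one, both in nanoseconds.
   The name slowdown_ratio is hypothetical, used only for this illustration. */
static double slowdown_ratio(uint64_t slow_ns, uint64_t fast_ns)
{
    return (double)slow_ns / (double)fast_ns;
}
```

Calling slowdown_ratio(2095712, 697856) gives a value just over 3, so the kernel with the unused array takes roughly three times as long as the one without it.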

The above is just one concrete example that I can reproduce. I have also run into an even stranger case, where the same program produces different results on different runs in exactly the same environment.

Is this related to memory allocation, or…? Can anyone give me some advice on how to track down the problem?

By the way, oclDeviceQuery reports the following: Platform Version = OpenCL 1.1 CUDA 4.2.1, SDK Revision = 7027912

My OS is Windows XP.

Thank you.
