Problem with multiple GPUs: the multiple GPUs are not working in parallel

Edwen · August 31, 2010, 1:39am

There are two kernels in my program, the second more complicated than the first. I am trying to run them on 3 GPUs in parallel. From the results, the first kernel runs normally: the total time is only slightly longer than the time on each individual GPU. However, the total execution time of the second kernel is longer than the sum of the times on all 3 GPUs. Can anyone help explain what may cause the problem? Here are the results:

First Kernel…

v = 224.06217957
Elapsed time (without Greeks): 0.140319 sec

Profiling Information for GPU Processing:

Device 0 : Tesla T10 Processor
Reduce Kernel : 0.13262 s

Device 1 : Tesla T10 Processor
Reduce Kernel : 0.13319 s

Device 2 : Tesla T10 Processor
Reduce Kernel : 0.13462 s

Second Kernel…

v = 224.06217957
Lb = 21.34053040
Elapsed time (with Greeks): 1.558295 sec

Profiling Information for GPU Processing:

Device 0 : Tesla T10 Processor
Reduce Kernel : 0.51363 s

Device 1 : Tesla T10 Processor
Reduce Kernel : 0.51602 s

Device 2 : Tesla T10 Processor
Reduce Kernel : 0.51722 s

By the way, I am using Tesla C1060 cards, and I modified my program according to the “simpleMultiGPU” sample provided in the Nvidia OpenCL SDK sample codes. I am working under Linux. Thanks,
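To make the symptom concrete, here is a minimal Python sketch that compares each elapsed time with two bounds: the slowest single device (what fully overlapped execution would give) and the sum over devices (what fully serialized execution would give). The timings are copied from the profiling output above; the names `first_kernel`, `second_kernel` and `classify` are illustrative and not part of the original program.

```python
# Timings in seconds, copied from the profiling output in the post.
first_kernel = {"elapsed": 0.140319, "per_device": [0.13262, 0.13319, 0.13462]}
second_kernel = {"elapsed": 1.558295, "per_device": [0.51363, 0.51602, 0.51722]}


def classify(run):
    """Return (parallel_bound, serial_bound, label) for one kernel run.

    parallel_bound: time of the slowest device (devices fully overlapped).
    serial_bound:   sum of all device times (devices run one after another).
    label:          which bound the measured elapsed time is closer to.
    """
    parallel_bound = max(run["per_device"])
    serial_bound = sum(run["per_device"])
    elapsed = run["elapsed"]
    if abs(elapsed - parallel_bound) < abs(elapsed - serial_bound):
        label = "parallel"
    else:
        label = "serial"
    return parallel_bound, serial_bound, label


for name, run in (("first", first_kernel), ("second", second_kernel)):
    parallel_bound, serial_bound, label = classify(run)
    print(f"{name} kernel: elapsed={run['elapsed']:.6f} s, "
          f"max={parallel_bound:.5f} s, sum={serial_bound:.5f} s -> {label}")
```

For the first kernel the elapsed time sits just above the slowest device (0.13462 s), which matches overlapped execution. For the second kernel the elapsed time (1.558295 s) sits just above the sum of the three device times (1.54687 s), which is what one would expect if the three devices ran one after another. This only restates the measurements; it does not identify the cause.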

_Big_Mac · August 31, 2010, 7:00pm

Do the devices share a single context or do they use separate ones?

Edwen · September 1, 2010, 8:07am

They share a single context. The first kernel doesn’t have any problem, but for the second one the GPUs don’t seem to be running in parallel.

_Big_Mac · September 2, 2010, 8:02am

http://forums.nvidia.com/index.php?showtopic=176628
