CPU cores vs GPUs

Hi,
I'm thinking of putting together a 4x GTX 295 machine. I've tested a quad core with 2 GTX 295s and it seems OK for now.
What I was told is that in order to have 4 PCIe slots to host 4 GTX 295s, I can only use one quad-core CPU, so
each core will have to handle two GPUs (or one dual-GPU GTX 295).
Has anyone tried this? What do you think? I guess it would have been better to have one core per GPU, but…

Also, has anyone tested the new GTX 285? How does it compare to the GTX 280 or the GTX 295?

Any comments/thoughts are more than welcome :)

thanks
eyal

I don't think the number of cores or CPUs has any direct relation to the number of GPUs.

Each GPU needs to be controlled by one thread of execution. Having 4 cores helps when programming 4 GPUs because all 4 threads can run simultaneously, so the whole thing can be faster… That's all.

I might be wrong though…
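
A minimal sketch of that one-thread-per-GPU pattern (my own illustration, not from any official sample; the kernel, grid sizes, and buffer size are placeholders). Each host thread calls cudaSetDevice() once, which binds that thread's CUDA context to one GPU:

    #include <cuda_runtime.h>
    #include <pthread.h>

    /* Placeholder kernel; real per-GPU work would go here. */
    __global__ void dummyKernel(float *data)
    {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        data[i] = (float)i;
    }

    /* One worker thread per GPU; the device index arrives via arg. */
    static void *gpuWorker(void *arg)
    {
        int dev = (int)(size_t)arg;
        cudaSetDevice(dev);                 /* bind this thread to one GPU */

        float *d_buf;
        cudaMalloc((void **)&d_buf, 16384 * sizeof(float));
        dummyKernel<<<64, 256>>>(d_buf);    /* 64*256 = 16384 threads */
        cudaThreadSynchronize();            /* pre-CUDA-4.0 device sync */
        cudaFree(d_buf);
        return NULL;
    }

    int main(void)
    {
        int n = 0;
        cudaGetDeviceCount(&n);
        if (n > 8) n = 8;                   /* a 4x GTX 295 box shows 8 devices */

        pthread_t t[8];
        for (int i = 0; i < n; ++i)
            pthread_create(&t[i], NULL, gpuWorker, (void *)(size_t)i);
        for (int i = 0; i < n; ++i)
            pthread_join(t[i], NULL);
        return 0;
    }

Compile with something like nvcc -o multigpu multigpu.cu -lpthread. Error checking is omitted for brevity.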

There are a couple of issues here:

A. Is there an NVIDIA recommendation?

B. Has someone tested such a setup in production and noted a degradation in performance?

C. Obviously I can open 100 threads per core, but that probably won't perform well even when GPUs are not involved.

D. In my code there is a lot of CPU/GPU ping-pong, and therefore I'm a bit afraid that 8 CPU threads (one per GPU) on only 4 cores will degrade the overall performance.

In any case, once the machine arrives, I'll update the forum with my findings :)

eyal

http://www.nvidia.com/object/tesla_build_your_own.html

Yes. There are posts on the forums. There are also examples of background processes slowing the performance of a CUDA app significantly.

Very likely.

Here is a comparison of key parameters for the GTX 280 / GTX 285 / GTX 295, according to official specs:

GPU         Core freq (MHz)   Shader freq (MHz)   Memory freq (MHz)   Memory bandwidth (GB/s)
GTX 280     602               1296                1107 (512-bit)      141.7
GTX 295     576               1242                999  (448-bit)      2 x 111.9
GTX 285     648               1476                1242 (512-bit)      159
GTX 285 OC  702               —                   1323 (512-bit)      169.3

Sources: nvidia.com, evga.com
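
For reference, the bandwidth column follows directly from the other specs: these cards use GDDR3, which transfers on both clock edges, so bandwidth = memory freq x 2 x bus width in bytes. E.g. GTX 280: 1107 MHz x 2 x 64 bytes ≈ 141.7 GB/s, and GTX 295: 999 MHz x 2 x 56 bytes ≈ 111.9 GB/s per GPU.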

Actually, this depends significantly on the kernels. If you're firing hundreds of kernel invocations per second (i.e., each kernel takes only a few milliseconds), then high load from background processes is a problem. You are also likely to see performance degradation if #CPUs < #GPUs in this case.

If your kernels run for a longer time, i.e., a second or so, you can play with the CU_CTX_SCHED_YIELD flag, and it will likely help you avoid performance degradation.
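
A sketch of my own (driver API, device 0, error handling omitted) showing where that flag goes:

    #include <cuda.h>

    int main(void)
    {
        CUdevice  dev;
        CUcontext ctx;

        cuInit(0);
        cuDeviceGet(&dev, 0);
        /* With CU_CTX_SCHED_YIELD the host thread yields its core while
           waiting on the GPU instead of spin-waiting, which matters when
           there are more GPU-driving threads than CPU cores. */
        cuCtxCreate(&ctx, CU_CTX_SCHED_YIELD, dev);

        /* ... launch kernels, synchronize ... */

        cuCtxDestroy(ctx);
        return 0;
    }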

Use blocking sync in 2.2 if you’re worried about CPU utilization. See explanation here.
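
For the runtime API, the equivalent (as I understand the 2.2 release; later releases renamed the flag cudaDeviceScheduleBlockingSync) looks like:

    #include <cuda_runtime.h>

    int main(void)
    {
        /* Must be called before the context is created, i.e. before any
           other runtime call that touches the device. With blocking sync
           the host thread sleeps on an OS primitive instead of spinning,
           freeing the core for other GPU-driving threads. */
        cudaSetDeviceFlags(cudaDeviceBlockingSync);
        cudaSetDevice(0);

        /* ... cudaMalloc / kernel launches / cudaThreadSynchronize ... */
        return 0;
    }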