50x slowdown on one machine vs. another

Jeff_Cox · October 4, 2009, 12:32am

I have developed a couple of kernels for a project to process some images. I have a three-year-old machine with a GTX 260, and they execute just fine in around 7.8ms (not counting all the memory copying between host and device). However, when I take the same executable to my client, who has a top-of-the-line new PC with a Quadro 290, the kernel execution takes over 400ms! (I assumed the 290 would be faster than the 260.)

I have installed the same version of drivers (190.38) and CUDA (2.3) on both machines.

Does anyone have any ideas what I should be looking at to explain this and fix it? This is not expected, is it?

Thanks for your assistance.

SPWorley · October 4, 2009, 2:59am

Your assumption that the Quadro should be faster is very, very wrong.

The Quadro NVS 290 is a low-power display board, optimized for multi-monitor displays, not compute horsepower. It’s useful for things like airport flight arrival TV screens, or stock tickers on multiple monitors. These boards are small and passively cooled so they can be crammed into small form factors.
It has only 16 compute cores (SPs).

The GTX 260 has 216 cores, running at higher frequencies. It’s a “real” GPU.
It’s no comparison.

Jeff_Cox · October 4, 2009, 7:15am

Ah, thank you. I knew that the 285 and 295 were high performance, but I hadn’t heard of the 290, and just assumed it was similar because the numbering is so close to the others.

seibert · October 4, 2009, 6:41pm

NVIDIA model numbering (especially across brands, like between GeForce and Quadro) is very confusing. I find it helpful to check out this page to look up the capabilities of unfamiliar models:

http://en.wikipedia.org/wiki/Comparison_of…rocessing_units
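
Besides looking up specs, the card can be asked directly on the machine in question. The following is only a minimal sketch using the CUDA runtime calls cudaGetDeviceCount and cudaGetDeviceProperties. The helper spPerMultiprocessor is a hypothetical name introduced here, and its value of 8 scalar processors per multiprocessor is an assumption that holds for compute capability 1.x parts (which covers both cards in this thread) but not for later architectures.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: scalar processors (SPs) per multiprocessor.
// Assumption: 8 for compute capability 1.x; returns 0 when unknown.
static int spPerMultiprocessor(int major) {
    return (major == 1) ? 8 : 0;
}

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceCount failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) {
            continue;  // skip devices that cannot be queried
        }

        printf("Device %d: %s\n", dev, prop.name);
        printf("  compute capability: %d.%d\n", prop.major, prop.minor);
        printf("  multiprocessors:    %d\n", prop.multiProcessorCount);

        // Only print an SP estimate when the per-multiprocessor count is known.
        int sp = spPerMultiprocessor(prop.major);
        if (sp > 0) {
            printf("  approx. SP count:   %d\n", sp * prop.multiProcessorCount);
        }

        printf("  global memory:      %.0f MB\n",
               prop.totalGlobalMem / (1024.0 * 1024.0));
    }
    return 0;
}
```

Built with nvcc and run on each machine, this should make a gap like 216 SPs versus 16 SPs visible before any kernel timing is done. The deviceQuery sample that ships with the CUDA SDK reports similar information in more detail.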
