GTX 780 released: GK110 for $650

For those (including me) looking to try out dynamic parallelism in CUDA, there’s now an option that costs less than Titan:

http://www.anandtech.com/show/6973/nvidia-geforce-gtx-780-review

12 SMX’s, 3 GB of RAM, with double precision capped at 1/24 of single precision throughput. (For comparison, Titan has 14 SMX’s, 6 GB of RAM, and a driver mode that runs double precision at 1/3 of single precision throughput.)

Awesome.

It also looks like some GTX 780’s will have “Boost” speeds over 1GHz. That’s a big Cores-x-MHz product!

Interestingly, a ‘Superclocked’ 780 and the TITAN have nearly identical SMX × Boost products: 12 × 1020 ≅ 14 × 876.

The forthcoming “ShadowPlay” live screen recording feature also looks useful.

Great!

  • High single precision capacity
  • Higher register/thread count (256!)
  • Dynamic parallelism

It would be very interesting to get some feedback on dynamic parallelism performance on these cards.

I am currently limited by the CUDA API kernel launch overhead (my kernels are small), and launching each kernel directly from the GPU would be a very useful feature for me.

Is there any way to find a test of this particular feature, or could someone propose a simple “microtestbench” to run on this architecture?
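In case it helps get something concrete going, here is a minimal sketch of the kind of microtestbench I have in mind. It is an assumption on my part, not tested on a 780: it times N back-to-back host-side launches of a deliberately tiny kernel against a single parent kernel that launches the same child N times via dynamic parallelism. The kernel names, the block size of 32, and the count of 1000 are all placeholders you would want to vary.

```cuda
// Hypothetical launch-overhead microbenchmark (sketch, untested on GK110).
// Build assumes a CC 3.5+ device and relocatable device code:
//   nvcc -arch=sm_35 -rdc=true launch_bench.cu -lcudadevrt
#include <cstdio>
#include <cuda_runtime.h>

__global__ void child(float *x)           // deliberately tiny kernel
{
    x[threadIdx.x] += 1.0f;
}

__global__ void parent(float *x, int n)   // device-side launcher
{
    for (int i = 0; i < n; ++i)
        child<<<1, 32>>>(x);              // launched from the GPU
}

// Times one invocation of `run` with CUDA events (includes completion,
// since cudaEventSynchronize waits for all preceding work).
static float time_ms(void (*run)(float *, int), float *x, int n)
{
    cudaEvent_t t0, t1;
    cudaEventCreate(&t0);
    cudaEventCreate(&t1);
    cudaEventRecord(t0);
    run(x, n);
    cudaEventRecord(t1);
    cudaEventSynchronize(t1);
    float ms = 0.0f;
    cudaEventElapsedTime(&ms, t0, t1);
    cudaEventDestroy(t0);
    cudaEventDestroy(t1);
    return ms;
}

static void host_side(float *x, int n)    // n launches from the CPU
{
    for (int i = 0; i < n; ++i)
        child<<<1, 32>>>(x);
}

static void device_side(float *x, int n)  // 1 host launch, n device launches
{
    parent<<<1, 1>>>(x, n);
}

int main()
{
    const int n = 1000;   // stays under the default pending-launch limit
    float *x;
    cudaMalloc(&x, 32 * sizeof(float));
    cudaMemset(x, 0, 32 * sizeof(float));

    time_ms(host_side, x, n);             // warm-up
    printf("host   launches: %.3f ms\n", time_ms(host_side, x, n));
    printf("device launches: %.3f ms\n", time_ms(device_side, x, n));

    cudaFree(x);
    return 0;
}
```

Note the parent kernel only retires once all its children have completed, so the event timing should cover the full device-side chain, not just the enqueue cost.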

This is a good question. We are also working on a project where we are curious how the overhead of launching a few small kernels (part of a larger processing chain) with a single block from the GPU compares to launching the same kernels from the CPU.

One thing to keep in mind is that dynamic parallelism disables parallel kernel invocation from the CPU, since the GPU cannot know in advance how many kernels will be launched from the device side.

I wonder though if this feature is configurable. The Kepler whitepaper gives the impression that there is dedicated logic on the device to dynamically schedule grids from either the host or the device.