Nvidia Tesla P100

Jen-Hsun announced the amazing Tesla P100 GPU at GTC 2016.

http://www.nvidia.com/object/tesla-p100.html

Will it also have improved “random access memory transactions per second” ?

So what’s the over-under that the consumer Pascal multiprocessors will have such massive register files?

I wouldn’t be surprised if the GP10x consumer variants have 32K regs per 64-core SMP.

Hopefully I’m wrong!

There is a new whitepaper: NVIDIA GP100 Pascal Architecture – Infinite Compute for Infinite Opportunities

I wonder whether the writer of that whitepaper is aware of the meaning of “infinite”, or whether Pascal is in fact the solution for all NP-hard problems out there.

I’m not too happy about the decrease in shared memory size per SM. But I guess the effective size doubles in fp16 now, so maybe it’s not a big deal. Pretty sure I’ll mostly only care about lower-precision performance from now on.
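To illustrate what I mean by the effective size doubling (just a toy sketch, nothing from the whitepaper; tile size and names are made up): the same shared memory footprint holds twice as many values once they’re packed as half2 instead of float.

```
#include <cuda_fp16.h>

#define TILE 1024

// 4 KB of shared memory -> 1024 fp32 values
__global__ void tile_fp32(const float *in, float *out)
{
    __shared__ float tile[TILE];
    int i = threadIdx.x;
    tile[i] = in[blockIdx.x * TILE + i];
    __syncthreads();
    out[blockIdx.x * TILE + i] = tile[i];
}

// Same 4 KB of shared memory -> 1024 half2, i.e. 2048 fp16 values
__global__ void tile_fp16(const __half2 *in, __half2 *out)
{
    __shared__ __half2 tile[TILE];
    int i = threadIdx.x;
    tile[i] = in[blockIdx.x * TILE + i];
    __syncthreads();
    out[blockIdx.x * TILE + i] = tile[i];
}
```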

I’d also rather have fewer SMs with more schedulers/cores. I generally have no problem filling an SM with warps, but oftentimes it’s the total number of available blocks that comes up short. But I guess I could be looking harder for independent work and leveraging streams more (rough sketch below).
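Something like this is what I mean by leaning on streams (a minimal sketch with made-up kernels, not my actual code): two independent launches that are each too small to fill the chip can have their blocks co-resident when issued into separate streams.

```
#include <cuda_runtime.h>

// Two trivially independent kernels standing in for real work.
__global__ void scale_a(float *x, int n) { int i = blockIdx.x * blockDim.x + threadIdx.x; if (i < n) x[i] *= 2.0f; }
__global__ void scale_b(float *y, int n) { int i = blockIdx.x * blockDim.x + threadIdx.x; if (i < n) y[i] += 1.0f; }

int main()
{
    const int n = 1 << 16;               // small grids on purpose
    float *a, *b;
    cudaMalloc(&a, n * sizeof(float));
    cudaMalloc(&b, n * sizeof(float));

    cudaStream_t s0, s1;
    cudaStreamCreate(&s0);
    cudaStreamCreate(&s1);

    // Each launch alone is only 256 blocks; issued in different streams the
    // hardware is free to co-schedule blocks from both grids.
    scale_a<<<n / 256, 256, 0, s0>>>(a, n);
    scale_b<<<n / 256, 256, 0, s1>>>(b, n);

    cudaDeviceSynchronize();
    cudaStreamDestroy(s0);
    cudaStreamDestroy(s1);
    cudaFree(a);
    cudaFree(b);
    return 0;
}
```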

It would be nice to know what the new L1 and instruction cache sizes are.
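For what the runtime does expose today, a quick cudaGetDeviceProperties query (nothing Pascal-specific, and as far as I know neither L1 nor instruction cache size is reported) at least gives shared memory and registers per SM plus L2 size:

```
#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    cudaDeviceProp p;
    cudaGetDeviceProperties(&p, 0);

    printf("%s (sm_%d%d)\n", p.name, p.major, p.minor);
    printf("SMs:               %d\n",    p.multiProcessorCount);
    printf("Shared mem per SM: %zu KB\n", p.sharedMemPerMultiprocessor / 1024);
    printf("Registers per SM:  %d\n",    p.regsPerMultiprocessor);
    printf("L2 cache:          %d KB\n", p.l2CacheSize / 1024);
    // L1 and instruction cache sizes are not reported by cudaDeviceProp.
    return 0;
}
```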

Scott, Have you had a chance to work with Pascal yet?

BTW I enjoyed your talk at GTC.

My only complaint was that they only gave you 25 minutes to speak, when I think the audience would have been fine with an hour or more.

@njuffa, Infinite™®*!

@scottgray, for all the reasons you list, I’m hoping the consumer GP10x is closer to an sm_50’ish 128/4/64K/64KB FP32/FP64/REGS/SMEM.

Well I guess in an age where “unlimited” internet connectivity is in fact capped, “infinite” compute may have acquired a new meaning as well.

The reduced shared memory size jumped out to me as well, that seems to be asking for trouble, performance portability wise. I wonder whether it may be a consequence of the much higher SM core frequencies coupled with a desire to keep shared memory latency low?

From what I can tell, Pascal-based consumer-level products are still quite some time off, so nothing to worry about until then. I am particularly eager to see how much more compute they managed to squeeze into the lower-end GPUs without auxiliary power connectors. Will it really be 2x Maxwell?


GTC 2017 runs March 27-30, 2017; I’m guessing the GP100-based GeForce TITAN & Quadro P6000 will be launched around then, since demand for the Tesla P100 is off the charts.