This is my idea for a GPU multi-die design.


This is the core architecture diagram of the new GPU (I have given it the codename Diamond 100).
The four memory controllers at the top control HBM2 (HBM2E) memory; placing them close to the memory stacks simplifies routing.
The two memory controllers at the upper left and upper right control GDDR6, so additional memory can be attached.
The host interface is PCIe 5.0. The four ZRLinks above it act as a high-speed bus and high-speed hub, making it easy to exchange data inside and outside the die and to interconnect multiple GPUs.
The SM is based on the Ampere SM: each SM has 32 FP64 units, 64 FP32 units, 64 INT32 units, and 4 Tensor Cores. Each GPC has 10 SMs, and each GPU die has two GPCs, for a total of 1280 CUDA cores.

The design supports Multi-Instance GPU (MIG) technology: one GPU die can be divided into up to 20 independent GPUs. A GPC, a TPC, or a single SM can each become a GPU instance, so a die can be split into 2, 10, or 20 instances, three modes in total. Instances in any mode can run simultaneously, and the modes can be mixed, so instances of different granularities can run at the same time. Each instance has its own memory, cache, and streaming multiprocessors.

With many GPU instances, the die can provide GPU cloud acceleration to a large number of clients, for example one instance per mobile account, with resources allocated according to each account. On lower-spec clients, cloud acceleration can make game rendering less choppy; a school server could likewise provide GPU compute to each of its clients, and of course there are more use cases.

Finally, doubling the number of ZRLinks on this GPU core also doubles the interconnect bandwidth.
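A short sketch of the arithmetic above: the CUDA-core total and the three MIG granularities follow directly from the per-SM and per-GPC counts. This is a toy model, not vendor code; the assumption that a TPC holds 2 SMs is mine (it matches NVIDIA's published Ampere layout), and the mixed-partition enumeration is just one way to read the claim that the modes can be combined.

```python
# Toy model of the Diamond 100 die described above. Unit counts
# follow the text; SMS_PER_TPC = 2 is my assumption, borrowed from
# NVIDIA's published Ampere layout.

SMS_PER_GPC = 10
GPCS_PER_DIE = 2
FP32_PER_SM = 64           # "CUDA cores" (FP32 units) per SM
SMS_PER_TPC = 2            # assumption (Ampere-style TPC)

total_sms = GPCS_PER_DIE * SMS_PER_GPC       # 2 * 10 = 20
total_cuda = total_sms * FP32_PER_SM         # 20 * 64 = 1280

# Pure MIG modes: every instance is one GPC, one TPC, or one SM.
pure_modes = {
    "GPC": GPCS_PER_DIE,                     # 2 instances
    "TPC": total_sms // SMS_PER_TPC,         # 10 instances
    "SM": total_sms,                         # 20 instances
}

def mixed_partitions():
    """Yield (gpc, tpc, sm) instance counts that exactly cover the
    die's SMs -- one reading of 'modes can be mixed'."""
    for g in range(GPCS_PER_DIE + 1):
        remaining = total_sms - g * SMS_PER_GPC
        for t in range(remaining // SMS_PER_TPC + 1):
            yield (g, t, remaining - t * SMS_PER_TPC)

partitions = list(mixed_partitions())
print(total_cuda)   # 1280
print(pure_modes)   # {'GPC': 2, 'TPC': 10, 'SM': 20}
```

The three pure modes from the text show up as the partitions (2, 0, 0), (0, 10, 0), and (0, 0, 20), alongside mixed layouts such as one whole GPC plus five TPC instances (1, 5, 0).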