I have divided the workload into 4096 threads (64x64), and each thread calls a kernel function. Inside the kernel function, 50 two-dimensional arrays are used; each is allocated in CPU memory, holds double-precision floating-point values, and has a size of 64x64. Could you help me calculate the GPU memory required for this setup?
Are you talking about CPU or GPU threads?
BTW: 64 GPU blocks with 64 GPU threads each would not be enough to fully occupy modern GPUs.
Normally CPU threads call kernel functions (except when using Dynamic Parallelism).
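For reference, a minimal host-side launch of such a 64x64 configuration could look like the sketch below (the kernel name, its body, and the allocation size are placeholders, not your actual code):

```cpp
#include <cuda_runtime.h>

// Hypothetical kernel: each of the 64 * 64 = 4096 threads does some work.
__global__ void myKernel(double *data)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;  // 0 .. 4095
    data[tid] = tid;                                  // placeholder work
}

int main()
{
    double *d_data = nullptr;
    cudaMalloc(&d_data, 4096 * sizeof(double));  // placeholder allocation
    myKernel<<<64, 64>>>(d_data);                // 64 blocks x 64 threads = 4096 threads, launched from a CPU thread
    cudaDeviceSynchronize();
    cudaFree(d_data);
    return 0;
}
```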
Per thread: 50 * 64 * 64 * 8 bytes ≈ 1.6 MBytes?
For all 64 * 64 = 4096 threads: 4096 * 1.6 MBytes ≈ 6.7 GBytes?
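To double-check the arithmetic, here is a quick back-of-the-envelope sketch in plain host code, assuming 50 arrays of 64x64 doubles per thread and 64x64 threads as described above:

```cpp
#include <cstdio>

int main()
{
    const size_t numArrays  = 50;
    const size_t arrayElems = 64 * 64;         // elements per 2D array
    const size_t elemBytes  = sizeof(double);  // 8 bytes
    const size_t numThreads = 64 * 64;         // 4096 threads

    const size_t perThread = numArrays * arrayElems * elemBytes;  // 1,638,400 bytes
    const size_t total     = perThread * numThreads;              // 6,710,886,400 bytes

    printf("Per thread: %zu bytes (~%.2f MB)\n", perThread, perThread / 1e6);
    printf("Total:      %zu bytes (~%.2f GB, %.2f GiB)\n",
           total, total / 1e9, total / (1024.0 * 1024.0 * 1024.0));
    return 0;
}
```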
What do you mean by saying the arrays in CPU memory are inside the kernel? That they are accessed there?
There are different ways to handle it if you do not have enough GPU memory.
Are the ~6.7 GBytes used as buffer memory, or are they needed for input and output?
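If it turns out that the data might not fit, one simple sanity check before allocating is to query the free device memory with cudaMemGetInfo and compare it against the required total, roughly like this sketch (the hard-coded required size is just the figure estimated above):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main()
{
    size_t freeBytes = 0, totalBytes = 0;
    cudaMemGetInfo(&freeBytes, &totalBytes);   // free and total device memory in bytes

    // 50 arrays * 64*64 doubles per array * 4096 threads ≈ 6.7 GB
    const size_t required = 50ull * 64 * 64 * sizeof(double) * 64 * 64;

    printf("GPU memory: %zu bytes free of %zu bytes total\n", freeBytes, totalBytes);
    if (freeBytes < required)
        printf("Not enough free device memory; consider processing the work "
               "in chunks or streaming the data to the GPU.\n");
    return 0;
}
```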