Computer architecture for heavy deep learning algorithm?

asaf.eilam · February 10, 2021, 3:29pm

Hi,
I’m trying to configure what is the best architecture for my machine in order to be able to train a neural net with ~15 million parameters that gets as an input a block of 50X512X512 float32. I would like to run it in a decent time. I know that reading the data from the disk decelerates the training and I wonder how to deal with this also.
I would appreciate your reference to the next items:

how do I know what is the minimal size of GPU memory/frequency to carry such calculation.
what kind of SSD and CPU are preferred?
should I use one big GPU or few decent GPUs?

I’m using keras with tensorflow, on python 3.7.
the operating system is windows.

when executing today on a computer with 3 gpus of 11 Gi (2080 Ti) and Ram of 128 Gi,with cpu i9 -9940x, 3.3GHz.
we can train only with 3 input blocks, one on each gpu in minibatch. and each minibatch takes 2-3 sec.

Topic		Replies	Views
Computer architecture for heavy deep learning algorithm Deep Learning (Training & Inference)	1	290	February 8, 2021
What is recommended system configuration for Deep Learning? Deep Learning (Training & Inference)	0	373	April 19, 2020
Slow training of neural networks on GPU CUDA Programming and Performance	17	3968	April 21, 2021
Recommendation for Nvidia GPU Deep Learning (Training & Inference)	1	341	September 29, 2019
GPU resource needed for training 10000 models Frameworks tensorflow	2	470	January 20, 2021
Underperforming Tesla/Titan CUDA Programming and Performance	3	728	March 8, 2019
Speeding Up Deep Learning Training with NVIDIA V100 Tensor Core GPUs in the AWS Cloud Technical Blog	0	249	August 21, 2022
Is this a good match for GPU? CUDA Programming and Performance	5	3613	June 11, 2009
Would smaller GPU make for a better learning experience? CUDA Programming and Performance	2	710	December 8, 2012
GV100 performance issues Frameworks tensorflow	4	760	June 24, 2019

Computer architecture for heavy deep learning algorithm?

Related topics