Computer architecture for heavy deep learning algorithm

asaf.eilam · February 8, 2021, 7:41am

Hi,
I’m trying to configure what is the best architecture for my machine in order to be able to train a neural net with ~15 million parameters that gets as an input a block of 50X512X512 float32. I would like to run it in a decent time. I know that reading the data from the disk decelerates the training and I wonder how to deal with this also.
I would appreciate your reference to the next items:

how do I know what is the minimal size of GPU memory/frequency to carry such calculation.
what kind of SSD and CPU are preferred?
should I use one big GPU or few decent GPUs?

I’m using keras with tensorflow, on python 3.7.
the operating system is windows.

when executing today on a computer with 3 gpus of 11 Gi (2080 Ti) and Ram of 128 Gi,with cpu i9 -9940x, 3.3GHz.
we can train only with 3 input blocks, one on each gpu in minibatch. and each minibatch takes 2-3 sec.

jimscott · February 8, 2021, 3:26pm

Can you elaborate on the operating system and which DL frameworks you are using?

Topic		Replies	Views
Computer architecture for heavy deep learning algorithm? Deep Learning (Training & Inference)	0	276	February 10, 2021
What is recommended system configuration for Deep Learning? Deep Learning (Training & Inference)	0	377	April 19, 2020
Slow training of neural networks on GPU CUDA Programming and Performance	17	4025	April 21, 2021
Recommendation for Nvidia GPU Deep Learning (Training & Inference)	1	346	September 29, 2019
GPU resource needed for training 10000 models Frameworks tensorflow	2	475	January 20, 2021
Underperforming Tesla/Titan CUDA Programming and Performance	3	731	March 8, 2019
Multi-GPU Setup CUDA Setup and Installation	0	723	January 15, 2017
eGPU set up for deep learning and compute tasks on a small laptop GPU - Hardware tensorflow , cudnn , gpu	1	9257	December 7, 2023
GPU Out Of Memory - Tensorflow Object Detection API Frameworks	1	346	January 22, 2025
Better GPU for training & Inference & Execution LLModels TensorRT cudnn	1	506	November 30, 2023

Computer architecture for heavy deep learning algorithm

Related topics