10*A6000 or 10*A40 for training large language models?

Hello,

My advisor’s research lab is looking to buy a deep learning server, but we’ve been having a hard time choosing between A6000 and A40 GPUs.

Some facts:

  • There are about 10 people in the lab, all working on deep-learning-based research.
  • Server will be installed in a server room maintained by university IT, so ambient temperature / power shouldn’t be an issue.
  • The current configuration allows 10 GPUs in one server.

I’ve looked through a couple of blog posts, and here is what I understand:

  • The A6000 and A40 belong to the same family of GPUs (both are Ampere GA102 cards with 48 GB of GDDR6).
  • A40s have passive cooling, while A6000s have active cooling.
  • A6000s have a higher memory clock (~768 GB/s of bandwidth vs. ~696 GB/s on the A40) => they are slightly faster at inference time.
  • A6000s also have a smaller form factor => we can add more GPUs to the server if needed (?) (see the sketch after this list for one way to verify the specs once the cards are installed).
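For context, here is a minimal sketch of how we’d sanity-check whichever cards we end up with; it assumes a working NVIDIA driver plus a CUDA-enabled PyTorch install, and just prints the name, memory, and SM count of each visible device:

    # Minimal sanity check: print the basic properties of every visible GPU.
    # Assumes PyTorch was installed with CUDA support and the driver is working.
    import torch

    for i in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(i)
        print(
            f"GPU {i}: {props.name}, "
            f"{props.total_memory / 1024**3:.0f} GiB, "
            f"{props.multi_processor_count} SMs, "
            f"compute capability {props.major}.{props.minor}"
        )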

Question:

  • Is passive cooling going to be a problem if the machine runs at full capacity? (The sketch after this list is roughly how we’d plan to monitor for throttling.)
  • Do we risk bottlenecking the A40s / A6000s in any way by putting them all in one big server?
  • Is there any difference in software maintainability between the A40s and the A6000s? Specifically, are we more prone to running into configuration issues on the A6000s than on the A40s?
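In case it’s useful to anyone answering: this is a rough monitoring sketch for the cooling question. It assumes nvidia-smi is on the PATH, and the query field names are taken from the list in nvidia-smi --help-query-gpu (adjust if your driver version differs):

    # Rough monitoring sketch: poll per-GPU temperature, power draw, and the
    # hardware thermal-slowdown flag every 5 seconds while a training job runs.
    # Assumes nvidia-smi is on the PATH; stop with Ctrl-C.
    import subprocess
    import time

    FIELDS = "index,temperature.gpu,power.draw,clocks_throttle_reasons.hw_thermal_slowdown"

    while True:
        result = subprocess.run(
            ["nvidia-smi", f"--query-gpu={FIELDS}", "--format=csv,noheader"],
            capture_output=True, text=True, check=True,
        )
        print(result.stdout.strip())
        time.sleep(5)

For the bottleneck question, my understanding is that nvidia-smi topo -m prints the PCIe/NVLink topology of the box, which should at least show how the 10 cards are attached to the CPUs.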

Apologies for the laundry list of questions!