eGPU box with Thunderbolt 3 performance

Hi all,

I have three identical GPU cards. Two of them are installed directly in the PCIe slots of the motherboard (ASUS Prime X299-Deluxe) and, due to the space limits of the PC case, the third one is installed in an external eGPU box (ASUS XG Station Pro) connected through a Thunderbolt 3 port.

My OS is Ubuntu 18.04.2, the NVIDIA driver version is 440.44, and the CUDA version is 10.0. All three cards are recognized successfully by the OS.

I tested the computational speed of these three cards with my deep-learning program (TensorFlow 1.14, GPU version) and found that the eGPU one is about two times slower than the other two. I also ran tests with the GPU version of VASP as

mpirun -np 8 vasp_gpu

All three cards were used, but the system went dead shortly afterwards.

If I unplug the eGPU box and only use the two GPU cards installed on the motherboard, the VASP software runs fine.
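For what it's worth, instead of physically unplugging the box, the eGPU can also be hidden from the CUDA runtime with the CUDA_VISIBLE_DEVICES environment variable. A minimal sketch, assuming the two motherboard cards enumerate as devices 0 and 1 (the actual order can be checked with nvidia-smi):

# expose only the two motherboard GPUs to VASP (indices 0 and 1 are an assumption)
CUDA_VISIBLE_DEVICES=0,1 mpirun -np 8 vasp_gpu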

Has anyone tested the computational speed of an eGPU box connected over Thunderbolt 3? And could the speed difference between the three cards be what causes VASP to bring down the OS?

Thanks for your help.

Bo-Yuan

An important aspect with an eGPU is the number of PCIe lanes available to the Thunderbolt controller. Do you know if it's single, double, or quad?

For starters, you might want to run bandwidthTest from the NVIDIA samples to check whether there is a major difference in the eGPU box connection. Then you might want to try a few of the compute-heavy (single-GPU) samples to get a baseline. If they're the same card and there are no memory transfers, they should perform the same.
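A minimal sketch of that first check, assuming the CUDA 10.0 samples are installed in the default ~/NVIDIA_CUDA-10.0_Samples location (adjust the path and device indices to your setup):

cd ~/NVIDIA_CUDA-10.0_Samples/1_Utilities/bandwidthTest
make
./bandwidthTest --device=0   # one of the motherboard GPUs
./bandwidthTest --device=2   # the eGPU (index is an assumption; check with nvidia-smi)

A large drop in host-to-device and device-to-host bandwidth on the eGPU card would point to the Thunderbolt/PCIe link rather than the card itself.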

I’ve never used VASP, so I can’t speak to that issue.

Hi mnicely,

Thank you very much for the reply. I checked the manual of the Thunderbolt card, and its PCIe 3.0 link is x4. The other two GPUs installed directly on the motherboard have PCIe 3.0 x16 links. I am not sure, but I guess the difference in PCIe lane bandwidth may be what leads to the performance difference. Anyway, I will use the NVIDIA samples to run the bandwidth tests for each GPU. Again, thanks a lot.
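For reference, PCIe 3.0 x4 tops out at roughly 4 GB/s versus about 16 GB/s for x16, so a noticeable gap in transfer-heavy workloads is expected. The negotiated link width and generation can also be checked at runtime with nvidia-smi (the query fields below are listed by nvidia-smi --help-query-gpu):

nvidia-smi --query-gpu=index,name,pcie.link.gen.current,pcie.link.width.current --format=csv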