Low GPU usage with TensorFlow (RTX 3090)

Hi!

Spec:
Driver Version: 470.57.02, CUDA Version: 11.4
NVIDIA GeForce RTX 3090
TensorFlow 2.5.0, cuDNN 8202, using mixed_float16 training
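
For reference, mixed precision is enabled roughly like this (the tiny model here is just a placeholder to show the setup, not my actual network):

```python
import tensorflow as tf

# Set the policy before building the model: variables stay in float32,
# while compute (matmuls, convolutions) runs in float16.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

# Placeholder model just to illustrate the setup.
model = tf.keras.Sequential([tf.keras.layers.Dense(10)])

# With the mixed_float16 policy, compile() wraps the optimizer
# in a LossScaleOptimizer automatically (TF 2.4+).
model.compile(optimizer="adam", loss="mse")
```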

I upgraded from a 2080 Ti to a 3090 and noticed that my model's training speed barely increased.
I also noticed that GPU usage is lower than it was on the 2080 Ti: nvidia-smi reports between 40% and 80% utilization, and with very large batches the same code swings between 11% and 100%.

This is not due to data loading or CPU limitations: running the network repeatedly on a tensor already resident on the GPU gives the same low usage.
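
To rule that out, I benchmark roughly like this (the Keras ResNet50 and the input shape are just placeholders standing in for my actual network):

```python
import time
import tensorflow as tf

# Placeholder model; my real network is the modified TTFNet described below.
model = tf.keras.applications.ResNet50(weights=None)

# The input tensor is created once and stays resident on the GPU.
x = tf.random.uniform((16, 224, 224, 3))

@tf.function
def forward(inp):
    return model(inp, training=False)

forward(x)  # warm-up so tracing/compilation doesn't skew the timing

start = time.perf_counter()
for _ in range(100):
    out = forward(x)
out.numpy()  # force a sync before stopping the clock
print("avg step time:", (time.perf_counter() - start) / 100, "s")
```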

Seeing this, I suspected some kernels were only using a small portion of the GPU's cores.
Checking with the TensorFlow profiling tools seems to confirm this:

As can be seen in the profiling image, the costliest kernel has a grid dimension of only 9,1,5.
Many other kernels in the network seem to have very small grids as well.
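
In case it's useful, this is roughly how I capture the trace that the image comes from (the log directory and the toy model are placeholders):

```python
import tensorflow as tf

# Tiny placeholder model standing in for my network.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
])
x = tf.random.uniform((16, 224, 224, 3))

@tf.function
def step(inp):
    return model(inp, training=False)

step(x)  # warm up so graph tracing doesn't end up in the profile

# "logs/profile" is just a placeholder directory; I open it in TensorBoard's
# Profile tab, which is where the per-kernel grid/block dimensions are shown.
tf.profiler.experimental.start("logs/profile")
for _ in range(20):
    step(x)
tf.profiler.experimental.stop()
```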

Since the 3090 has far more CUDA cores than my previous 2080 Ti, this underutilization would explain why I'm hardly seeing any performance gain from the switch.

My network is a modified TTFNet without deformable convolutions, with a ResNet-18 backbone, training on COCO with a batch size of 16.
Increasing the batch size doesn't improve throughput (the step time just scales linearly with it).
I can provide the TensorFlow profiler logs, or anything else that might help.

In the end, there must have been an issue with the graphics card itself: it died after a week of use, and with the replacement card GPU usage is higher and training is faster.