Please provide the following information when requesting support.
• Hardware (RTX 3090 / RTX 2080 Ti)
• Network Type (Yolo_v4)
• TLT Version (3.2)
I trained two identical YOLOv4 models on the same dataset using two different GPUs, an RTX 3090 and an RTX 2080 Ti. The training code came straight from NGC and I haven't changed anything.
With batch_size set to 8 on both, the RTX 3090 took 580-620 seconds per epoch. That was not only much slower than I expected, it was even slower than the RTX 2080 Ti, which took 540-580 seconds per epoch.
Could anybody tell me why that is?
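For scale, here is a rough throughput comparison from the midpoints of the epoch times above. The dataset size of 10,000 images is a hypothetical placeholder for illustration only, not the actual size of my training set:

```python
# Hypothetical throughput comparison; num_images = 10000 is a
# placeholder, not my real dataset size.
num_images = 10000

def images_per_sec(epoch_seconds):
    """Effective training throughput for one epoch."""
    return num_images / epoch_seconds

rtx3090 = images_per_sec(600)    # midpoint of 580-620 s/epoch
rtx2080ti = images_per_sec(560)  # midpoint of 540-580 s/epoch

print(f"RTX 3090:    {rtx3090:.1f} img/s")
print(f"RTX 2080 Ti: {rtx2080ti:.1f} img/s")
print(f"RTX 3090 is {100 * (rtx2080ti - rtx3090) / rtx2080ti:.0f}% slower")
```

Whatever the true dataset size, the relative gap is the same: the 3090 comes out roughly 7% slower per epoch, where I would have expected it to be clearly faster.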