I’m asking this question because I saw benchmarks where the RTX 4090 beats the Tesla P100. But when I do the same work in a live class, I notice the instructor's time is, say, 5 seconds/iteration while mine is 6.
Why is there such a difference even though I have the better machine?
Is it because they are on Google Colab and I am on a local machine?
Benchmarks cited on the Internet are rarely a good way to assess real-life scenarios, and throughput always depends on the system as a whole. I am quite sure that the cloud machine backing Google Colab's P100 instance has much more RAM and CPU performance than your local machine, and it dedicates all of its compute to the deep learning process, while your local system might not. Temperature, throttling, VRAM and memory bandwidth are further factors affecting throughput.
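If you want to compare your machine against Colab fairly, make sure you are timing the GPU work itself rather than Python overhead. Here is a minimal sketch using CUDA events with a warm-up; the `Linear` layer, batch size, and iteration count are placeholders for your actual model:

```python
import torch

device = "cuda"
model = torch.nn.Linear(1024, 1024).to(device)  # placeholder workload
x = torch.randn(256, 1024, device=device)

# Warm-up so one-time CUDA initialization doesn't skew the measurement.
for _ in range(5):
    model(x)

start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)

start.record()
for _ in range(100):
    model(x)
end.record()
torch.cuda.synchronize()  # wait for all queued kernels to finish

print(f"{start.elapsed_time(end) / 100:.2f} ms/iteration")
```

Watching temperatures and clock speeds in `nvidia-smi` while this runs will also tell you whether your card is throttling.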
The P100 is a dedicated compute GPU which was specifically made for things like deep learning.
The 4090 is a consumer gaming GPU. It is optimized for gaming workloads, which differ from pure compute workloads.
It is two years later, but this could help someone else. It is probably about your data: if you are using double precision, i.e. float64, the P100 is a lot faster than an RTX 4090 or even an RTX 5090, but if you use single precision, it is the other way around. Those benchmarks are for marketing; if you need FP64 compute, newer consumer cards are getting worse and worse per dollar.
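You can check the precision effect on your own card with a quick matmul timing sketch like the one below (the matrix size and iteration count are arbitrary; shrink `n` if you run out of VRAM):

```python
import time
import torch

def bench(dtype, n=4096, iters=10):
    # Time n x n matrix multiplications at the given precision.
    a = torch.randn(n, n, device="cuda", dtype=dtype)
    b = torch.randn(n, n, device="cuda", dtype=dtype)
    torch.cuda.synchronize()  # flush warm-up/allocation work
    t0 = time.perf_counter()
    for _ in range(iters):
        a @ b
    torch.cuda.synchronize()  # make sure the kernels actually finished
    return (time.perf_counter() - t0) / iters

for dt in (torch.float32, torch.float64):
    print(dt, f"{bench(dt):.4f} s/iter")
```

On a consumer card like the 4090 you should see float64 fall far behind float32, while on a P100 the gap is much smaller.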