How is the A30 GPU faster than the A10 GPU?

Hi,

I’ve been comparing the specs of the A10 vs. the A30 for AI inference workloads, and I don’t understand how the A30 is faster than the A10 in FP16 Tensor Core compute:

A30 has 3,584 CUDA cores and 224 third-gen Tensor Cores.
A10 has 9,216 CUDA cores and 288 third-gen Tensor Cores.

Yet the A30 offers 165 dense FP16 Tensor Core TFLOPS vs. the A10’s 125.

Could you explain where the A30’s extra performance comes from? I’ve been estimating inference performance from the number of CUDA cores and Tensor Cores alone, but it seems I’m missing something.
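For reference, here is the kind of back-of-envelope estimate I mean. If I assume the per-Tensor-Core per-clock rate differs between the two dies (256 FP16 FMAs/clock on the GA100-class die in the A30 vs. 128 on the GA102-class die in the A10, per NVIDIA's Ampere whitepapers) and plug in approximate boost clocks, the datasheet numbers come out. The per-core rates and clock values below are assumptions, not measurements:

```python
# Theoretical dense FP16 Tensor Core throughput:
# TFLOPS = tensor_cores * FMAs/clock/core * 2 FLOPs/FMA * clock (GHz) / 1000
# Per-core FMA rates and boost clocks are assumed from NVIDIA's
# Ampere whitepapers/datasheets, not measured.

def fp16_tc_tflops(tensor_cores, fma_per_clock, boost_ghz):
    """Dense FP16 Tensor Core TFLOPS; one FMA counts as 2 FLOPs."""
    return tensor_cores * fma_per_clock * 2 * boost_ghz / 1e3

# A30: GA100-class die, 224 Tensor Cores, assumed 256 FP16 FMAs/clock each,
# assumed ~1.44 GHz boost clock
print(f"A30: {fp16_tc_tflops(224, 256, 1.44):.0f} TFLOPS")   # ~165

# A10: GA102-class die, 288 Tensor Cores, assumed 128 FP16 FMAs/clock each,
# assumed ~1.695 GHz boost clock
print(f"A10: {fp16_tc_tflops(288, 128, 1.695):.0f} TFLOPS")  # ~125
```

So if these assumptions hold, the gap would come from each A30 Tensor Core doing twice the FP16 work per clock, which raw core counts don't show. Is that the right way to read it?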

Thanks a lot.