Floating Point Operations per SEC calculation

himajyothi802 · September 23, 2021, 3:42pm

Hello

As nvprof providing the metrics for the floating point operations count for kernel.
How to calculate the floating point operations Per sec for multiple kernel calls ?

Do we need to consider the Time provided by the nvprof ? Or any other methods ?

Robert_Crovella · September 23, 2021, 3:55pm

total floating point ops divided by total kernel duration

many nsight compute metrics can be optionally configured to deliver a per-second measurement.

himajyothi802 · September 24, 2021, 2:01am

If we have some n kernels and each kernel being called several times. How to calculate the flops ?

I can get the Flop_count_per_kernel from nvprof -metrics.
As well as I can get No_of_calls & Time from the nvprof.

Flops = (No_of_calls * Flop_count_per_kernel ) / Time.

Is that calculation is correct ?
If wrong, plz mention the correct way to do.

Thanks

Robert_Crovella · September 24, 2021, 2:13am

for a metric like flops, nvprof will display minimum, maximum and average numbers across n runs, for each kernel.

I would take the average number for a given kernel, and multiply it by the number of times that kernel is run. I would add these products for all the kernels in question, then divide that total by the total duration of all the kernels. All of this data is available from nvprof. You would have to combine the results of each separate kernel together.

That should give you a fairly defensible number that you can call the average flops per second for your device code (or for those kernels in your device code).

himajyothi802 · September 24, 2021, 2:15am

Okay…Thanks for your reply…

Topic		Replies	Views
Counting flops what's in and what's out? CUDA Programming and Performance	0	1781	June 9, 2012
How to quantify speed FLOPs integer and logic operations per second CUDA Programming and Performance	3	2006	September 14, 2011
how to calculate #Gflop/sec? CUDA Programming and Performance	2	6821	April 29, 2009
confusion about nvprof documentation CUDA Programming and Performance	1	1076	November 18, 2013
How to calculate the total number of FOP and floating-point performance of special operations(exp sin sqrt)? CUDA Programming and Performance	3	5451	December 26, 2016
Differences in FLOPS calculation CUDA Programming and Performance	1	777	December 26, 2019
Why are there min, max, average values for flop_count_sp or flop_count_hp metrics? Profiling Linux Targets nsight	0	587	October 6, 2021
Floating point operations by nvprof CUDA Programming and Performance	3	833	October 17, 2018
How to figure out a total number of double operations in cuFFT::cufftExecZ2Z on a device? Legacy PGI Compilers	2	652	February 1, 2022
Number of operations in a TensorRT model TensorRT	2	2080	June 9, 2020

Floating Point Operations per SEC calculation

Related topics