I’m trying to get the performance (Gigaflop/s of my vector addition), I have already found this:
float msecPervectAdd= ms / nIter; double gigaFlops = (numElements * 1.0e-9f) / (msecPervectAdd/ 1000.0f);
ms = whole execution time (in ms)
nIter = iterations that use to have longer runs
numElements = the data size of my vectors
But I still want to be sure about it.
Your help is appreciated.