i work for my institute and we try to develop a speedup-formula for the cuda system.
i tried a few things to detect the current runtime for a written code. no problem.
now, we think that it would be good to know the speedup in case of used clock cycles for the operations.
is there a function in cuda that could return the number of used clock cycles on the gpu? or have someonean ideahow to detect that? i have tried some things but there are only coarse assessments…
thanks for reading and thinking about that. if someone have an idea i would be grateful. i am programming with c++