hi,
i’d like to know if cudaprof flag “GPU usec” includes time spent in registers - device memory loads/stores.
thanks