Hi,
I’m trying to read the performance counters of two kernels running concurrently.
I read in the documentation that performance counters can only be read as aggregated. Is planned to provide some mechanism to read the performance counter of individual kernels?