(CUDA samples -> advanced -> reduction)
When I run the reduction sample and
When I debug, I get only one cpu result and one gpu result
I want to know how to use each reduce the time from reduction_kernel.cu to reduce0 ~ reduce6 in the reduction sample.
I would be grateful if you could teach me.