Is it possible to get the result of an individual thread's execution, rather than of the entire function, so that operations on it can already be performed on the CPU?

Hello, this forum is dedicated to discussions about using the cuda-memcheck tools.
General CUDA questions can be raised at the CUDA - NVIDIA Developer Forums.