I have a kernel that performs its calculations without any __syncthreads() calls. When the last thread finishes, I want to launch a second kernel from the GPU itself to run a different processing step on the same data, without returning control to the CPU (I suspect the round trip would cost a lot of time). Is this possible? The data is already on the GPU, so I only need to run different code on it. How is this done (again, without a __syncthreads() call)?
Thanks in advance.