I have a kernel that I want to run on several different sets of data. I transfer all of the data to global memory at the start, then for each set I copy it into shared memory, process it, and write the results back.
Does each set of data need its own kernel launch, or is there a way to move on to the next set without launching the kernel multiple times?
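
To make the setup concrete, here is a minimal sketch of the one-launch-per-set pattern described above. Everything in it is a placeholder for illustration: the names processSet, SET_SIZE and NUM_SETS, the assumption that one set fits in a single block's shared memory, and the neighbor-sum computation standing in for the real processing. Error checking is omitted for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

constexpr int SET_SIZE = 256;  // elements per set (assumed to fit in shared memory)
constexpr int NUM_SETS = 8;    // number of data sets (assumed)

// Processes ONE set: global -> shared, compute, result -> global.
__global__ void processSet(float* data, int setIndex)
{
    __shared__ float tile[SET_SIZE];

    float* set = data + setIndex * SET_SIZE;  // start of this set in global memory
    int i = threadIdx.x;

    tile[i] = set[i];   // stage the set in shared memory
    __syncthreads();    // wait until the whole set is loaded

    // Placeholder computation: each element plus its right neighbor (wrapping).
    set[i] = tile[i] + tile[(i + 1) % SET_SIZE];
}

int main()
{
    std::vector<float> host(NUM_SETS * SET_SIZE, 1.0f);
    size_t bytes = host.size() * sizeof(float);

    float* dev = nullptr;
    cudaMalloc(&dev, bytes);

    // All sets are copied to global memory once, up front.
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    // One kernel launch per set, each with a single block of SET_SIZE threads.
    for (int s = 0; s < NUM_SETS; ++s)
        processSet<<<1, SET_SIZE>>>(dev, s);

    // Blocking copy on the default stream: waits for the launches above to finish.
    cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);

    printf("host[0] = %f\n", host[0]);
    return 0;
}
```

The host loop over `s` is the part the question is about: whether those repeated launches are required, or whether a single launch can cover every set.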