I’m starting to evaluate the possibility to use the CUDA power for microcontroller simulation:
what I want is the possibility to use the parallelism provided by CUDA to simulate
n microcontroller execution of the same program in parallel.
I.e: I have 1 program, and I have to execute it many times with different input data:
is it possible to use CUDA to execute it in parallel on the different input data sets?
Any idea or reference of the possibility or impossibility of this approach?
Executing the same code on different sets of data is the heart of data-parallel programming, so of course this is possible! Start with the CUDA programming guide as the first reference.
Sure, you will probably use some efficiency to divergent warps. But the hardware is actually very efficient at handling this. Don’t discount an idea until you’ve tried it out and proven via benchmarking that the divergent warps are actually too costly.