I know that a major change of 1.1 compute capabilities was the introduction of streams and a bunch of asynchronous functions in the API. However, using the 1.0 hardware version, when we have to synchronize with the kernel, it seems that we have to make a blocking call.
Since i guess this function is doing some polling internally (or waits for some interrupts). Is it really impossible that we get a non-blocking version of cuCtxSynchronize for instance ?
Perhaps there is already such a feature that i missed in the documentation ?