Question about cudaDeviceScheduleBlockingSync

It works as expected when combined with CUDA events created with cudaEventBlockingSync: the thread blocks and does not waste CPU cycles.

The problem appears when I use streams without events. After calling cudaStreamSynchronize, the thread is blocked, but the CPU is still spinning.
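
The two cases can be sketched roughly as follows. This is only a minimal reproduction under assumptions: the kernel busyKernel, the CHECK macro, the buffer size, and the iteration count are hypothetical placeholders, and the device flag is assumed to be set before any other CUDA call creates the context.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, present only to keep the GPU busy for a while.
__global__ void busyKernel(float *data, int n, int iters)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        float v = data[i];
        for (int k = 0; k < iters; ++k)
            v = v * 1.0001f + 0.5f;
        data[i] = v;
    }
}

// Hypothetical helper: print the error and exit main on failure.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t e = (call);                                       \
        if (e != cudaSuccess) {                                       \
            fprintf(stderr, "%s failed: %s\n", #call,                 \
                    cudaGetErrorString(e));                           \
            return 1;                                                 \
        }                                                             \
    } while (0)

int main()
{
    // Assumption: this is the first CUDA call in the process.
    CHECK(cudaSetDeviceFlags(cudaDeviceScheduleBlockingSync));

    const int n = 1 << 20;          // placeholder size
    const int iters = 100000;       // placeholder amount of work
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;

    float *d = nullptr;
    CHECK(cudaMalloc(&d, n * sizeof(float)));
    CHECK(cudaMemset(d, 0, n * sizeof(float)));

    cudaStream_t stream;
    CHECK(cudaStreamCreate(&stream));

    // Case 1: stream plus a blocking-sync event (behaves as expected).
    cudaEvent_t done;
    CHECK(cudaEventCreateWithFlags(&done, cudaEventBlockingSync));
    busyKernel<<<blocks, threads, 0, stream>>>(d, n, iters);
    CHECK(cudaGetLastError());
    CHECK(cudaEventRecord(done, stream));
    CHECK(cudaEventSynchronize(done));

    // Case 2: stream only, no event (the case where the CPU spins).
    busyKernel<<<blocks, threads, 0, stream>>>(d, n, iters);
    CHECK(cudaGetLastError());
    CHECK(cudaStreamSynchronize(stream));

    CHECK(cudaEventDestroy(done));
    CHECK(cudaStreamDestroy(stream));
    CHECK(cudaFree(d));
    return 0;
}
```

Watching the process in a tool such as top while each wait is in progress is one way to compare the CPU usage of the two cases.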