So I can summit some work that will wait for the work on the host side done?
I found that there seemed to be no way to signal a CUDA event from host side. If it is so, I will have to summit the work on the GPU side after the work on host side it depends done. And I will end up have to summit everything depends on that GPU work after that.
Actually what I want is to keep my old architecture if possible. And it seems that it would much harder to pipeline the entire job this way, and make the logic much more complex…