I read on the Kepler whitepaper 110 this:
quote for HyperQ:
encountered false serialization acrosstasks,thereby limiting achievedGPUutilization, can see
up to dramatic performance increase without changing any existing code.
So far, I have only seen examples of utilizing HyperQ with dramatically changing code, to host parallell tasks in CPU memory within the codebase seems complex especially when I have to keep all running code within the same .cu file.
How can HyperQ be utilized without changing any existing code ?
I havent found anyway of parallelling tasks with the Nvidia API…