Kernel function invocation latency problem

Hi Nvidia

My question is simple …

Is there have any solution to reduce kernel function invocation latency ?

Thanks.