- enable persistence mode
- If you only intend to use a single GPU, then use the CUDA_VISIBLE_DEVICES environment variable to restrict your CUDA runtime footprint to that GPU only:
That is about all I know of that you can reasonably do to reduce init time. Init time may vary based on your exact program, CUDA version, driver version, OS (e.g. linux or windows), exact GPU being used, size of system memory, number of GPUs in the system (although see above) and probably other factors.
In some cases, enabling persistence mode can make a substantial difference in init time.