I’m deploying my inference software using OpenCV’s multi-backend DNN API.
I’m happy to support NVIDIA backends, at the cost of distributing ~200 MB of additional cuDNN files (cudnn_cnn_infer64_8.dll and cudnn_ops_infer64_8.dll) on top of ~100 MB of CUDA 10.x DLLs (cudart64_102.dll, cublas64_10.dll, cublasLt64_10.dll).
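Concretely, the files I redistribute alongside my application look roughly like this (the executable and OpenCV DLL names are placeholders, and smaller support files are omitted):

```
my_inference_app.exe        <- placeholder name
opencv_world4xx.dll         <- placeholder OpenCV build
cudart64_102.dll            \
cublas64_10.dll              | ~100 MB of CUDA 10.x DLLs
cublasLt64_10.dll           /
cudnn_ops_infer64_8.dll     \  ~200 MB of cuDNN DLLs
cudnn_cnn_infer64_8.dll     /
```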
But I found out that if I want to support the latest GeForce RTX 30x0 cards, I need to ship with CUDA 11.x.
Now the problem is that the two aforementioned cuDNN DLLs for CUDA 11 are 800 MB big!
What happened? Is there a way this could be split into smaller, more granular packages? I don’t need support for float16 or int8 inference, for instance; could we save space without these kernels?
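For reference, here is a minimal sketch of how my code enables the NVIDIA backend (the model file name is a placeholder; everything stays in FP32):

```cpp
#include <opencv2/dnn.hpp>

int main() {
    // "model.onnx" is a placeholder for my real network file.
    cv::dnn::Net net = cv::dnn::readNetFromONNX("model.onnx");

    // Ask for the CUDA backend (available when OpenCV's DNN
    // module is built with CUDA/cuDNN support).
    net.setPreferableBackend(cv::dnn::DNN_BACKEND_CUDA);

    // Plain FP32 inference is all I need -- I never request
    // DNN_TARGET_CUDA_FP16, and int8 is not used either.
    net.setPreferableTarget(cv::dnn::DNN_TARGET_CUDA);

    // Dummy 224x224 input, just to exercise the pipeline.
    cv::Mat image = cv::Mat::zeros(224, 224, CV_32FC3);
    net.setInput(cv::dnn::blobFromImage(image));
    cv::Mat out = net.forward();
    return 0;
}
```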
Thanks for your message. We agree that the growth of the cuDNN DLL size is problematic, and we’re working on resolving this. For example, you may have noticed that the size of cudnn_ops_infer was reduced by about 70% from 8.2.0 to 8.3.0. We know this doesn’t completely solve your problem; we mention it only to point out that we’re working on it.
To answer your specific questions:
> What happened?
As we’ve added more capabilities to cuDNN (e.g. new GPU architectures), the library size has grown.
> Is there a way this could be split into smaller, more granular packages? I don’t need support for float16 or int8 inference, for instance; could we save space without these kernels?
We are exploring various options for splitting the library further. It’s useful to know that your use case would benefit from splitting based on data type.
Thank you very much for your reply.
I had not noticed the DLL size reduction since I’m still shipping with CUDA 10. cudnn_ops_infer is indeed much smaller, but cudnn_cnn_infer has grown to 737 MB. Still, it’s great news to know that you’re working on it.
Can you confirm that there is no other way to support RTX 3080 cards?
Do you have any timeframe for a test release of the smaller packages?
Will you share on this forum which options you are considering for splitting the library?