I’m upgrading from Cuda 10.x to 11.x in order to support Ampere cards for a Windows application.
However not only the required cublas/cublasLt dlls went through a x5 size increase and still grow by tens of megabytes with each update but the cuddn*infer.dll also increased drastically in size.
I suppose that the trend is to optimize by hardcoding thousands of specialized kernels but this has really gone out of control!
what is the mimimum package that supports compute_86? I was expecting support from Cuda 11.0 but nvcc rejects anything above compute_80. Is Cuda 11.1 the minimum
are there any problems to expect when using older builds? say Cuda 11.2.2 w/ cudnn 8.1.1
From CUDA 11.0 onwards support Ampere.
Please refer to the support matrix doc to get more info.
Could you please let us know which steps are you following to upgrade CUDA. Have you tried uninstalling the old version and installing the new version again. Did it solve the problem?
@spolisetty Since my last message, I’ve seen in the release notes that Nvidia wanted to better split dlls in order to mitigate the size increase. I also so that release 8.5.0 had slightly smaller dlls.
However today I gave a look at cuda 11.8 + cudnn8.7.0 and again it’s a huge increase over previous version.
Is there any official word addressing that issue? Because redistributing programs with cuda supports becomes more and more difficult due to the size of the libraries