The cuDNN and cuBlas libraries take up ~800MB GPU memory. I found in this post Why tensorRT occupy many memory ? , said that the library could be shared among all processes. But in my experiment, each process will own a separate copy. Is there any method to share those libraries?
Hi kayccc,
I’m not using the Jetson platform. My hardware platform is NVIDIA 2080 Ti.
Hi there, I met the same problem that each process owns a separate copy. Have you solved your problem?
similar problem on jetson nx
Hi, I have same problem, how can I share cudnn dll for all process? any tutorial ?