Multiple Caffe models on single GPU

Silversparro · October 20, 2015, 10:35am

Hi,

We have already trained multiple Caffe models.

We want to use these models to make predictions simultaneously on a single GPU.

Is this possible? If yes, how do we do it?

We are getting the following error when we try it:

Check failed: status == CUDNN_STATUS_SUCCESS (8 vs. 0) CUDNN_STATUS_EXECUTION_FAILED

Can anyone help us resolve this error?

Thanks in advance for the help.

Thanks, Abhinav

little_jimmy · October 21, 2015, 6:32am

I cannot really provide an answer,

but it is clear to me that the error message is not helping much:
it gives little indication of why the execution failed.
Did an instance fail to allocate sufficient memory, etc.?

HannesF99 · October 21, 2015, 1:27pm

If you are invoking the prediction functions from multiple CPU processes, it should work.

If you are invoking the prediction functions from multiple CPU threads (in one application), then the failure might be because the cuDNN functions are not CPU-thread-safe (due to internal usage of global state like constant memory, texture references, … in the CUDA kernels).

See https://devtalk.nvidia.com/default/topic/491350/_constant_-memory-not-thread-safe-in-cuda-4-0/ or the Stack Overflow question "How to get GPU kernels using global texture references thread-safe (for multiple CPU threads using a single GPU)" or https://devtalk.nvidia.com/default/topic/711438/are-npp-routines-cpu-thread-safe-/

Generally, I conservatively assume that none of the black-box CUDA libraries (cuBLAS, cuFFT, NPP, cuDNN, …) is CPU-thread-safe. In our framework, for example, that means we do not call their functions from multiple CPU threads simultaneously.
