check failed : cudaMalloc() check failure stack trace : *** aborted(core dumped)

1393048706 · January 19, 2019, 10:01am

OS:ubuntu 16.04.4
OS type: 64-bit
GPU:GeForce GTX 750/PCIe/SSE2
Nvidia driver version:384.130
CUDA version:8.0
CUDNN version:7.0.5
TensorRT version:3.0.4

when I try to usr sample_mnist to complete myself code ,I get the same error all the time whenever I change the cudaMalloc parameter several time.

The error is:

Check failed: cudaMalloc(&buffers[inputIndex], 3*1 * INPUT_H * INPUT_W * sizeof(float))
*** Check failure stack trace: ***
Aborted (core dumped)

sampleSPHERE.cpp (7.09 KB)
prototxt.txt (14.1 KB)

J-Penny · January 24, 2019, 10:03am

Hello，
I met this problem also, have you solved it already? if so , it will be nice to tell me the method.
thanks a lot.

NVES · January 24, 2019, 9:41pm

hello,

can you please clarify what you meant by “get the same error all the time whenever I change the cudaMalloc parameter several time.”?

J-Penny · January 25, 2019, 2:10am

Hello,
platform:Jetson Xavier with Jetpack 4.1.1
I means that I also met the problem about cudaMalloc failed when I run the inference, so that I want to know the factors which will cause cudaMalloc to fail, and are there any restrictions on calling this function ?
Thank for your reply.
the errors:

WARNING: Logging before InitGoogleLogging() is written to STDERR
F0125 09:38:37.156844 10949 TensorRtCaffeModel.cpp:157] Check failed: cudaMalloc(&buffers[inputIndex], inputSize) 
*** Check failure stack trace: ***
Aborted (core dumped)

the code:

void doInference(IExecutionContext& context, float* input, float* output0,int* output1, int batchSize)
{
   const ICudaEngine& engine = context.getEngine();

   // input and output buffer pointers that we pass to the engine
   assert(engine.getNbBindings() == 3);
   void* buffers[3];

   /*data*/
   int inputIndex  = engine.getBindingIndex(INPUT_BLOB_NAME);
   DimsCHW inputDims = static_cast<DimsCHW&&>(engine.getBindingDimensions(inputIndex));
   size_t inputSize = batchSize * inputDims.c() * inputDims.h()*inputDims.w() * sizeof(float);
……
   // allocate GPU buffers and a stream, inputSize = 3*1024*1024*4
   CHECK(cudaMalloc(&buffers[inputIndex], inputSize)); 
   CHECK(cudaMalloc(&buffers[outputIndex0], outputSize0 ));
   CHECK(cudaMalloc(&buffers[outputIndex1], outputSize1 ));
……
}

NVES · January 25, 2019, 4:48pm

Hello,

Hello, please reference cudaMalloc for API limitations and restrictions. https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__MEMORY.html#group__CUDART__MEMORY_1g37d37965bfb4803b6d4e59ff26856356

But I see “check failed”. what was the error returned by cudaMalloc()? the core dump maybe a consequence of you continuing to use the faulted handle.

Topic		Replies	Views
Runtime API error 4: unspecified launch failure on cudaMalloc CUDA Programming and Performance	0	11906	July 28, 2011
Problem with cudaMalloc CUDA Programming and Performance	4	10187	October 29, 2008
cudaMalloc error in big loop CUDA Programming and Performance	12	15783	May 21, 2008
Errors trying to run the samples "cudaMalloc failed" CUDA Programming and Performance	3	3913	February 9, 2009
cudaMalloc() leads to segment fault Jetson TX1	9	4722	June 30, 2017
cudaMalloc error CUDA Programming and Performance	0	7319	March 16, 2010
cudaMalloc failing cuda malloc failing CUDA Programming and Performance	0	2034	August 8, 2011
what's wrong with cudaMalloc ? CUDA Programming and Performance	1	1632	March 26, 2010
Problem with malloc() and cudaMalloc() on Jetson TX1 CUDA Programming and Performance	2	1012	March 21, 2017
700 an illegal memory access was encountered CUDA Programming and Performance	1	1396	September 2, 2022

check failed : cudaMalloc() check failure stack trace : *** aborted(core dumped)

Related topics