Cuda Error

xcf1996 · December 16, 2020, 7:38am

Description

here is my problem, when i get yolov4 detect, i used GPU to decode the video stream to frame, then put the frame to tensorRT to inference, here came the promble which like this

inference elasped time:0.5551ms
post elasped time:0.0714ms
pre elasped time:1.0675ms
ERROR: C:\source\rtSafe\cuda\cudaElementWiseRunner.cpp (164) - Cuda Error in nvinfer1::rt::cuda::ElementWiseRunner::execute: 400 (invalid resource handle)
ERROR: FAILED_EXECUTION: Unknown exception
inference elasped time:0.5422ms
post elasped time:0.0723ms
pre elasped time:1.0797ms
ERROR: C:\source\rtSafe\cuda\cudaElementWiseRunner.cpp (164) - Cuda Error in nvinfer1::rt::cuda::ElementWiseRunner::execute: 400 (invalid resource handle)
ERROR: FAILED_EXECUTION: Unknown exception

but when i use the cpu to cap the frame, where it is ok ,can work, i search baidu,where some guy tell me ,the gpu should init once, but i tried, it didnt work

Environment

TensorRT Version: TensorRT-7.1.3.4
GPU Type: 1080 8G
Nvidia Driver Version: 451
CUDA Version: 11.0
CUDNN Version: cudnn-11.0-windows-x64-v8.0.1.13
Operating System + Version: win10
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

Exact steps/commands to build your repro
Exact steps/commands to run your repro
Full traceback of errors encountered

AakankshaS · December 17, 2020, 5:46pm

Hi @xcf1996,
Can you please try the same using latest TRT release.
Also, Cuda error is basically because of inappropriate CUDA driver.
Recommend you to check the support matrix for the same.

Thanks!

xcf1996 · December 28, 2020, 1:21am

nope，using latest TensorRT didn`t make it,since problem came from here

void Yolo::doInference(const unsigned char* input, const uint32_t batchSize)
{
//Timer timer;
assert(batchSize <= m_BatchSize && “Image batch size exceeds TRT engines batch size”);
NV_CUDA_CHECK(cudaMemcpyAsync(m_DeviceBuffers.at(m_InputBindingIndex), input,
batchSize * m_InputSize * sizeof(float), cudaMemcpyHostToDevice,
m_CudaStream));
std::mutex mtx;
mtx.lock();
std::cout << "加了一个锁… " << std::endl;
assert(m_Context != nullptr);
std::cout << "m_Context 不为空… " << std::endl;
if (!m_Context->enqueue(batchSize, m_DeviceBuffers.data(), m_CudaStream, nullptr))
std::cout << “入队列有问题，需要排查” << std::endl;
//m_Context->enqueue(batchSize, m_DeviceBuffers.data(), m_CudaStream, nullptr);
mtx.unlock();
for (auto& tensor : m_OutputTensors)
{
NV_CUDA_CHECK(cudaMemcpyAsync(tensor.hostBuffer, m_DeviceBuffers.at(tensor.bindingIndex),
batchSize * tensor.volume * sizeof(float),
cudaMemcpyDeviceToHost, m_CudaStream));
}
cudaStreamSynchronize(m_CudaStream);
//timer.out(“inference”);
}

the input is ok,but when it goto here
m_Context->enqueue(batchSize, m_DeviceBuffers.data(), m_CudaStream, nullptr)
it return 0??,do not make sense,the enqueue is a dll file which i can`t debug ,maybe m_CudaStream out of sync,or maybe just disapper,because i need use GPU to decode frame, i am confused,

AakankshaS · April 8, 2021, 6:33am

Hi @xcf1996 ,
apologies for the delay.
Are you still facing the issue?

Topic		Replies	Views
Segmentation Fault errors and some other (Deserialize the cuda engine failed) TensorRT tensorrt , cuda , ubuntu	1	472	July 4, 2023
Cuda Runtime Error when infering Onnx model TensorRT	3	958	October 11, 2021
[TensorRT] ERROR: 1: [resize.cu::performLinearKernelLaunch::457] Error Code 1: Cuda Runtime (invalid argument) TensorRT tensorrt , cupy	4	5119	June 14, 2022
Invalid resource handle when doing inference with TensorRT TensorRT	3	2176	August 10, 2022
Cuda Error in launchPwgenKernel- When running a specific engine in async TensorRT tensorrt	9	2154	June 11, 2022
Cuda Error Illegal Address DeepStream SDK	10	744	June 7, 2023
Error while moving data from cuda-capable device to host memory - Error Code 1: Cuda Runtime (unspecified launch failure) Jetson Nano tensorrt , cuda	2	573	October 15, 2021
Memory leak in IExecutionContext TRT6 TensorRT	1	1263	March 2, 2020
context or other operations about cuda is blocking ? TensorRT	14	1606	January 24, 2019
Cuda Error in nvinfer1::cudnn::findFastestTactic: 700 TensorRT	7	1300	December 15, 2021

Cuda Error

Description

Environment

Relevant Files

Steps To Reproduce

Related topics