TensorRT C++ optimization profile

Description

This issue is related to this one.

Now I'm trying to run this code in C++. I added an optimization profile and got this error:

terminate called after throwing an instance of 'std::bad_alloc'
what(): std::bad_alloc
Aborted

I'll send the script in a private message. Can you help, please?
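For context, the optimization profile was added roughly like this (a minimal sketch, not my exact code; the tensor name "input" and the MNIST-like shapes are placeholders):

#include "NvInfer.h"

// Minimal sketch of adding an optimization profile with the TensorRT 7 C++
// API. The tensor name and the shapes below are placeholders for illustration.
void addProfile(nvinfer1::IBuilder* builder, nvinfer1::IBuilderConfig* config)
{
    nvinfer1::IOptimizationProfile* profile = builder->createOptimizationProfile();
    // One min/opt/max shape per dynamic input; here only the batch dim varies.
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kMIN, nvinfer1::Dims4{1, 1, 28, 28});
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kOPT, nvinfer1::Dims4{4, 1, 28, 28});
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kMAX, nvinfer1::Dims4{8, 1, 28, 28});
    config->addOptimizationProfile(profile);
}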

Environment

TensorRT Version : 7.2.3.4
GPU Type : GeForce GTX 1060 6 GB
Nvidia Driver Version : 440.33.01
CUDA Version : 10.2
CUDNN Version : 7.1
Operating System + Version : Ubuntu 18.04
Python Version (if applicable) : 3.6
TensorFlow Version (if applicable) : 2.3.1

Hi,
Could you share the ONNX model and the script, if you haven't already, so that we can assist you better?
In the meantime, you can try a few things:

  1. Validate your model with the snippet below:

check_model.py

import onnx

# Load the model and run ONNX's structural checks;
# check_model raises an exception if the model is malformed.
filename = "yourONNXmodel"  # replace with the path to your .onnx file
model = onnx.load(filename)
onnx.checker.check_model(model)
  2. Try running your model with the trtexec command (for example, trtexec --onnx=yourONNXmodel.onnx --verbose):
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec
If you are still facing the issue, please share the trtexec --verbose log for further debugging.
Thanks!

Hi!
I sent the script and model to you via private message.

@v.stadnichuk,

This looks like a memory allocation issue; std::bad_alloc is exactly the error thrown when an allocation fails. Your memory usage may be right on the edge, so please make sure enough memory is available. Try running it again while monitoring memory usage with ps or top. Let us know if you still face this issue.
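Also note that the TensorRT samples throw std::bad_alloc when a device-side cudaMalloc fails too, so it is worth checking free GPU memory alongside host memory. A minimal sketch using the CUDA runtime API (the helper name logDeviceMemory is ours, not part of the sample):

#include <cuda_runtime_api.h>
#include <cstdio>

// Print free/total device memory before the failing allocation, to tell
// host memory pressure apart from GPU memory pressure.
void logDeviceMemory()
{
    size_t freeBytes = 0, totalBytes = 0;
    if (cudaMemGetInfo(&freeBytes, &totalBytes) == cudaSuccess)
    {
        std::printf("GPU memory: %zu MiB free of %zu MiB\n",
                    freeBytes >> 20, totalBytes >> 20);
    }
}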

Thank you.

Hi @spolisetty !
Thank you for the support! I followed your advice, but I have enough memory and the issue is still present. I have also attached screenshots showing the available memory.


@v.stadnichuk,

Thank you for the confirmation. Could you please DM us the issue repro script/model and the complete error logs for better assistance?

@spolisetty
Done.

@v.stadnichuk,

Thank you for sharing the steps and files for the issue repro. At the "sudo make clean && sudo make VERBOSE=TRUE" step we are facing issues related to the make config. Are you able to run this step successfully?
Please let us know in case you made any changes later.

Yes, I am able to run this step. Which issue are you seeing?
You are probably having trouble with OpenCV: to build the project you need to install OpenCV first.
Please share the issue trace.

The script failed at this step:

samplesCommon::BufferManager buffers(mEngine);

This is line 222 in sampleOnnxMNIST.cpp.

The failure originates in TensorRT-7.2.3.4/samples/common/buffers.h, in the GenericBuffer constructor:

if (!allocFn(&mBuffer, this->nbBytes()))
{
    throw std::bad_alloc();
}

@v.stadnichuk,

Could you please share the latest complete error logs for better assistance?

Thank you.

@spolisetty

Please find the log attached:
log.txt (1.1 KB)

@v.stadnichuk,

As you mentioned in the repro steps, could you let us know which changes you made in Makefile.config?

Also, have you tried a smaller batch size? And could you please share the ONNX model?
We recommend trying the latest TRT version, 8.0.

@spolisetty

Please find attached the original Makefile.config (Makefile_orig.config) and the modified Makefile.config:
Makefile.config (14.4 KB)
Makefile_orig.config (12.0 KB)

You can find the ONNX model in the archive I sent you via DM, at the following path:

TensorRT-7.2.3.4/samples/sampleOnnxMNIST/apm_one_input.onnx

I also use only one image for checking the model: 1.jpg, at the same path.

@spolisetty
As far as I can see, the problem originates here:
TensorRT-7.2.3.4/samples/sampleOnnxMNIST/sampleOnnxMNIST.cpp:
line 222:

samplesCommon::BufferManager buffers(mEngine);

Next:
TensorRT-7.2.3.4/samples/common/buffers.h:
line 270:

manBuf->deviceBuffer = DeviceBuffer(vol, type);

Next:
GenericBuffer constructor, line 74:

if (!allocFn(&mBuffer, this->nbBytes()))

Next:
line 184:

return cudaMalloc(ptr, size) == cudaSuccess;

So I guess the issue is with the buffer allocation. Could you help? Memory is available.
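One guess (illustrative only, not the sample's exact code): if the engine still reports dynamic (-1) dimensions at the point where the buffers are sized, the volume computation would multiply the -1 in, and the byte count would wrap around to a huge value that no GPU can allocate:

#include <cstddef>
#include <cstdint>

// Illustration of the suspected failure mode: a dynamic dimension of -1
// makes the element count negative, and the cast to size_t wraps it to an
// enormous byte count, so the subsequent cudaMalloc fails.
std::size_t bytesFor(const int* dims, int nbDims, std::size_t elemSize)
{
    std::int64_t volume = 1;
    for (int i = 0; i < nbDims; ++i)
    {
        volume *= dims[i]; // dims[i] == -1 flips the sign
    }
    return static_cast<std::size_t>(volume) * elemSize; // wraps near 2^64
}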

Hi @v.stadnichuk,

I went through the changes you made in sampleOnnxMNIST.cpp.
Modifying this script may not be a good idea, as it may lead to errors. Based on our understanding, it looks like you are trying to build an inference script using the TensorRT C++ API. We recommend building a separate script, and also making sure TensorRT is installed correctly.
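One quick way to check whether dynamic shapes are the problem is to print every binding's dimensions just before the buffers are created; a minimal sketch (the helper name dumpBindings is ours, not part of the sample):

#include <cstdio>
#include "NvInfer.h"

// Dump every binding's dims before BufferManager allocates them.
// A -1 in any dimension means the shape is still dynamic at this point.
void dumpBindings(const nvinfer1::ICudaEngine& engine)
{
    for (int b = 0; b < engine.getNbBindings(); ++b)
    {
        nvinfer1::Dims d = engine.getBindingDimensions(b);
        std::printf("binding %d (%s):", b, engine.getBindingName(b));
        for (int i = 0; i < d.nbDims; ++i)
        {
            std::printf(" %d", d.d[i]);
        }
        std::printf("\n");
    }
}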

The following resources may help you; let us know if you face issues. Create the optimization profile the way you did in the sample ONNX script:
Developer Guide :: NVIDIA Deep Learning TensorRT Documentation
https://github.com/NVIDIA/TensorRT/blob/master/samples/opensource/sampleDynamicReshape/sampleDynamicReshape.cpp
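As a rough sketch of that dynamic-shape flow, using the sample's mEngine and SampleUniquePtr helpers and assuming the TRT 7 sample's BufferManager from buffers.h (which can take the execution context as its third argument so that buffer sizes come from concrete binding dimensions rather than -1 placeholders):

// Sketch: with an explicit-batch engine and an optimization profile, fix the
// input shape on the context first, then size buffers from the context.
auto context = SampleUniquePtr<nvinfer1::IExecutionContext>(mEngine->createExecutionContext());
context->setBindingDimensions(0, nvinfer1::Dims4{1, 1, 28, 28}); // binding 0 = input (assumed)
if (!context->allInputDimensionsSpecified())
{
    return false;
}
// buffers.h can then compute nbBytes() from concrete dims, not -1 placeholders.
samplesCommon::BufferManager buffers(mEngine, 0, context.get());
bool status = context->executeV2(buffers.getDeviceBindings().data());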

Thank you.

Hi @spolisetty

Thank you for the help.
I tried the script described there and ran into some trouble. Now I need to define an optimization profile and an explicit batch, but I get this error:

trt_sample.cpp:147:26: error: 'NetworkDefinitionCreationFlag' has not been declared
    static_cast(NetworkDefinitionCreationFlag::kEXPLICIT_BATCH))};

Can you assist with it? The code at the links you provided is different, and the library usage is also different.
I'll send you the script and ONNX via DM.
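For reference, this compiler error usually means the nvinfer1 namespace qualification is missing and the static_cast lacks its target type; a minimal sketch of the usual explicit-batch network creation (assuming NvInfer.h from the TensorRT 7 install is on the include path):

#include <cstdint>
#include "NvInfer.h"

// Sketch: NetworkDefinitionCreationFlag lives in the nvinfer1 namespace, and
// static_cast needs an explicit target type to turn the enum into a flag bit.
nvinfer1::INetworkDefinition* makeExplicitBatchNetwork(nvinfer1::IBuilder* builder)
{
    const std::uint32_t explicitBatch =
        1U << static_cast<std::uint32_t>(nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_BATCH);
    return builder->createNetworkV2(explicitBatch);
}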