Myelin memory budget exceeded while building TensorRT engine with batch > 1

saifullah3396 · November 26, 2020, 11:45am

Description

Hi, I am trying to convert an ONNX model to a TensorRT engine. My build command is as follows:

trtexec --fp16 --explicitBatch \
    --workspace=2048 \
    --onnx="model.onnx" --saveEngine="model.trt" \
    --minShapes=\'input\':1x1x64x16 \
    --optShapes=\'input\':1x1x64x320 \
    --maxShapes=\'input\':2x1x64x3200

I am facing the following error while building the engine:

[11/26/2020-16:27:24] [E] [TRT] ../builder/myelin/codeGenerator.cpp (338) - Myelin Error in compileGraph: 69 (myelinExceededMemBudget : Exceeded mem budget of 4294967296. Need 5338390656

I am unable to find any relevant information about this library (Myelin) to figure this out. Is there any way to increase the maximum memory limit here? This error only comes up when building the engine with batch size > 1 (I have set maxShapes to 2x1x64x3200). I have tested it with batch size 1 and it works well.
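To put the numbers from the error message in perspective, here is a small sketch that converts them to GiB and MiB. The byte counts are copied from the log line above; the only assumption is that the `--workspace` value is interpreted in MiB, which is how trtexec documents that flag.

```python
# Figures copied from the error message and the build command above.
BUDGET_BYTES = 4294967296   # "Exceeded mem budget of ..."
NEEDED_BYTES = 5338390656   # "Need ..."
WORKSPACE_MIB = 2048        # value passed to --workspace (assumed to be MiB)

MIB = 1 << 20
GIB = 1 << 30

budget_gib = BUDGET_BYTES / GIB
needed_gib = NEEDED_BYTES / GIB
shortfall_mib = (NEEDED_BYTES - BUDGET_BYTES) / MIB
workspace_gib = WORKSPACE_MIB * MIB / GIB

print(f"Myelin budget : {budget_gib:.2f} GiB")
print(f"Myelin needs  : {needed_gib:.2f} GiB")
print(f"Shortfall     : {shortfall_mib:.1f} MiB")
print(f"--workspace   : {workspace_gib:.2f} GiB")
```

The budget works out to exactly 4 GiB while the command line asks for a 2 GiB workspace, so the two limits are not obviously the same setting. That is only an observation from the numbers, not a statement about how Myelin derives its budget.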

The ONNX model that I want to convert is exported from PyTorch with the following configuration:

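import torch

# `model` and `verbose` are defined earlier in the script (not shown).
# Dummy input used for tracing: batch=2, channels=1, height=64, width=640.
x = torch.ones((2, 1, 64, 640), dtype=torch.float)
torch.onnx.export(
    model,
    x,
    'output.onnx',
    input_names=['input'],
    output_names=['scores'],
    verbose=verbose,
    opset_version=11,
    # Batch (axis 0) and width (axis 3) are exported as dynamic axes.
    dynamic_axes={'input': {0: 'batch', 3: 'width'}})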

Additionally I have attached the model files and error log below.

Environment

TensorRT Version: 7.0.0.11
GPU Type: TITAN X (Pascal)
Nvidia Driver Version: 455
CUDA Version: 10.2
CUDNN Version: 7.6.5
Operating System + Version: Ubuntu 18.04
Python Version (if applicable): 3
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

log.txt (431.7 KB) model.txt (1.7 KB) modules.txt (9.5 KB)

Any help would be appreciated. Thanks.

AakankshaS · November 27, 2020, 5:47am

Hi @saifullah3396,
Can you try increasing the workspace size with setMaxWorkspaceSize?
Also, if the issue persists, can you share the model in ONNX format?

Thanks!
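For readers building through the Python API instead of trtexec, the suggestion above might look roughly like the sketch below. It assumes a TensorRT 7-era API in which the builder config exposes `max_workspace_size` in bytes; `make_builder_config` and the two conversion helpers are hypothetical names introduced for illustration, and newer TensorRT releases may expose this limit differently.

```python
def mib_to_bytes(mib):
    """Convert MiB (the unit trtexec uses for --workspace) to bytes."""
    return int(mib) * (1 << 20)


def gib_to_bytes(gib):
    """Convert GiB to bytes."""
    return int(gib * (1 << 30))


def make_builder_config(builder, workspace_bytes):
    """Create a builder config with the given workspace limit.

    `builder` is expected to be a tensorrt.Builder. The attribute
    `max_workspace_size` is the TensorRT 7-era Python property
    (assumption: it matches the installed version).
    """
    config = builder.create_builder_config()
    config.max_workspace_size = workspace_bytes
    return config


# Example values: the 2048 MiB from the original command and a 6 GiB limit.
print(mib_to_bytes(2048))
print(gib_to_bytes(6))
```

Whether a larger workspace also raises the Myelin budget is not established anywhere in this thread, so treat this as the mechanics of the setting only.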

saifullah3396 · November 27, 2020, 9:26am

Hi @AakankshaS, thanks for your response. I have tried setting the max workspace size up to 6 GB, but I get the same error. One important thing to mention is that I don't get this error, and can build the engine even with a batch size of 10, as long as the batch size is kept constant across min/opt/max, like this:

trtexec --fp16 --explicitBatch \
    --workspace=2048 \
    --onnx="model.onnx" --saveEngine="model.trt" \
    --minShapes=\'input\':10x1x64x16 \
    --optShapes=\'input\':10x1x64x320 \
    --maxShapes=\'input\':10x1x64x3200

Since my model has two dynamic dimensions, the batch (from 1 to N) and the width (from 16 to 3200), am I specifying the shapes the wrong way? There is not much information about this in the TensorRT documentation, so I tried it this way thinking it would work. Also, does this mean it could work if I create a separate min/opt/max profile for each batch size, like this:

Profile 0:
--minShapes=\'input\':1x1x64x16 \
--optShapes=\'input\':1x1x64x320 \
--maxShapes=\'input\':1x1x64x3200

Profile 1:
--minShapes=\'input\':2x1x64x16 \
--optShapes=\'input\':2x1x64x320 \
--maxShapes=\'input\':2x1x64x3200

Profile N:
--minShapes=\'input\':Nx1x64x16 \
--optShapes=\'input\':Nx1x64x320 \
--maxShapes=\'input\':Nx1x64x3200
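The per-batch profiles listed above could be generated programmatically. The sketch below assumes the engine is built through the TensorRT Python API rather than trtexec; `batch_profiles` and `add_profiles` are hypothetical helper names, while `create_optimization_profile`, `set_shape`, and `add_optimization_profile` are the builder calls used to register profiles. Whether this avoids the Myelin budget error is not confirmed anywhere in this thread.

```python
def batch_profiles(max_batch, channels=1, height=64,
                   min_width=16, opt_width=320, max_width=3200):
    """Return one (min, opt, max) shape triple per fixed batch size 1..max_batch."""
    return [
        ((b, channels, height, min_width),
         (b, channels, height, opt_width),
         (b, channels, height, max_width))
        for b in range(1, max_batch + 1)
    ]


def add_profiles(builder, config, input_name, shapes):
    """Register one optimization profile per shape triple.

    `builder` and `config` are expected to be a tensorrt.Builder and the
    config returned by builder.create_builder_config().
    """
    for min_shape, opt_shape, max_shape in shapes:
        profile = builder.create_optimization_profile()
        profile.set_shape(input_name, min_shape, opt_shape, max_shape)
        config.add_optimization_profile(profile)


# Shapes for Profile 0 and Profile 1 from the post above.
for triple in batch_profiles(2):
    print(triple)
```

With multiple profiles in one engine, the profile matching the incoming batch size has to be selected on the execution context at inference time, which adds some bookkeeping compared with a single dynamic-batch profile.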

AakankshaS · November 29, 2020, 6:32pm

Hi @saifullah3396,
Kindly refer to the link below on batching:
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#batching
