addRNNv2 error when using dynamic sequence length

bei.chen · January 21, 2020, 3:36am

I am using TensorRT7 to create a model which includes an RNN structrue, the code for adding an RNN layer is:

auto rnn = network->addRNNv2(
        *inputData,
        mNumLayers,
        mHiddenSize,
        1000,
        nvinfer1::RNNOperation::kLSTM);

the dimensions of inputData are:

nvinfer1::Dims inputDims{3, {-1, -1, 2688}, {nvinfer1::DimensionType::kINDEX, nvinfer1::DimensionType::kSEQUENCE, nvinfer1::DimensionType::kCHANNEL}};

when I compile the code, the following error is reported:

[E] [TRT] Parameter check failed at: ../builder/Network.cpp::addRNNCommon::572, condition: input.getDimensions().d[di.seqLen()] == maxSeqLen

I try to change the maxSeqLen parameter from 1000 to -1, compiles successfully, but fails to run:

[E] [TRT] Parameter check failed at: ../builder/Network.cpp::addRNNCommon::570, condition: maxSeqLen > 0

could somebody tell me how to solve this problem?

SunilJB · January 21, 2020, 6:27am

Hi,

Could you please share the script and model file so we can help better?
Also, can you provide details on the platforms you are using:
o Linux distro and version
o GPU type
o Nvidia driver version
o CUDA version
o CUDNN version
o Python version [if using python]
o Tensorflow and PyTorch version
o TensorRT version

Thanks

bei.chen · January 21, 2020, 6:46am

Hi, SunilJB, this is the platforms information :

Ubuntu 16.04.6 LTS (Xenial Xerus)
GeForce RTX 2080 Ti
295.41
cuda10.0
cuda7.6
tensorrt7.0

dsq0720 · January 21, 2020, 6:53am

I got the same problem, but havn’t found a solution.

SunilJB · January 21, 2020, 8:34am

Hi,

Could you please check if network definition are created with the explicitBatch flag set?
Also, if possible please share the script file so we can better help.

Thanks

bei.chen · January 21, 2020, 8:52am

yes, I have set the explicitBatch flag:

nvinfer1::ICudaEngine* getBRNNEngine(asrSample::LSTM::ptr brnn)
{
    nvinfer1::IBuilder* builder = nvinfer1::createInferBuilder(gLogger);
    nvinfer1::IBuilderConfig* config = builder->createBuilderConfig();
    builder->setMaxBatchSize(gMaxBatchSize);
    config->setMaxWorkspaceSize(gMaxWorkspaceSize);
    if (gFp16) {
        config->setFlag(nvinfer1::BuilderFlag::kFP16);
        config->setFlag(BuilderFlag::kSTRICT_TYPES);
        builder->setFp16Mode(true);
    }   

    nvinfer1::INetworkDefinition* network = builder->createNetworkV2(1U << static_cast<uint32_t>(NetworkDefinitionCreationFlag::kEXPLICIT_BATCH));

    nvinfer1::Dims inputDims{3, {-1, -1, 2688}, {nvinfer1::DimensionType::kINDEX, nvinfer1::DimensionType::kSEQUENCE, nvinfer1::DimensionType::kCHANNEL}};
    nvinfer1::Dims stateDims = brnn->getStateDims();
    nvinfer1::Dims sequenceLengthDims{0, {}, {}};

    auto inputTensor = network->addInput("encoder_brnn_input_data", nvinfer1::DataType::kFLOAT, inputDims);
    auto sequenceLengthTensor = network->addInput("encoder_sequence_length", nvinfer1::DataType::kINT32, sequenceLengthDims);
    auto hiddenStateTensor = network->addInput("encoder_hidden_state", nvinfer1::DataType::kFLOAT, stateDims);
    auto cellStateTensor = network->addInput("encoder_cell_state", nvinfer1::DataType::kFLOAT, stateDims);

    nvinfer1::ITensor *outputState, *lastHiddenState, *linearOutput, *actOutput;

    brnn->addToModel(network, inputTensor, sequenceLengthTensor, hiddenStateTensor, cellStateTensor, &outputState, &lastHiddenState);
    outputState->setName("brnn_output");
    network->markOutput(*outputState);

    auto profile = builder->createOptimizationProfile();
    profile->setDimensions(inputTensor->getName(), OptProfileSelector::kMIN, Dims3{1, 1, 2688});
    profile->setDimensions(inputTensor->getName(), OptProfileSelector::kOPT, Dims3{50, 150, 2688});
    profile->setDimensions(inputTensor->getName(), OptProfileSelector::kMAX, Dims3{100, 300, 2688});
    config->addOptimizationProfile(profile);

    samplesCommon::enableDLA(builder, config, gUseDLACore);
    auto res = builder->buildEngineWithConfig(*network, *config);

    network->destroy();
    builder->destroy();
    config->destroy();

    return res;
}

void LSTM::addToModel(
    nvinfer1::INetworkDefinition* network,
    nvinfer1::ITensor* inputData,
    nvinfer1::ITensor* sequenceLength,
    nvinfer1::ITensor* hiddenState,
    nvinfer1::ITensor* cellState,
    nvinfer1::ITensor** outputState,
    nvinfer1::ITensor** lastHiddenState)
{
    //int maxSeqLen = inputData->getDimensions().d[0];
    int maxSeqLen = -1;
    auto rnn = network->addRNNv2(
        *inputData,
        mNumLayers,
        mHiddenSize,
        maxSeqLen,
        nvinfer1::RNNOperation::kLSTM);
    assert(rnn != nullptr);

    rnn->setInputMode(nvinfer1::RNNInputMode::kLINEAR);
    rnn->setDirection(nvinfer1::RNNDirection::kBIDIRECTION);
    rnn->setSequenceLengths(*sequenceLength);

    std::vector<nvinfer1::RNNGateType> gateOrder({nvinfer1::RNNGateType::kINPUT,
                                                  nvinfer1::RNNGateType::kFORGET,
                                                  nvinfer1::RNNGateType::kCELL,
                                                  nvinfer1::RNNGateType::kOUTPUT});
    for (size_t i = 0; i < mGateKernelWeights.size(); i++)
    {
        bool isW = ((i%8) < 4);
        rnn->setWeightsForGate(i/8, gateOrder[i % 4], isW, mGateKernelWeights[i]);
        rnn->setBiasForGate(i/8, gateOrder[i % 4], isW, mGateBiasWeights[i]);
    }

    rnn->setHiddenState(*hiddenState);
    rnn->setCellState(*cellState);

    *outputState = rnn->getOutput(0);
    *lastHiddenState = rnn->getOutput(1);
}

Also, I find that if using dynamic batch dimension, the memory usage will increased quickly，it is very easy to out of gpu memory.

SunilJB · January 27, 2020, 6:05am

Hi,

Try the ILoop interface introduced in TensorRT 7 and not our legacy RNN interfaces.
Please refer below sample for iLoop implementation:
https://github.com/NVIDIA/TensorRT/blob/572d54f91791448c015e74a4f1d6923b77b79795/samples/opensource/sampleCharRNN/sampleCharRNN.cpp

Thanks

bei.chen · February 25, 2020, 10:31am

Hi,

I want to use addRNNv2 interface, because it’s simple to use, the ILoop interface is a little complicated. However, I will try ILoop interface, hope that addRNNv2 interface supports dynamic shape in the future.

Thanks

bei.chen · February 28, 2020, 2:32am

Hi SunilJB,

I have some issues about the sample you sent to me:
https://github.com/NVIDIA/TensorRT/blob/572d54f91791448c015e74a4f1d6923b77b79795/samples/opensource/sampleCharRNN/sampleCharRNN.cpp

when this sample uses iLoop, the network definition are created with the explicitBatch flag set, but the dimensions of input tensor have no explicit batch dimension, and I have not see where the batch size is set:

nvinfer1::ITensor* data = network->addInput(mParams.bindingNames.INPUT_BLOB_NAME, nvinfer1::DataType::kFLOAT,nvinfer1::Dims2(mParams.seqSize, mParams.dataSize));

Thanks

bei.chen · February 28, 2020, 6:24am

In addition, ILoop interface also needs input max sequence length tensor, which should be const value, so the problem is same as addBRNNv2 interface:

nvinfer1::ITensor* maxSequenceSize = network->addConstant(nvinfer1::Dims{}, Weights{DataType::kINT32, &mParams.seqSize, 1})->getOutput(0);

bei.chen · March 2, 2020, 3:30am

Hi SunilJB,

According to DEFINE_BUILTIN_OP_IMPORTER(LSTM) , I know how to use ILoop interface for supporting dynamic sequence length:

github.com

onnx/onnx-tensorrt/blob/main/builtin_op_importers.cpp

/*
 * SPDX-License-Identifier: Apache-2.0
 */

#include "builtin_op_importers.hpp"
#include "ConditionalHelpers.hpp"
#include "LoopHelpers.hpp"
#include "ModelImporter.hpp"
#include "NvInfer.h"
#include "NvInferPlugin.h"
#include "NvInferRuntime.h"
#include "OnnxAttrs.hpp"
#include "RNNHelpers.hpp"
#include "ShapeTensor.hpp"
#include "half.h"
#include "onnx2trt_utils.hpp"

#include <algorithm> // For std::min, std::max
#include <array>
#include <cmath>

This file has been truncated. show original

Thanks!

Topic		Replies	Views
Dynamic parameter "maxSeqLen" for addRNNv2 TensorRT	3	567	December 1, 2020
How could I set the maxSeqLen of addRNNv2 when using dynamic shape? TensorRT	5	1460	February 26, 2020
Calling addRNNv2 with multiple batches TensorRT	0	800	June 18, 2019
This version of TensorRT does not support dynamic ReverseSequence length TensorRT tensorflow	3	3285	February 17, 2022
Tensorrt Error when using dynamic batch: data: kMIN dimensions in profile 0 are [24,3,224,224] but input has static dimensions [48,3,224,224] TensorRT	3	2517	February 8, 2022
CUDNN_STATUS_EXECUTION_FAILED ERROR by call cudnnRNNForwardTrainingEx function. cuDNN	3	1894	February 12, 2020
TensorRT warnings : Dynamic dimensions required for input: input, but no shapes were provided TensorRT	2	1742	October 12, 2021
how could i add a lstm layer(set max_seq_length and set reverse weights)? TensorRT	1	1192	June 4, 2020
TF-TRT RNN NMT model optimise, Input tensor with shape [?,?] TensorRT	0	659	May 29, 2019
Create TesorRT with dynamic batch TensorRT	8	2288	September 8, 2020

addRNNv2 error when using dynamic sequence length

Related topics