CUDNN_STATUS_EXECUTION_FAILED error when calling the cudnnRNNForwardTrainingEx function.

I want to use cudnnRNNForwardTrainingEx with variable sequence lengths, but I get an error when I set the variable sequence lengths by calling

cudnnSetRNNDataDescriptor(_x_data_desc
                      , _cudnn_data_type
                      , CUDNN_RNN_DATA_LAYOUT_SEQ_MAJOR_UNPACKED//curent fix time major
                      , step
                      , batch
                      , _input_size
                      , seq_len
                      , (void *)&fill_value //
                      ); 
       cudnnSetRNNDataDescriptor(_y_data_desc
                      , _cudnn_data_type
                      , CUDNN_RNN_DATA_LAYOUT_SEQ_MAJOR_UNPACKED//curent fix time major
                      , step
                      , batch
                      , _hidden_size * _dir_count
                      , seq_len
                      , (void *)&fill_value //, NULL
                      );

Then I call

CudnnRunCheckRet(
                    cudnnRNNForwardTrainingEx(
                        _handle
                        , _rnn_desc
                        , _x_data_desc, gx->void_ptr()
                        , _ht_desc, init_ht
                        , _ct_desc, init_ct
                        , _par_desc, gw->void_ptr()
                        , _y_data_desc, gy->void_ptr()
                        , _ht_desc, out_ht
                        , _ct_desc, out_ct
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , g_work->void_ptr(), _work_size
                        , g_reser->void_ptr(), _reserver_size
                        ));

I get a CUDNN_STATUS_EXECUTION_FAILED error. It only runs correctly when every entry in seq_len equals step (the maximum sequence length). What is the problem?

Env Detail :
Tesla T4
NVIDIA-SMI 440.33.01 Driver Version: 440.33.01 CUDA Version: 10.2
Cudnn v7.6.5

Sample code detail :

auto rnn_direction_mode = _bi_direction ?
                     CUDNN_BIDIRECTIONAL : CUDNN_UNIDIRECTIONAL;

        CudnnRunCheckRet(cudnnDropoutGetStatesSize(_handle, &_dropout_state_size));
        CudaRunCheckRet(cudaMalloc(&_dropout_state_ptr, _dropout_state_size));
        CudnnRunCheckRet(cudnnSetDropoutDescriptor(_dropout_desc, _handle, 0.0f
                            , _dropout_state_ptr, _dropout_state_size, 0));
        CudnnRunCheckRet(cudnnSetRNNDescriptor_v6(_handle
                        , _rnn_desc
                        , _hidden_size, _stack_layer_num
                        , _dropout_desc, _rnn_input_mode
                        , rnn_direction_mode, _cudnn_rnn_mode
                        , CUDNN_RNN_ALGO_STANDARD
                        , _cudnn_data_type));
       CudnnRunCheckRet(
                cudnnSetRNNMatrixMathType(_rnn_desc, CUDNN_TENSOR_OP_MATH));
       float fill_value = 0;
        CudnnRunCheckRet(
                cudnnSetRNNDataDescriptor(_x_data_desc
                    , _cudnn_data_type
                    , CUDNN_RNN_DATA_LAYOUT_SEQ_MAJOR_UNPACKED//curent fix time major
                    , step
                    , batch
                    , _input_size
                    , seq_len
                    , (void *)&fill_value //, NULL
                    ));
        CudnnRunCheckRet(
                cudnnSetRNNDataDescriptor(_y_data_desc
                    , _cudnn_data_type
                    , CUDNN_RNN_DATA_LAYOUT_SEQ_MAJOR_UNPACKED//curent fix time major
                    , step
                    , batch
                    , _hidden_size * _dir_count
                    , seq_len
                    , (void *)&fill_value //, NULL
                    ));

         _x0_desc.set3dDesc(_cudnn_data_type, batch, _input_size, 1);
        _y0_desc.set3dDesc(_cudnn_data_type, batch, _hidden_size * _dir_count, 1);
        _state_size = 2 * _dir_count * _stack_layer_num * batch * _hidden_size ;
        _ht_desc.set3dDesc(_cudnn_data_type, _stack_layer_num * _dir_count, batch, _hidden_size);
        _ct_desc.set3dDesc(_cudnn_data_type, _stack_layer_num * _dir_count, batch, _hidden_size);
         CudnnRunCheckRet(
                    cudnnRNNForwardTrainingEx(
                        _handle
                        , _rnn_desc
                        , _x_data_desc, gx->void_ptr()
                        , _ht_desc, init_ht
                        , _ct_desc, init_ct
                        , _par_desc, gw->void_ptr()
                        , _y_data_desc, gy->void_ptr()
                        , _ht_desc, out_ht
                        , _ct_desc, out_ct
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , nullptr, nullptr
                        , g_work->void_ptr(), _work_size
                        , g_reser->void_ptr(), _reserver_size
                        ));

Can anyone tell me what the problem is? Thanks

Hi,

Could you please check whether your sequence lengths satisfy the condition below:
Each element in seqLengthArray must be greater than 0 but less than or equal to maxSeqLength. In the packed layout, the elements should be sorted in descending order.
https://docs.nvidia.com/deeplearning/sdk/cudnn-api/index.html#cudnnSetRNNDataDescriptor

Thanks

When all sequence lengths are set to the same value (max sequence length - 1), I get the CUDNN_STATUS_EXECUTION_FAILED error; it only runs correctly when they are all set to the max sequence length.

Hi,

Could you please share the sample repro script so we can help better?

Thanks