The number of network parameters calculated by the two methods is inconsistent

cqiao0 · June 30, 2021, 2:27am

I use evolutionary method to train LSTM network, calculate and assign weights and bias values to the network according to the example method(cuDNN-sample/RNN_example.cu at master · Hardware-Alchemy/cuDNN-sample · GitHub). But netparamstotal = weight_ size / size of (float) is inconsistent with the sum of weightscounts and biascounts. Why? How to calculate and assign network parameters correctly?

In addition, change the bias mode (CUDNN_ RNN_ BIAS/CUDNN_ RNN_ DOUBLE_BIAS) Why is netparamstotal unchanged?

Code snippet:

size_t weight_size;

    checkCudnnErrors(cudnnGetRNNWorkspaceSize(cudnnHandle, rnn_desc, seq_length, x_desc, &workspace_size));
    checkCudnnErrors(cudnnGetRNNParamsSize(cudnnHandle, rnn_desc, x_desc[0], &weight_size, CUDNN_DATA_FLOAT));
    checkCudaErrors(cudaMalloc((void **) &weights, weight_size));
    checkCudaErrors(cudaMalloc((void **) &workspace, workspace_size));

    // initialize filter descriptors
//    cudnnFilterDescriptor_t w_desc;
    netParamsTotal=weight_size / sizeof(float);
    int dimW[] = {static_cast<int>(weight_size / sizeof(float)), 1, 1};
    checkCudnnErrors(cudnnCreateFilterDescriptor(&w_desc));
    checkCudnnErrors(cudnnSetFilterNdDescriptor(w_desc, CUDNN_DATA_FLOAT, CUDNN_TENSOR_NCHW, 3, dimW));

    // weights
    size_t linearIndex = 0;
    for (int layer = 0; layer < num_layers; layer++) {
        cudnnDataType_t data_type;
        cudnnTensorFormat_t format;
        int nb_dim, filter_dim[3];
        cudnnFilterDescriptor_t linear_filter_desc, linear_bias_desc;
        float *linear_layer_filter, *linear_bias=nullptr;

        for (int linear_layer = 0; linear_layer < num_linear_layers; ++linear_layer) {
            // filter
            checkCudnnErrors(cudnnCreateFilterDescriptor(&linear_filter_desc));
            checkCudnnErrors(cudnnGetRNNLinLayerMatrixParams(cudnnHandle, rnn_desc, layer, x_desc[0],
                                                             w_desc, weights, linear_layer, linear_filter_desc,
                                                             (void **) &linear_layer_filter));
            checkCudnnErrors(
                    cudnnGetFilterNdDescriptor(linear_filter_desc, 3, &data_type, &format, &nb_dim, filter_dim));
            weightsCounts[linearIndex] = filter_dim[0] * filter_dim[1] * filter_dim[2];
            linearLayerFilters[linearIndex] = linear_layer_filter;

            // bias
            if(biasMode!=CUDNN_RNN_NO_BIAS){
                checkCudnnErrors(cudnnCreateFilterDescriptor(&linear_bias_desc));
                checkCudnnErrors(cudnnGetRNNLinLayerBiasParams(cudnnHandle, rnn_desc, layer,
                                                               x_desc[0], w_desc, weights, linear_layer, linear_bias_desc,
                                                               (void **) &linear_bias));
                checkCudnnErrors(cudnnGetFilterNdDescriptor(linear_bias_desc, 3, &data_type, &format, &nb_dim, filter_dim));
                biasCounts[linearIndex] = filter_dim[0] * filter_dim[1] * filter_dim[2];
                linearBiases[linearIndex] = linear_bias;
            }else{
                biasCounts[linearIndex] = 0;
                linearBiases[linearIndex] = nullptr;
            }

            ++linearIndex;

            checkCudnnErrors(cudnnDestroyFilterDescriptor(linear_filter_desc));
            if(biasMode!=CUDNN_RNN_NO_BIAS){
                checkCudnnErrors(cudnnDestroyFilterDescriptor(linear_bias_desc));
            }

        }
    }

AakankshaS · June 30, 2021, 6:33pm

Hi @cqiao0 ,
Please allow us sometime. We are checking on this.

Thanks!

randers · July 8, 2021, 8:40am

HI @cqiao0 , could you please provide the cudnn API log (Developer Guide :: NVIDIA Deep Learning cuDNN Documentation). I would expect weight_size returned by cudnnGetRNNParamsSize() to be equal the sum of weightsCounts and biasCounts over all linearIndex.
Also the weight_size should be different for NO_BIAS and DOUBLE_BIAS cases

Regards,
Roman

cqiao0 · July 19, 2021, 3:53am

Using cudnnSetRNNDescriptor_v8, there are no problems. thanks!

system · September 17, 2021, 3:54am

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What's the shape of W of RNNs in cudnn? GPU-Accelerated Libraries	1	836	April 22, 2017
cudnnGetRNNLinLayerMatrixParams parameter question cuDNN	0	608	December 7, 2018
what is the params mean in cudnnRNNForwardInference(), and how can I use it ? cuDNN	0	614	March 5, 2019
Weightspace parameters and organisation cuDNN	1	570	September 22, 2021
Why two biases in RNN? cuDNN cudnn	0	115	June 18, 2024
When I use cudnnGetRNNWorkspaceSize，segmentation fault GPU-Accelerated Libraries	0	649	May 23, 2017
cudnnRNNForward() issue cuDNN cuda	5	1228	August 4, 2022
[SOLVED] How to correctly set y and dy in cudnnRNNBackwardData+cudnnRNNBackwardWeights ? cuDNN	4	1018	October 15, 2018
cudnn Rnn : about cudnnRNNInputMode_t? cuDNN	1	861	August 9, 2018
Which parameter is bad cuDNN	4	108	September 15, 2024

The number of network parameters calculated by the two methods is inconsistent

Related topics