Transpose with cublasDgeam routine for row major ordered rectangular matrix

balkan88 · October 28, 2016, 11:40am

Hey all,

I am learning C++ and CUDA and am having difficulty picking the right parameters for the routine.

I have a row-major matrix of size M*N where M > N. I would like to transpose it at the beginning of the code so that I can work with the cuSOLVER and cuBLAS libraries, which expect column-major matrices. But even the cuBLAS routine for transposition works on column-major matrices, and I am somehow passing the wrong parameters to it, which causes an invalid value error. I could not work out the right order of the parameters.
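To make the layout concrete: a row-major M-by-N buffer holds the same bytes as a column-major N-by-M matrix with leading dimension N, so getting the column-major M-by-N matrix amounts to one out-of-place transpose. Below is a minimal CPU-only sketch in plain C that uses geam-style arguments (m, n, lda, ldc). The name transpose_ref is hypothetical and not part of cuBLAS, and the cuBLAS call shown in the comment (including the buffer names d_in and d_out) is an assumption about what should correspond to it, not something tested on this setup.

```c
#include <assert.h>
#include <stddef.h>

/* CPU reference for C = A^T in column-major storage (hypothetical helper,
 * not a cuBLAS function).
 *   C is m x n with leading dimension ldc (ldc >= m).
 *   A is n x m with leading dimension lda (lda >= n).
 * For a row-major M x N input V, call it with m = M, n = N, lda = N, ldc = M.
 * The matching out-of-place cuBLAS call would presumably be:
 *   cublasDgeam(handle, CUBLAS_OP_T, CUBLAS_OP_N, M, N,
 *               &alpha, d_in, N, &beta, d_out, M, d_out, M);
 * with alpha = 1.0, beta = 0.0 and d_out a separate device buffer. */
static void transpose_ref(int m, int n,
                          const double *A, int lda,
                          double *C, int ldc)
{
    assert(lda >= n && ldc >= m);  /* leading dimensions must cover the rows */
    assert(A != C);                /* out-of-place only */

    for (int j = 0; j < n; ++j) {
        for (int i = 0; i < m; ++i) {
            /* column-major element (i, j) of C = column-major element (j, i) of A */
            C[(size_t)i + (size_t)j * ldc] = A[(size_t)j + (size_t)i * lda];
        }
    }
}
```

With M = 3, N = 2 and V = {1, 2, 3, 4, 5, 6} (rows [1 2], [3 4], [5 6]), calling transpose_ref(3, 2, V, 2, out, 3) fills out with {1, 3, 5, 2, 4, 6}, i.e. the columns of the same matrix laid end to end.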

This is the output on the screen:

GPU Device 0: "GeForce GTX 1080" with compute capability 6.1

Cuda environment is starting...
factoring for k=0:
cublasSafeCall() failed at ../src/transpose.cu:40 : CUBLAS_STATUS_INVALID_VALUE

This is the code:

#include "transpose.h"

#include "cublas_v2.h"
#include <cuda_runtime.h>

#include <stdlib.h>
#include <stdio.h>
#include <assert.h>

#include "errorChkcublas.h"

void trans(double * V,
           int M,
           int N)
{
    int lda = N;
    int ldb = M;
    int ldc = N;
    cublasHandle_t handle;
    cublasStatus_t status;

    //double * clone;
    //clone = V;

    //cudaMalloc((void **)&clone , M * N * sizeof(float));

    status = cublasCreate(&handle);

    if (status != CUBLAS_STATUS_SUCCESS)
        {
            printf("cublasCreate returned error code %d, line(%d)\n", status, __LINE__);
            exit(EXIT_FAILURE);
        }
    const double alf = 1.0;
    const double bet = 0.0;
    const double *alpha = &alf;
    const double *beta = &bet;

    CublasSafeCall(cublasDgeam( handle, CUBLAS_OP_T, CUBLAS_OP_N, M, N, alpha, V, lda, beta, V, ldb, V, ldc));
    cudaDeviceSynchronize();

    cublasDestroy(handle);
}

Instead of creating an auxiliary matrix and copying V into it, which is costly, I passed V as the 10th parameter of the routine. Would that be a problem?

Thanks in advance for help!!
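On the CUBLAS_STATUS_INVALID_VALUE itself: as far as the cuBLAS documentation for geam describes it, C is m-by-n with ldc >= max(1, m), each input needs a leading dimension covering its stored rows, and in-place use (C aliasing A or B) is accepted only when the aliased operand is not transposed and its leading dimension equals ldc. The sketch below restates those rules as a plain C check. The helper geam_check and its return codes are hypothetical, not cuBLAS API, and the order of the checks is arbitrary; whether cuBLAS applies the B-related checks when beta is zero is not something this sketch settles.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of cuBLAS): restates the geam argument rules
 * as read from the cuBLAS documentation. Returns GEAM_OK if the arguments
 * look acceptable, otherwise a code naming the first rule that is violated. */
enum { GEAM_OK = 0, GEAM_BAD_LDA, GEAM_BAD_LDB, GEAM_BAD_LDC,
       GEAM_BAD_ALIAS_A, GEAM_BAD_ALIAS_B };

static int max_int(int a, int b) { return a > b ? a : b; }

/* transa / transb: 0 means "no transpose", nonzero means "transpose". */
static int geam_check(int transa, int transb, int m, int n,
                      const double *A, int lda,
                      const double *B, int ldb,
                      const double *C, int ldc)
{
    assert(m >= 0 && n >= 0);     /* precondition of this sketch */

    int rowsA = transa ? n : m;   /* rows of A as stored in memory */
    int rowsB = transb ? n : m;   /* rows of B as stored in memory */

    if (lda < max_int(1, rowsA)) return GEAM_BAD_LDA;
    if (ldb < max_int(1, rowsB)) return GEAM_BAD_LDB;
    if (ldc < max_int(1, m))     return GEAM_BAD_LDC;

    /* In-place is only allowed without transposition and with equal strides. */
    if (A == C && (transa || lda != ldc)) return GEAM_BAD_ALIAS_A;
    if (B == C && (transb || ldb != ldc)) return GEAM_BAD_ALIAS_B;

    return GEAM_OK;
}
```

For the call in the post with, say, M = 5 and N = 3, geam_check(1, 0, 5, 3, V, 3, V, 5, V, 3) stops at ldc: C is M-by-N, so ldc = N is smaller than m = M. Setting ldc = M is not enough on its own, because V is also passed as A with CUBLAS_OP_T, which the in-place rule rejects. A separate output buffer seems to be needed, after which the result can be copied back into V if required.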
