getting wrong results when calling cublas in coupling with C++/CLI and C#

afshiinzkh · April 26, 2016, 2:38pm

I have written a wrapper in C++11/CLI with Visual Studio to use CUDA’s CuBLAS. I am using CUDA Toolkit 7.0.

Here is the source code of my wrapper:

#pragma once

#include "stdafx.h"
#include "BLAS.h"
#include "cuBLAS.h"

namespace lab
{
    namespace Mathematics
    {
	    namespace CUDA
	    {
		   
		    void BLAS::DAXPY(int n, double alpha, const array<double> ^x, int incx, array<double> ^y, int incy)
		    {
			    pin_ptr<double> xPtr = &(x[0]);
				pin_ptr<double> yPtr = &(y[0]);
     			pin_ptr<double> alphaPtr = α

		    	cuBLAS::DAXPY(n, alphaPtr, xPtr, incx, yPtr, incy);
		    }
       }
   }
}

To test this code, I wrote the following test in C#:

using System;
using Microsoft.VisualStudio.TestTools.UnitTesting;
using System.Linq;
using lab.Mathematics.CUDA;

namespace lab.Mathematics.CUDA.Test
{
  [TestClass]
  public class TestBLAS
  {
    [TestMethod]
    public void TestDAXPY()
    {
        var count = 10;
        var alpha = 1.0;
        var a = Enumerable.Range(0, count).Select(x => Convert.ToDouble(x)).ToArray();
        var b = Enumerable.Range(0, count).Select(x => Convert.ToDouble(x)).ToArray();

        // Call CUDA
        BLAS.DAXPY(count, alpha, a, 1, b, 1);

        // Validate results
        for (int i = 0; i < count; i++)
        {
            Assert.AreEqual(i + i, b[i]);
        }
    }
  }
}

The program compiles with x64 architecture with no error. But the results I get are different every time I run the test. More precisely, the array b is the result and it has different values every time. And I don’t know why.

I am Also adding my cuda code maybe there, someone can find a problem. note that I don’t get any error, warning whatsoever while compiling. I am also wondering maybe I have to do some changes in the compilation while I did nothing and used the default options.

void cuBLAS::DAXPY(int n, const double *alpha, const double *x, int incx, double *y, int incy)
		{
			// Allocate GPU memory
			double *devX, *devY;
			cudaMalloc((void **)&devX, (size_t)n*sizeof(*devX));
			cudaMalloc((void **)&devY, (size_t)n*sizeof(*devY));

			// Create cuBLAS handle
			cublasHandle_t handle;
			cublasCreate(&handle);

			// Initialize the input matrix and vector
			cublasSetVector(n, sizeof(*devX), x, incx, devX, incx);

			// Call cuBLAS function
			cublasDaxpy(handle, n, alpha, devX, incx, devY, incy);

			// Retrieve resulting vector
			cublasGetVector(n, sizeof(*devY), devY, incy, y, incy);

			// Free GPU resources
			cudaFree(devX);
			cudaFree(devY);
			cublasDestroy(handle);
		}

harryz · April 27, 2016, 2:52am

Hi afshiinzkh,

This is Nsight visual studio forum, for cuda programming question you can ask it at CUDA Programming and Performance forum, for cublas queston you can ask it at GPU-Accelerated Libraries forum.

Best Regards

Topic		Replies	Views
Incorrect results when using cublas matrix multiplication GPU-Accelerated Libraries	1	1513	April 28, 2016
CUBLAS problems CUDA Programming and Performance	10	10309	July 1, 2009
Compiling CUBLAS in VS2005 CUDA Programming and Performance	3	2356	November 3, 2014
Help with NVCC and cuBLAS problem CUDA Programming and Performance	2	2012	July 9, 2010
cublasZgemm() gives false result for large data and potential bug GPU-Accelerated Libraries	6	1149	October 12, 2021
Cublas fp8 cublasLtMatmulAlgoGetHeuristic returns 0 - nvcc issue GPU-Accelerated Libraries cublas	1	632	July 21, 2023
CuBLAS documentation doesn't match the current implementation CUDA Setup and Installation	1	693	June 29, 2016
Nvlink error : Undefined reference to 'cublasZgemm_v2' in ******.obj' GPU-Accelerated Libraries cublas	19	1997	May 1, 2025
[Beginner] Math operations giving incorrect answers CUDA Programming and Performance	9	1440	November 3, 2010
Cuda produces wrong result CUDA Programming and Performance	5	2537	April 8, 2011

getting wrong results when calling cublas in coupling with C++/CLI and C#

Related topics