DriverAPI: Problem with cuDeviceTotalMem

MartyMcFly · January 26, 2012, 1:45pm

Hello,

I’m trying to get the total amount of device memory through the driver API in CUDA 4.0, called from C# via interop (Tesla C2075, 6 GB).
To do that I’m using the cuDeviceTotalMem function, which I load from nvcuda.dll with DllImport.
The problem is that the result is always 4.2 GB. If I switch to cuDeviceTotalMem_v2, everything works and the full 6 GB is returned in the bytes result parameter.
What is happening here? Can I still rely on all my DllImported methods, or do I have to switch every method to …_v2?
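A 32-bit byte count cannot represent 6 GB, which would explain a reading of about 4.2 GB. The sketch below only illustrates that limit. It assumes the unsuffixed export still writes an unsigned 32-bit value (as the legacy prototype in cuda.h appears to) and that the driver saturates rather than wraps; neither point is confirmed in this thread. The helper names clamp_to_u32, to_gb and print_demo are hypothetical.

```c
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper: squeeze a 64-bit byte count into a 32-bit
 * out-parameter. ASSUMPTION: the legacy entry point saturates at
 * UINT32_MAX instead of wrapping; the thread does not confirm this. */
uint32_t clamp_to_u32(uint64_t bytes)
{
    return bytes > UINT32_MAX ? UINT32_MAX : (uint32_t)bytes;
}

/* Convert a byte count to decimal gigabytes for display. */
double to_gb(uint64_t bytes)
{
    return (double)bytes / 1e9;
}

/* Print the same nominal 6 GB amount as seen through a 64-bit and a
 * 32-bit result parameter. */
void print_demo(void)
{
    uint64_t actual = 6ULL * 1024 * 1024 * 1024; /* nominal 6 GB board */
    printf("64-bit count: %.2f GB\n", to_gb(actual));
    printf("32-bit count: %.2f GB\n", to_gb(clamp_to_u32(actual)));
}
```

Calling print_demo() from a main function shows the two figures side by side. The largest value a 32-bit count can hold is 4294967295 bytes, roughly 4.29 GB.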

If I look at the code in cuda.h, it seems the problem should already be solved, because cuDeviceTotalMem (for example) is aliased to cuDeviceTotalMem_v2, but it doesn’t work:

#if defined(__CUDA_API_VERSION_INTERNAL) || __CUDA_API_VERSION >= 3020
#define cuDeviceTotalMem cuDeviceTotalMem_v2
#define cuCtxCreate cuCtxCreate_v2
#define cuModuleGetGlobal cuModuleGetGlobal_v2
#define cuMemGetInfo cuMemGetInfo_v2
#define cuMemAlloc cuMemAlloc_v2
#define cuMemAllocPitch cuMemAllocPitch_v2
#define cuMemFree cuMemFree_v2
#define cuMemGetAddressRange cuMemGetAddressRange_v2
#define cuMemAllocHost cuMemAllocHost_v2
#define cuMemHostGetDevicePointer cuMemHostGetDevicePointer_v2
#define cuMemcpyHtoD cuMemcpyHtoD_v2
#define cuMemcpyDtoH cuMemcpyDtoH_v2
#define cuMemcpyDtoD cuMemcpyDtoD_v2
#define cuMemcpyDtoA cuMemcpyDtoA_v2
#define cuMemcpyAtoD cuMemcpyAtoD_v2
#define cuMemcpyHtoA cuMemcpyHtoA_v2
#define cuMemcpyAtoH cuMemcpyAtoH_v2
#define cuMemcpyAtoA cuMemcpyAtoA_v2
#define cuMemcpyHtoAAsync cuMemcpyHtoAAsync_v2
#define cuMemcpyAtoHAsync cuMemcpyAtoHAsync_v2
#define cuMemcpy2D cuMemcpy2D_v2
#define cuMemcpy2DUnaligned cuMemcpy2DUnaligned_v2
#define cuMemcpy3D cuMemcpy3D_v2
#define cuMemcpyHtoDAsync cuMemcpyHtoDAsync_v2
#define cuMemcpyDtoHAsync cuMemcpyDtoHAsync_v2
#define cuMemcpyDtoDAsync cuMemcpyDtoDAsync_v2
#define cuMemcpy2DAsync cuMemcpy2DAsync_v2
#define cuMemcpy3DAsync cuMemcpy3DAsync_v2
#define cuMemsetD8 cuMemsetD8_v2
#define cuMemsetD16 cuMemsetD16_v2
#define cuMemsetD32 cuMemsetD32_v2
#define cuMemsetD2D8 cuMemsetD2D8_v2
#define cuMemsetD2D16 cuMemsetD2D16_v2
#define cuMemsetD2D32 cuMemsetD2D32_v2
#define cuArrayCreate cuArrayCreate_v2
#define cuArrayGetDescriptor cuArrayGetDescriptor_v2
#define cuArray3DCreate cuArray3DCreate_v2
#define cuArray3DGetDescriptor cuArray3DGetDescriptor_v2
#define cuTexRefSetAddress cuTexRefSetAddress_v2
#define cuTexRefSetAddress2D cuTexRefSetAddress2D_v2
#define cuTexRefGetAddress cuTexRefGetAddress_v2
#define cuGraphicsResourceGetMappedPointer cuGraphicsResourceGetMappedPointer_v2
#endif /* __CUDA_API_VERSION_INTERNAL || __CUDA_API_VERSION >= 3020 */

Thanks
Martin

tmurray · January 26, 2012, 6:00pm

You need to use the latest version for all function calls. Most of the time that’s a _v2, though we might have a _v3 at some point (if we don’t already). You can’t really mix and match.

MartyMcFly · January 27, 2012, 9:12am

Hi tmurray,

Thanks for your answer, but doesn’t the code in my post show that the _v2 or even _v3 versions are aliased to the original name without the _vx suffix?

Thanks

Martin

red-ray · January 27, 2012, 8:15pm

I had the same issue and wrote the following:

#define   cuda_text( name )   #name

cuda_text( cuDeviceTotalMem )

The #define cuDeviceTotalMem cuDeviceTotalMem_v2 in the header means you end up with a string of “cuDeviceTotalMem_v2”.
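The idea of letting the header supply the exported name can be sketched as follows. One caveat: in standard C the # operator stringifies its operand without macro-expanding it, so a single-level macro yields the unsuffixed name unless it is itself invoked through another macro. The usual two-level form expands the alias first. The #define of cuDeviceTotalMem here is a local stand-in for the one in cuda.h, and CUDA_TEXT_RAW, CUDA_TEXT, raw_name and exported_name are hypothetical names chosen for illustration.

```c
/* Local stand-in for the alias that cuda.h provides (CUDA 3.2 and later). */
#define cuDeviceTotalMem cuDeviceTotalMem_v2

/* One level: '#' stringifies the argument exactly as written. */
#define CUDA_TEXT_RAW(name) #name

/* Two levels: the argument is macro-expanded first, then stringified. */
#define CUDA_TEXT(name) CUDA_TEXT_RAW(name)

/* Name as spelled in source code, with no alias applied. */
const char *raw_name(void)
{
    return CUDA_TEXT_RAW(cuDeviceTotalMem);
}

/* Name after the header alias has been applied; this is the string
 * to pass to a symbol lookup. */
const char *exported_name(void)
{
    return CUDA_TEXT(cuDeviceTotalMem);
}
```

The string from exported_name() is what a loader call such as GetProcAddress would need. From C#, where the C preprocessor never runs, the equivalent would presumably be to name the suffixed export explicitly through the EntryPoint field of DllImport.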
