cudaMemGetInfo() how does it work?!?

Hello,

I am currently programming an acceleration structure on the GPU for raytracing.
There is a bug in my code, but I can't figure out where. The program runs, and the bug appears at a non-deterministic iteration.
Sometimes my structure is built 500 times without an unspecified launch failure, and sometimes only twice with the same options set.
I already know that something weird is written into my structure's memory; for instance, the splitting dimension of a node is 32012, even though it should be between 0 and 3 (leaf = 3).
The next thing I found is that in the first iteration the used memory of the graphics card is lower than the used memory in the second iteration. After the second iteration it stays constant.
So now my question: how does cudaMemGetInfo() determine how much memory is used?
Would it recognize if I wrote past the bounds of my allocated memory, or isn't that possible?
My guess is that when some function allocates memory, a global counter is incremented, and when cudaMemGetInfo() is called, this counter is used to report the memory usage?
The thing is, I don't allocate new memory during an iteration, so the memory usage shouldn't increase, but it does…
I am glad for any hint.

regards,
peter

I just added this code to my project:

cudaMem.cu

and call checkGpuMem() when needed.

#include <stdio.h>
#include "cuda.h"

extern "C"
void checkGpuMem()
{
    float free_m, total_m, used_m;
    size_t free_t, total_t;

    // Query free and total device memory in bytes.
    cudaMemGetInfo(&free_t, &total_t);

    // Convert bytes to MB.
    free_m  = (uint)free_t / 1048576.0;
    total_m = (uint)total_t / 1048576.0;
    used_m  = total_m - free_m;

    printf("mem free %zu .... %f MB  mem total %zu .... %f MB  mem used %f MB\n",
           free_t, free_m, total_t, total_m, used_m);
}

Contrary to popular belief, cuMemGetInfo() does not actually rely on magic. We ask the kernel mode driver how much memory has been allocated on the card. However, this will not look for out-of-bounds accesses or anything like that; what you want is cuda-memcheck or cuda-gdb (both are rightly considered miracles).
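To make the suggestion concrete: both tools wrap your existing binary, no recompilation required (though building with -g -G gives much better source-line reporting). The binary name ./raytracer below is hypothetical; substitute your own.

```shell
# Run under cuda-memcheck: it reports each out-of-bounds or misaligned
# device access with the kernel name, thread/block index, and address.
cuda-memcheck ./raytracer

# Or debug interactively; cuda-gdb stops at the faulting kernel instruction.
cuda-gdb ./raytracer
```

For a bug that only appears after hundreds of iterations, cuda-memcheck is usually the faster route, since it flags the first illegal access rather than the later crash it causes.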

The code posted by jam11 is defective on GPUs with greater than 4GB of memory and should not be used as-is in any CUDA code.