Memory alloc limited to less than half available RAM

Omegaice88 · June 18, 2017, 12:58am

I am trying to use a tool that requires ~2GB of GPU Memory in a single allocated array. With a fresh install of the latest Jetpack and disabling xserver I am unable to allocate more than 1986MB with cudaMalloc and I am not sure why.

I wrote a simple test program and repeatedly changed the value of size until I am unable to allocate more.

#include <iostream>

#define CUDA_SAFE_CALL(call)						\
do {		                                                        \
	cudaError_t err = call;						\
	if (cudaSuccess != err) {				        \
		const char * errorString = cudaGetErrorString(err);	\
		fprintf(stderr,					        \
			"CUDA error in func '%s' at line %i : %d:%s.\n",\
			__FUNCTION__, __LINE__, err, errorString);	\
		throw std::runtime_error(errorString);			\
	}								\
} while (0)

int main(void) {
  void *x;
  size_t size = 1987*1048576l;
  CUDA_SAFE_CALL(cudaMalloc(&x, size));
  return 0;
}

To make sure that I am using as little RAM elsewhere I clear the buffers and cache and enabled an 8GB swap file just incase giving me the following stats:

ubuntu@tegra-ubuntu:~/$ free -m && sync && sudo /bin/sh -c 'echo 3 > /proc/sys/vm/drop_caches' && free -m
              total        used        free      shared  buff/cache   available
Mem:           3994         699        3083          21         212        3718
Swap:          8191           0        8191
              total        used        free      shared  buff/cache   available
Mem:           3994         201        3676          21         117        3718
Swap:          8191           0        8191

When I compile and run it, it fails:

ubuntu@tegra-ubuntu:~/$ nvcc -std=c++11 test.cu && ./a.out
nvcc warning : The 'compute_20', 'sm_20', and 'sm_21' architectures are deprecated, and may be removed in a future release (Use -Wno-deprecated-gpu-targets to suppress warning).
CUDA error in func 'main' at line 21 : 2:out of memory.
terminate called after throwing an instance of 'std::runtime_error'
  what():  out of memory
Aborted

Does anybody have any idea why I am unable to allocate more than 1987MB for an array when there is 3718MB of RAM available?

AastaLLL · June 19, 2017, 3:58am

Hi,

We will check this issue and update information to you soon.
Thanks.

AastaLLL · June 20, 2017, 2:35am

Hi,

This is a known limitation on L4T.
Currently, we limit maximal memory of one chunk to be the half size of the physical memory.
That is, in tx1, you can’t allocate GPU memory bigger than ~2G.

This limitation is removed in our next release.
Please wait for our announcement and update.

Thanks.

AastaLLL · July 25, 2017, 1:54am

Hi,

This fix is available now.
Please check JetPack SDK | NVIDIA Developer

Topic		Replies	Views
Cuda Memory Usage TX1 Jetson TX1	8	4524	December 16, 2015
GPU out of memory when the total ram usage is 2.8G Jetson TX2	28	18488	October 18, 2021
cuda-memcheck and available memory Jetson TX1	5	1331	March 7, 2018
Memory on DRAM CUDA Programming and Performance	6	2468	April 28, 2012
cudaMalloc3DArray out of memory can not allocate the available amount of memory CUDA Programming and Performance	3	1805	January 31, 2011
Cannot allocate "all" memory? cudaMalloc fails with 50MB memory left.. CUDA Programming and Performance	9	9569	July 15, 2008
cudaMalloc throwing error with CUT_SAFE_CALL CUDA Programming and Performance	0	1688	August 6, 2009
Memory Error in TX2 Jetson TX2	7	1665	October 18, 2021
Maximal allocatable memory block 1.7 GB is the limit? CUDA Programming and Performance	4	9773	November 18, 2009
Amount of memory available How much memory available to cudaMalloc? CUDA Programming and Performance	1	3515	April 25, 2007

Memory alloc limited to less than half available RAM

Related topics