Unexpected limit in cudaHostAlloc Failing to allocate large amounts of pinned/page-locked memory

Simon1 · November 25, 2010, 7:08pm

Hi,

I am trying to allocate large amounts (several GB … up to ~10GB) of pinned memory using cudaHostAlloc and I seem to hit an unexpected limit on some machines.

The system I am having problems with:

[*]Phenom II X6

[*]12GB DDR3 RAM

[*]GTX 480

[*]Windows 7 64bit

[*]cudatoolkit_3.2.16_win_64

[*]devdriver_3.2_winvista-win7_64_263.06_general

[*]gpucomputingsdk_3.2.16_win_64

[*]Parallel Nsight 1.5

[*]Visual Studio 2010

I expected to be able to allocate at least 8-10 GB of pinned/page-locked memory through CUDA but I seem to hit a limit at around 700 MB.

I tried allocating blocks of different sizes (e.g. all at once, many blocks of size 32, 64, 128MB …) but the limit seems to remain the same.

I also tried the latest end-user driver and some previous CUDA versions with the same effect.

My project is compiled for x64 using the v90 platform toolset.

I also followed this article to ensure that the operating system enforced limits for the non-paged pool are correct. (Process Explorer states 9.x GB as the the Nonpaged Limit)

On another machine with lower specs (PhenomX4, 4GB RAM, GTX 275, same software stack) I could at least manage to allocate around 1400MB of pinned memory which is not perfect but better.

I am trying to figure out what causes this limit and how to resolve or work around it.

Kind regards

Simon

Simon1 · November 25, 2010, 7:08pm

Hi,

I am trying to allocate large amounts (several GB … up to ~10GB) of pinned memory using cudaHostAlloc and I seem to hit an unexpected limit on some machines.

The system I am having problems with:

[*]Phenom II X6

[*]12GB DDR3 RAM

[*]GTX 480

[*]Windows 7 64bit

[*]cudatoolkit_3.2.16_win_64

[*]devdriver_3.2_winvista-win7_64_263.06_general

[*]gpucomputingsdk_3.2.16_win_64

[*]Parallel Nsight 1.5

[*]Visual Studio 2010

I expected to be able to allocate at least 8-10 GB of pinned/page-locked memory through CUDA but I seem to hit a limit at around 700 MB.

I tried allocating blocks of different sizes (e.g. all at once, many blocks of size 32, 64, 128MB …) but the limit seems to remain the same.

I also tried the latest end-user driver and some previous CUDA versions with the same effect.

My project is compiled for x64 using the v90 platform toolset.

I also followed this article to ensure that the operating system enforced limits for the non-paged pool are correct. (Process Explorer states 9.x GB as the the Nonpaged Limit)

On another machine with lower specs (PhenomX4, 4GB RAM, GTX 275, same software stack) I could at least manage to allocate around 1400MB of pinned memory which is not perfect but better.

I am trying to figure out what causes this limit and how to resolve or work around it.

Kind regards

Simon

crakinshot · December 6, 2010, 11:33am

I have the same problem. Any chance this is a bug? I have 12GB of RAM, Windows 7 64-bit. I’m using CUDA driver via CUDA.Net bindings. I’m not even getting to 1GB and I’m getting out of memory exceptions.

mfatica · December 6, 2010, 4:38pm

It is not a bug but a Windows Vista/7 “feature”.

From the release notes:

o The maximum size of a single allocation created by cudaMalloc or cuMemAlloc is limited to:
MIN ( ( System Memory Size in MB - 512 MB ) / 2, PAGING_BUFFER_SEGMENT_SIZE )
For Vista, PAGING_BUFFER_SEGMENT_SIZE is approximately 2GB.

Topic		Replies	Views
Out Of Memory Error Allocating large chunks (> 1GB) of pinned-memory fails CUDA Programming and Performance	3	5888	June 4, 2011
Max amount of host pinned memory available for allocation CUDA Programming and Performance	8	8373	February 4, 2021
amount of pinned memory CUDA Programming and Performance	17	12335	December 4, 2008
Change limit of 50% for cudaHostAlloc pinned memory on Windows 10/11 CUDA Programming and Performance	9	3145	September 19, 2022
estimate an upper limit for pinned memory (windows, linux) - how ? CUDA Programming and Performance	4	1687	September 5, 2017
Significant decrease of available page-locked memory at Win7 x64 vs. Win7 x32 CUDA Programming and Performance	3	5702	June 18, 2011
Arbitrary Device Limit On Pinned Host Memory CUDA Programming and Performance	8	2089	August 26, 2014
Pinned memory limit CUDA Programming and Performance	16	13690	May 1, 2016
Maximum of page-locked memory? CUDA Programming and Performance	2	5676	August 17, 2009
cudaHostAlloc can only allocate about 3.5GB of memory out of 128GB CUDA Programming and Performance	7	498	June 2, 2023

Unexpected limit in cudaHostAlloc Failing to allocate large amounts of pinned/page-locked memory

Related topics