bypass wddm restrictions how to allocate memory for a huge 3d texture

Tobi84 · January 11, 2011, 8:03am

Hi,

System: Win 7 x64
System Memory: 12 GB
CPU: Core i7 920 @ 2.67 GHz
Chipset: x58

Cuda Device: Quadro FX 5800 and Tesla C1060 each with 4 GB of Memory
Toolkit Version: 3.2
SDK Version: 3.2
C++: MS Visual Studio 2008 SP1

Same configuration also running in Ubuntu 10.04 LTS x64

I need to visualize some big amount of volume data using a 3D texture. The volume right now has a size of 2.5 GB but
there will be more volumes of even bigger size in the near future to be visualized.
Using Ubuntu my application works as expected and i am able to use the full amount of memory the cuda devices provide.

Unfortunately under windows the wddm throws a monkey wrench in my plans…

Using nvidia-smi I was able to activate the compute only mode for the tesla cards and can now allocate the full amount
of memory on these bords. But i really need to be able to also use the full memory size of the quadro card.

Is it possible to bypass wddm on quadro or gforce boards and allocate more memory than wddm allows?
I know it is possible to allocate more memory in smaller pieces (eg. allocate 3 times 1 GB instead of allocating 1 time 3GB which wddm permits…)
so if i need 2.5 GB for the volume data. is it in any way possible to allocate this memory in smaller parts and after that map the texture reference
to this memory parts, having the tex refernce behave as normal?
Is the upper limit of allocating memory exactly the same in all cases (somehow defined in the driver) or is it different, depending on how many
device memory a specific card provides and, if this is the case, is it a fixed percental value or can i somehow check for the max value allowed in my code?

Hopefully someone here can help me out of this situation (and maybe even explain to me what the sense of this restriction of allocating memory is).

Thanks and kind regards,

Tobi

Tobi84 · January 17, 2011, 1:21pm

Hi,

i was just wondering… Is it not possible what i was asking or is the question that stupid?
I would appreciate any advice or hint in this case…
I just need to find a way to use textures of more than 3 GByte on a non Tesla device under windows 7.

Thanks
Tobi

ParallelPoint · February 1, 2011, 6:33am

I’m curious. What was the most memory you were able to allocate in one call?

Tobi84 · February 1, 2011, 12:16pm

Hi,

as some weeks passed since I last tried on the windows machine i am not
absolutely sure at the moment but my last note on one of my scribbling paper
says it was about 1798045696 bytes. so round about 1.6 GByte.
Ya,I’m quite sure that is the maximum value i reached…

[Edit]:

Here you can find the detailed list of tries. In another post i have made because
i didn’t know that it was related to the vista wddm on that time…

http://forums.nvidia.com/index.php?showtopic=189642&st=0&p=1170285&#entry1170285

ParallelPoint · February 1, 2011, 7:18pm

Like you, I’m trying to figure out what the “magic number” is for single memory allocations. Things wouldn’t be bad if we had a general formula so we were aware when developing or moving code to different cards.

I posted this in another thread, and I thought I was making sense, but in your case you have 12 GB of system memory and you were only able to do ~1.6 GB allocations. (Which makes what I said wrong). I really wish they would clarify how the allocation limit is decided and not black boxing everything.

ParallelPoint · February 1, 2011, 7:32pm

Ok, so I’m really frustrated by the confusion so I did a little more digging.

It looks like there is a flag in the driver that sets the segment commmit limit, and there is also a global commit limmit. I’m thinking memory allocations are capped to whichever limit is lower.

Section of interest:

“The video memory manager will not allow the per segment nor the global commit limit to be violated. If a particular segment as a commit limit of 1 GB but the global commit limit is 256 MB, the video memory manager will not allow more than 256 MB of system memory to be mapped into that segment.”

Now the question still is… how do we know what those values are set to ahead of time?

Microsoft Reference Document

Commit limit on aperture segment

The amount of system memory that the video memory manager will allow to be pinned down (that is, memory mapped through an aperture segment) for GPU usage at any given instant. The total amount of system memory allocated for the GPU might exceed the commit limit greatly; however, the video memory manager will ensure that only up to a commit limit amount is actually resident in an aperture segment at any one time.

By default, the commit limit on a particular aperture segment is the size of that segment. The driver can specify a different commit limit when it enumerates the segment by using the DXGK_SEGMENTDESCRIPTOR.CommitLimit field. A commit limit specified in such a way applies only to the particular segment being enumerated.

In addition to per segment commit limit, there is a global commit limit on all aperture segment. This global commit limit is also referred to as shared system memory. This value is computed by the video memory manager but can be reduced to a lower value by the driver using the DXGK_DRIVERCAPS. ApertureSegmentCommitLimit capability bits; however, this is not recommended.

The video memory manager will not allow the per segment nor the global commit limit to be violated. If a particular segment as a commit limit of 1 GB but the global commit limit is 256 MB, the video memory manager will not allow more than 256 MB of system memory to be mapped into that segment.

Topic		Replies	Views
cudaMalloc3DArray out of memory can not allocate the available amount of memory CUDA Programming and Performance	3	1885	January 31, 2011
cuMemAlloc limited to 1/4 total GPU memory? CUDA Programming and Performance	10	12967	April 1, 2010
How much GPU memory can cudaMalloc get? CUDA Programming and Performance	17	15725	April 2, 2022
GTX580 3GB memory from Palit has any problems? CUDA Programming and Performance	13	7533	March 3, 2011
maximum allocation size on windows 7 CUDA Programming and Performance	2	11675	April 1, 2010
cudaMalloc Incorrectly Reporting out of Memory Can't allocate more than 1151 MB at a time. CUDA Programming and Performance	6	2765	February 1, 2011
Max cudaMalloc under Windows 7 is there a restriction? CUDA Programming and Performance	1	2007	November 4, 2009
Is there a maximum number of graphics driver allocations of GPU memory in CUDA?(there is a limit in Vulkan, OpenGL) CUDA Programming and Performance	0	349	December 21, 2023
Maximal allocatable memory block 1.7 GB is the limit? CUDA Programming and Performance	4	9870	November 18, 2009
Can't allocate huge memory blocks - PAGING_BUFFER_SEGMENT_SIZE? CUDA Programming and Performance	0	1808	February 14, 2013

bypass wddm restrictions how to allocate memory for a huge 3d texture

Related topics