Deallocate memory assigned using IExecutionContext

Description

In my application, I load multiple TensorRT engines and, depending on certain conditions, use one of them. I want to reduce memory consumption.
I see that I can use createExecutionContextWithoutDeviceMemory to defer memory allocation to a later time, but I don't see an option to deallocate the assigned memory. The only option I can find is to delete the current context and create a new one using the function mentioned above.
Is this the appropriate method, and if not, what alternative should I follow?

Thanks in advance!

Environment

Jetpack Version: 4.6 / 5.0.2
TensorRT Version: 8.0.1 / 8.4.1
GPU Type: Jetson NX
CUDA Version: 10.2 / 11.4
Operating System + Version: Ubuntu 18.04/20.04

Hi,

This looks like a Jetson issue. Please refer to the samples below in case they are useful.

For any further assistance, we will move this post to the Jetson-related forum.

Thanks!

Thanks for the reply!
Sorry, but as I understand it, this is not specifically a Jetson issue; it relates to TensorRT itself.
The same issue will occur on any other device with limited RAM.
The main issue is that I cannot find an API or class member that I could use to free up some memory.

The function createExecutionContextWithoutDeviceMemory allows memory to be allocated whenever we want, but I couldn't find a function to free that memory afterwards.

I believe you would use cudaMalloc and cudaFree. You can pass the allocated memory using IExecutionContext::setDeviceMemory(), and you can get the size of memory needed for the malloc using ICudaEngine::getDeviceMemorySize().
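For illustration, here is a minimal sketch of that flow. It assumes `engine` is an already-deserialized ICudaEngine*; error checking is omitted, and the function name, bindings, and stream are placeholders:

```cpp
#include <NvInfer.h>
#include <cuda_runtime_api.h>

void runThenReclaim(nvinfer1::ICudaEngine* engine)
{
    // Create a context that does NOT allocate its own scratch memory.
    nvinfer1::IExecutionContext* context =
        engine->createExecutionContextWithoutDeviceMemory();

    // Ask the engine how much device memory it needs during execution.
    size_t const scratchSize = engine->getDeviceMemorySize();

    // Allocate the block ourselves and hand it to the context
    // before any enqueue call.
    void* scratch = nullptr;
    cudaMalloc(&scratch, scratchSize);
    context->setDeviceMemory(scratch);

    // ... run inference here, e.g. context->enqueueV2(bindings, stream, nullptr) ...

    // When this engine goes idle, free the scratch block to reclaim memory.
    // The context must not be used again until fresh memory is assigned
    // via setDeviceMemory().
    cudaFree(scratch);
}
```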

Thanks a lot. I had mistakenly assumed that this function would allocate the memory itself, but on checking the API reference I can see that we provide it with the required memory block.

I have one further question:
The description of the above-mentioned function says that the memory must be aligned with the CUDA memory alignment property. Would a memory block allocated using cudaMalloc (with the size given by getDeviceMemorySize()) satisfy this condition? I'm new to working with CUDA, so sorry if I'm asking something basic.

I believe cudaMalloc would give you the correct alignment (I think it would be cruel if it didn't, given that they don't provide any more information on how to align it properly). But do not, for example, cudaMalloc a large block of memory (p) and pass in an arbitrary sub-block of that memory (p + offset), unless you ensure that offset is a multiple of the proper alignment. To be honest, I'm not sure exactly what alignment requirement they are referring to; the cudaMalloc documentation (CUDA Runtime API :: CUDA Toolkit Documentation) just says "The allocated memory is suitably aligned for any kind of variable."
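As a sketch of that sub-block caveat: if you do carve regions out of one large cudaMalloc pool, round each interior offset up to a safe alignment first. The 256-byte value below is an assumption based on cudaMalloc's typical guarantee, not something the TensorRT docs state:

```cpp
#include <cstddef>

// Round `offset` up to the next multiple of `alignment` (a power of two).
constexpr size_t alignUp(size_t offset, size_t alignment)
{
    return (offset + alignment - 1) & ~(alignment - 1);
}

// Example: place a second region after `firstRegionBytes` of a pool `p`.
// Passing the pointer returned by cudaMalloc itself needs no adjustment;
// only interior offsets do.
void* secondRegion(char* p, size_t firstRegionBytes)
{
    size_t const kAlignment = 256;  // assumed safe CUDA alignment, not from TRT docs
    return p + alignUp(firstRegionBytes, kAlignment);
}
```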
