How can I use createExecutionContextWithoutDeviceMemory()/IExecutionContext::setDeviceMemory()

yjkim2 · April 7, 2021, 9:26am

Hi, all

I am here for asking how to use these two API “createExecutionContextWithoutDeviceMemory()” and “IExecutionContext::setDeviceMemory()” on xavier.

I wrote following codes to use “createExecutionContextWithoutDeviceMemory()”, but it won’t work.

Try #1

IGpuAllocator* allocator;
void* memory;
...
cudaSetDevice(0);
runtime->setGpuAllocator(allocator);
uint64_t memory_size = engine->getDeviceMemorySize();
uint64_t alignment = 512;
memory = allocator->allocate(memory_size, alignment, 0);
context->setDeviceMemory(memory);
...

However, I got Segmentation fault on the line ‘memory = allocator->allocate(memory_size, alignment, 0);’.

==========================================

(I get the value of alignment by cudaGetDeviceProp, which equals to texture memory alignment)
(As a side note, above d_context was supposed to process several inferences on DLA)

==========================================

In TensorRT document, it said
“The memory must be aligned with cuda memory alignment property (using cudaGetDeviceProperties()), and its size must be at least that returned by getDeviceMemorySize(). Setting memory to nullptr is acceptable if getDeviceMemorySize() returns 0.”, here.

However, I couldn’t fully understand what it means or how to set device memory on the context.

Could you show me some example codes for allocating device memory on the context?

or, any help will also be very appreciated.

yjkim.

p.s. I am working on this sample code TensorRT_sample.zip from here.

AastaLLL · April 8, 2021, 3:10am

Hi,

Based on the discussion below, could you try to call the cudaSetDevice first?

https://github.com/NVIDIA/TensorRT/issues/219#issuecomment-559249117

Thanks.

yjkim2 · April 8, 2021, 5:04am

Hi, @AastaLLL .

Thanks for the reply.

I solved this problem by using cudaMalloc.

	cudaMalloc(&memory, engine->getDeviceMemorySize());
	context->setDeviceMemory(memory);

Thanks.

yjkim

Topic		Replies	Views
Deallocate memory assigned using IExecutionContext TensorRT tensorrt	7	663	May 10, 2023
What exactly TensorRT does when calling ExecutionContext::SetDeviceMemory() Jetson AGX Xavier tensorrt	2	1199	October 18, 2021
context_->setDeviceMemory(); segment fault TensorRT	3	785	January 4, 2022
nvinfer1::ICudaEngine::createExecutionContextWithoutDeviceMemory() returns nullptr! TensorRT	5	1300	October 13, 2020
Why IExecutionContext::SetDeviceMemory() takes longer time when the context belongs to DLA Jetson Xavier NX tensorrt	8	1590	October 18, 2021
Does cudasetdevice() allocate memory ？ CUDA Programming and Performance	2	126	July 5, 2024
mEngine->createExecutionContextWithoutDeviceMemory() crashed TensorRT tensorrt	1	363	January 28, 2021
Got out of memory from cudaMemcpy CUDA Programming and Performance	13	4094	January 28, 2022
TensorRt 6 - IExecutionContext->execute cause GPU memory leak TensorRT	9	1553	October 12, 2021
TensorRT engine context use mem TensorRT tensorrt	5	1258	July 5, 2022

How can I use createExecutionContextWithoutDeviceMemory()/IExecutionContext::setDeviceMemory()

Related topics