We’re using the Xavier for running inference on images. The network is built with the PyTorch framework, and the inference part uses the PyTorch C++ API.
Does anyone know if there are any performance gains from creating the tensor on the GPU instead of the CPU? AFAIK, on regular desktop computers improvements can be seen by doing this, but since the CPU and GPU share memory on the Xavier, maybe this is not worth pursuing.
Hi @kce, while it is true that the Jetson’s CPU/GPU share the same physical memory, you need to allocate the memory with cudaHostAlloc() and the cudaHostAllocMapped flag in order for the memory to be accessible from both the CPU and GPU address spaces.
For example, if you do just an ordinary malloc() call, that memory will only be accessible from the CPU. I don’t believe PyTorch supports the mapped memory allocation mentioned above, but it does support GPU memory (i.e. cudaMalloc()).
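For reference, here is a minimal sketch of what that mapped allocation looks like in plain CUDA (outside of PyTorch); the buffer size is just an example:

```cpp
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    const size_t bytes = 1920 * 1080 * 3;  // example: one 1080p RGB image

    // Pinned, mapped allocation: the same physical memory becomes visible
    // from both the CPU and GPU address spaces on Jetson
    void* host_ptr = nullptr;
    cudaError_t err = cudaHostAlloc(&host_ptr, bytes, cudaHostAllocMapped);
    if (err != cudaSuccess) {
        std::printf("cudaHostAlloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Get the device-side pointer that aliases the same memory
    void* dev_ptr = nullptr;
    cudaHostGetDevicePointer(&dev_ptr, host_ptr, 0);

    // ... fill host_ptr from the CPU, pass dev_ptr to kernels ...

    cudaFreeHost(host_ptr);
    return 0;
}
```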
Further, in order for PyTorch to utilize the GPU for processing the network during inference, the tensor needs to be on the GPU. So yes, the PyTorch tensor should be created on the GPU. Running inference on the CPU only would be much slower than using the GPU.
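For example, a rough sketch with the LibTorch C++ API (the model path and input shape below are placeholders):

```cpp
#include <torch/script.h>
#include <torch/torch.h>

int main() {
    // Load the TorchScript model and move its weights to the GPU
    torch::jit::script::Module module = torch::jit::load("model.pt");  // placeholder path
    module.to(torch::kCUDA);
    module.eval();

    // Create the input tensor directly on the GPU
    auto options = torch::TensorOptions().dtype(torch::kFloat32).device(torch::kCUDA);
    torch::Tensor input = torch::zeros({1, 3, 224, 224}, options);  // placeholder shape

    torch::NoGradGuard no_grad;
    torch::Tensor output = module.forward({input}).toTensor();
    return 0;
}
```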
I see, so it might not be as automated in PyTorch as I assumed. I will then try to create the tensor on the GPU first.
Also, to clarify: we do use the GPU for inference. It’s just that we create the tensor on the CPU and move it to the GPU, since we use OpenCV to load the image to be classified, and the version initially supplied with JetPack is not built with CUDA support as far as I know.
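For what it’s worth, that CPU-load-then-move pattern looks roughly like this with LibTorch and a non-CUDA OpenCV build (the scaling and layout details depend on the network, so treat them as placeholders):

```cpp
#include <opencv2/opencv.hpp>
#include <torch/torch.h>

torch::Tensor load_image_to_gpu(const std::string& path) {
    // Decode and preprocess on the CPU with a regular (non-CUDA) OpenCV build
    cv::Mat img = cv::imread(path, cv::IMREAD_COLOR);
    cv::cvtColor(img, img, cv::COLOR_BGR2RGB);
    img.convertTo(img, CV_32FC3, 1.0 / 255.0);  // scale to [0,1]; adjust to your network

    // Wrap the CPU buffer without copying, reorder NHWC -> NCHW,
    // then .to(torch::kCUDA) copies the data over to the GPU
    torch::Tensor t = torch::from_blob(img.data, {1, img.rows, img.cols, 3}, torch::kFloat32);
    return t.permute({0, 3, 1, 2}).contiguous().to(torch::kCUDA);
}
```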
I will try to build OpenCV from source with CUDA enabled and load images directly to the GPU.
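In case it helps, here is a sketch of what wrapping a cv::cuda::GpuMat as a GPU tensor might look like once OpenCV is built with CUDA. Note that cv::imread still decodes on the CPU, so only the later processing runs on the GPU, and the helper name and preprocessing steps are just illustrative:

```cpp
#include <opencv2/opencv.hpp>
#include <opencv2/core/cuda.hpp>
#include <torch/torch.h>

// Illustrative helper: expose a GpuMat's device memory as a torch::Tensor
torch::Tensor gpumat_to_tensor(const cv::cuda::GpuMat& img) {
    // GpuMat rows are normally pitched (padded), but from_blob expects
    // densely packed memory, so copy into a continuous device buffer first
    cv::cuda::GpuMat cont;
    cv::cuda::createContinuous(img.rows, img.cols, img.type(), cont);
    img.copyTo(cont);

    auto options = torch::TensorOptions().dtype(torch::kUInt8).device(torch::kCUDA);
    // Wrap the device pointer (no copy); clone() so the tensor owns its memory
    return torch::from_blob(cont.data, {img.rows, img.cols, img.channels()}, options).clone();
}

int main() {
    cv::cuda::GpuMat gpu_img;
    gpu_img.upload(cv::imread("frame.jpg", cv::IMREAD_COLOR));  // placeholder image

    // Convert to the NCHW float layout the network expects (adjust as needed)
    torch::Tensor input = gpumat_to_tensor(gpu_img)
                              .permute({2, 0, 1})
                              .unsqueeze(0)
                              .to(torch::kFloat32)
                              .div(255.0);
    return 0;
}
```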