When I run a TensorFlow model there is not enough memory. What should I do?

742824147 · November 2, 2017, 8:37am

When I run a segmentation model with TensorFlow on a Jetson TX2, an error occurs: there is not enough memory to keep running. What should I do?

AastaLLL · November 3, 2017, 3:21am

Hi,

It depends on whether the memory is needed by the GPU or the CPU.

On TX2, the memory the GPU can allocate is limited to 8G.
But if the memory is needed by the CPU, you can try adding some swap memory.

# Create a swapfile for Ubuntu at the current directory location
fallocate -l 8G swapfile
# List out the file
ls -lh swapfile
# Change permissions so that only root can use it
chmod 600 swapfile
# List out the file
ls -lh swapfile
# Set up the Linux swap area
mkswap swapfile
# Now start using the swapfile
sudo swapon swapfile
# Show that it's now being used
swapon -s
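
To undo this later, the steps are roughly the reverse: turn the swapfile off, then delete it. The sketch below is a minimal example, not an official procedure. `swap_is_active` is a hypothetical helper name, and its second argument exists only so the check can be tried against a saved copy of /proc/swaps.

```shell
# Hypothetical helper: succeeds if the given path is listed as active swap.
# The optional second argument is an assumption added for trying the check on a copy of /proc/swaps.
swap_is_active() {
    target="$1"
    swaps_file="${2:-/proc/swaps}"
    # Skip the header line, then look for an exact match in the Filename column.
    awk -v t="$target" 'NR > 1 && $1 == t { found = 1 } END { exit !found }' "$swaps_file"
}

# Typical use on the device, from the directory that holds the swapfile (swapoff needs root):
#   swap_is_active "$(pwd)/swapfile" && sudo swapoff "$(pwd)/swapfile"
#   rm swapfile
```

Once the file is switched off and removed, `swapon -s` should no longer list it.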

Thanks.

dbrownrxvs0 · November 5, 2017, 2:31am

Which model is it throwing this on? I’ve been doing research on TX2s and TensorFlow, and am hearing murmurs that some models play well and others don’t, but I don’t yet know what the root of that is (if it’s valid).

I assume you’re talking about just the GPU as well. Trying to run TensorFlow on a swap file is… a choice… that I wouldn’t ever recommend.

AastaLLL · November 6, 2017, 1:59am

Hi dbrownrxvs0,

Thanks for your reply.

If the whole network is allocated on the GPU, swap is no help. TX2 GPU memory is limited to 8G.
But if you check the TensorFlow model placement, sometimes several layers are put on the CPU even though you are in GPU mode.
In this case, swap may help (but this is still network-dependent).

That’s why we always recommend giving swap a try.
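
One rough way to see whether swap is actually being touched while the model runs is to watch the swap counters in /proc/meminfo. This is only a sketch: `swap_used_kb` is a hypothetical helper name, and its optional argument exists only so the function can be tried on a saved copy of /proc/meminfo.

```shell
# Hypothetical helper: print how many kB of swap are currently in use.
# The optional argument is an assumption added for trying it on a copy of /proc/meminfo.
swap_used_kb() {
    meminfo="${1:-/proc/meminfo}"
    # SwapTotal and SwapFree are reported in kB; their difference is the swap in use.
    awk '/^SwapTotal:/ { total = $2 } /^SwapFree:/ { unused = $2 } END { print total - unused }' "$meminfo"
}

# Print swap usage once per second while the model runs in another terminal (Ctrl-C to stop):
#   while true; do swap_used_kb; sleep 1; done
```

If the number stays at 0 while the process is killed, the shortage is probably not one that swap can relieve.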
Thanks.

a_phan · December 19, 2017, 8:44am

I would like to add to AastaLLL’s answer:

When there is not enough memory and you don’t have swap, TensorFlow will exit with a “Killed” message.

When swap is on, you should also add this to /etc/sysctl.conf: vm.min_free_kbytes=65536 (to keep at least 6% of total memory / number of cores free).

The reason is that sometimes, when memory is almost full, the system still tries to load swapped data back into memory, which causes a hang or freeze.
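
Adding the setting can be sketched as below. This is a minimal example under assumptions: `ensure_min_free_kbytes` is a hypothetical helper name, and it only appends the line when no vm.min_free_kbytes entry exists yet, so an existing value is never silently overwritten.

```shell
# Hypothetical helper: append vm.min_free_kbytes to a sysctl-style file unless it is already set there.
ensure_min_free_kbytes() {
    conf_file="$1"
    value="$2"
    if grep -q '^vm\.min_free_kbytes' "$conf_file" 2>/dev/null; then
        echo "vm.min_free_kbytes is already set in $conf_file; edit it by hand" >&2
        return 1
    fi
    printf 'vm.min_free_kbytes=%s\n' "$value" >> "$conf_file"
}

# On the device, from a root shell:
#   ensure_min_free_kbytes /etc/sysctl.conf 65536
#   sysctl -p                           # reload /etc/sysctl.conf
#   cat /proc/sys/vm/min_free_kbytes    # confirm the value now in effect
```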

AastaLLL · December 29, 2017, 5:57am

Thanks for sharing, a_phan.

ardianumam · January 22, 2019, 6:23am

AastaLLL:

Hi,

It depends on whether the memory is needed by the GPU or the CPU.

On TX2, the memory the GPU can allocate is limited to 8G.
But if the memory is needed by the CPU, you can try adding some swap memory.
# Create a swapfile for Ubuntu at the current directory location
fallocate -l 8G swapfile
# List out the file
ls -lh swapfile
# Change permissions so that only root can use it
chmod 600 swapfile
# List out the file
ls -lh swapfile
# Set up the Linux swap area
mkswap swapfile
# Now start using the swapfile
sudo swapon swapfile
# Show that it's now being used
swapon -s
Thanks.

Using this method, I can optimize the YOLOv3 frozen graph (https://github.com/ardianumam/Tensorflow-TensorRT) into a TensorRT graph on the Jetson TX2. Without swap memory, the TX2 runs out of memory just restoring the YOLOv3 graph. However, the resulting TRT graph takes a long time (~15 minutes) just to read/load/restore, while the original TensorFlow frozen model of YOLOv3 needs only 5 seconds to restore. Is there anything we can do to solve this issue?

AastaLLL · January 25, 2019, 6:13am

Hi,

TensorRT compiles a model into a TensorRT PLAN before launching.
This takes some time since the optimization is pretty complicated.

You can serialize a TensorRT PLAN and launch it next time without re-compiling.
Check this document for more information:
https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#work

Thanks.

dariusz.filipski · February 18, 2019, 10:34am

If your code is stuck in ParseFromString(), you are most likely being hit by the slowness of the protobuf Python backend. Check
https://devtalk.nvidia.com/default/topic/1046492/tensorrt/extremely-long-time-to-load-trt-optimized-frozen-tf-graphs/
and start with

export PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=cpp

before running your code.
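
To check that the setting is picked up, you can ask protobuf which backend it reports. This is only a sketch: `python3` is an assumption (use whichever interpreter runs your model), and what is reported for a build without the C++ extension may differ between protobuf versions.

```shell
# Select the C++ protobuf backend for every process started from this shell.
export PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=cpp

# Ask protobuf which backend it reports; python3 is an assumption, use your own interpreter.
python3 -c "from google.protobuf.internal import api_implementation; print(api_implementation.Type())" \
    || echo "could not query protobuf (not installed for python3, or no C++ backend in this build)"
```

If this prints python, or the import fails, the installed protobuf package was probably built without the C++ extension, and the variable alone will not speed up loading.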

kayccc · February 21, 2019, 3:29am

Thanks for sharing.

rahulvijaysoans231444 · August 23, 2021, 11:12am

Hello,
I followed your steps and the total swap memory increased to 10 GB. However, I see no difference in performance; TensorFlow still overloads the main memory in this case. How can I restore my swap to its original size?
Thank you

kayccc · September 1, 2021, 3:02am

Hi rahulvijaysoans231444,

This is an old thread. Please open a new topic with more details.

Thanks
