Ran out of memory on GPU

2019-05-06 14:02:27.264629: I tensorflow/stream_executor/dso_loader.cc:152] successfully opened CUDA library libcublas.so.10.0 locally
2019-05-06 14:02:30.554284: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 3.45GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:30.555624: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 3.02GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:30.884528: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 1.72GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:32.482456: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 1.72GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:32.482999: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.54GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:34.231668: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.32GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:35.367897: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.27GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:02:35.949636: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.09GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:05:05.327823: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.11GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.
2019-05-06 14:05:05.629307: W tensorflow/core/common_runtime/bfc_allocator.cc:211] Allocator (GPU_0_bfc) ran out of memory trying to allocate 2.08GiB. The caller indicates that this is not a failure, but may mean that there could be performance gains if more memory were available.

Does this mean that the GPU is not being used? I am using the tensorflow:latest-gpu-py3 image.

The GPU is definitely being used. Warnings like these are fairly normal when you push the batch size toward its upper limit. One place they can arise is the autotuning of convolution algorithms: some GPU convolution algorithms need to allocate a temporary workspace, and if that workspace allocation fails, the warning above is issued and autotuning simply continues with algorithms that have smaller workspace requirements.
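If you want to confirm device placement yourself, a minimal sketch along these lines works with the TF 1.x API that the latest-gpu-py3 image shipped at the time; the tensor names and shapes here are arbitrary and only for illustration:

```python
import tensorflow as tf

# Print the device each op is placed on; GPU-placed ops show up as /device:GPU:0
config = tf.ConfigProto(log_device_placement=True)

# Optional: allocate GPU memory on demand instead of reserving nearly all of it
# up front (useful when sharing the GPU with other processes).
config.gpu_options.allow_growth = True

with tf.Session(config=config) as sess:
    a = tf.random_normal([1024, 1024], name='a')
    b = tf.random_normal([1024, 1024], name='b')
    c = tf.matmul(a, b, name='matmul')  # should be placed on the GPU
    sess.run(c)

# Quick sanity check that TensorFlow can see a GPU at all
print(tf.test.is_gpu_available())
```

Running nvidia-smi on the host while the container is training is another quick way to confirm that the process is actually holding GPU memory.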

InvalidArgumentError: Tensor input_image:0, specified in either feed_devices or fetch_devices was not found in the Graph

I’m getting this error and I can’t figure out how to fix it. Could it be caused by the warnings above?