Excessive GPU memory consumption of "tf.keras.metrics.Metric" objects

I’m really new to TensorFlow and just noticed something unusual when instantiating a Keras metric object as follows.

import tensorflow as tf
m = tf.keras.metrics.Mean(name='test')

Once I execute the two lines above in Python, GPU memory consumption jumps from 0% to around 95% (about 10 GiB) almost instantly. It never goes back down until I terminate the program or delete the instance. I confirmed this with the nvtop GPU monitor.

My machine is an Ubuntu server equipped with eight RTX 2080 Ti GPUs, and I’m using a Docker image provided by NVIDIA NGC (specifically, nvcr.io/nvidia/tensorflow:20.03-tf2-py3).

I observed the same issue on a Titan Xp machine, and another Docker image (nvcr.io/nvidia/tensorflow:20.01-tf2-py3) showed the same behavior.

Does anyone else see the same issue? Is it a bug in TensorFlow or in the Docker image?

I just found out that this happens because I didn’t set the GPU memory growth option. By default, TensorFlow maps nearly all of the memory of every visible GPU when it initializes, so nothing was wrong with TensorFlow or the Docker images; it was just my unfamiliarity with basic TensorFlow usage.

For anyone who runs into the same problem, enable the GPU memory growth option with the following Python code. Note that the environment variable must be set before TensorFlow initializes the GPUs (i.e., before the first GPU operation runs).

import os
os.environ["TF_FORCE_GPU_ALLOW_GROWTH"] = "true"

Refer to the official TensorFlow guide: Use a GPU  |  TensorFlow Core