Libprotobuf FATAL Error on Tensorflow Training

guneet · April 2, 2021, 7:34am

Description

A clear and concise description of the bug or issue.

Environment

TensorRT Version:
GPU Type: Tesla T4
Nvidia Driver Version: 450.80.02
CUDA Version: 11.0
CUDNN Version:
Operating System + Version: Ubuntu + 18.04
Python Version (if applicable): 3.6
TensorFlow Version (if applicable): 2.4.1
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

ERROR

IVER_API, cbid)failed with error CUPTI could not be loaded or symbol could not be found.
2021-03-08 08:25:35.572927: W tensorflow/core/framework/cpu_allocator_impl.cc:81] Allocation of 2247026860 exceeds 10% of free system memory.
[libprotobuf FATAL external/com_google_protobuf/src/google/protobuf/wire_format_lite.cc:523] CHECK failed: (value.size()) <= (kint32max):

I get this error when I use a large dataset for training. I have tried reducing the batch size, but that didn’t help. On the other hand, if I reduce my training data, it works.
Can you help me understand the reason behind this error? Is there a limit on the amount of training data, or is there some other cause?

My script uses MirroredStrategy with a distributed dataset to achieve data parallelism among the workers.
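
For reference, the numbers in the log above can be checked directly. The failed CHECK compares the size of a single serialized value against `kint32max` (2^31 - 1 bytes), and the allocation warning just before it reports 2247026860 bytes, which is above that bound. The sketch below only does that arithmetic. It assumes the two log lines refer to the same buffer, which the thread does not confirm, and the helper name `exceeds_protobuf_limit` is hypothetical.

```python
# kint32max is the largest signed 32-bit integer. The CHECK in
# wire_format_lite.cc requires a single serialized value to be no
# larger than this many bytes.
KINT32MAX = 2**31 - 1  # 2147483647

# Size reported in the allocation warning from the log above.
# Assumption: this is the same buffer that later failed the CHECK.
ALLOCATION_BYTES = 2247026860


def exceeds_protobuf_limit(num_bytes: int) -> bool:
    """Hypothetical helper: True if a value of this size would fail the CHECK."""
    return num_bytes > KINT32MAX


if __name__ == "__main__":
    print(exceeds_protobuf_limit(ALLOCATION_BYTES))  # True
    print(ALLOCATION_BYTES - KINT32MAX)              # 99543213 bytes over
```

If that assumption holds, it would fit the observation that shrinking the dataset helps while shrinking the batch size does not: something proportional to the whole dataset, rather than to one batch, may be getting serialized as a single value. This is a guess from the log alone, not a confirmed diagnosis.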

NVES · April 2, 2021, 7:37am

Hi,
We recommend checking the sample links below, as they might answer your concerns.

If the issue persists, please share the model and script so that we can try to reproduce it on our end.
Thanks!

spolisetty · April 5, 2021, 8:18am

Hi @guneet.

Based on the above description, this doesn’t look like a TensorRT-related issue.
You may get better help here,

Thank you.

mukulranjaniitg · April 29, 2022, 2:47am

@guneet How were you able to solve this issue? I am also getting the same error, and it works if I reduce the dataset.
