Hi all. I’m trying to get some of our object detection models running on the TX2. Most of our models are in the Keras H5 format, and the goal is to load them on the TX2 and run inference. So far I’ve followed an NVIDIA guide to set up CUDA and TensorRT (as far as I can tell, both are present and functional) and to install TensorFlow. From there I followed this guide to convert our model into a TensorRT-compatible format (TensorFlow 2, using SavedModel); a rough sketch of the conversion code is below. The model seems to save and load just fine as far as I can tell (though it takes a long time). When I try to run inference, however, I get an error saying that too many resources were requested:
> tensorflow.python.framework.errors_impl.InvalidArgumentError: cannot compute __inference_signature_wrapper_65074 as input #0(zero-based) was expected to be a float tensor but is a double tensor [Op:__inference_signature_wrapper_65074]
> >>> f(single_class_input=arr.astype("float32"))
> 2021-04-13 23:10:41.654211: I tensorflow/compiler/tf2tensorrt/common/utils.cc:58] Linked TensorRT version: 7.1.3
> 2021-04-13 23:10:41.895634: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libnvinfer.so.7
> 2021-04-13 23:10:41.895849: I tensorflow/compiler/tf2tensorrt/common/utils.cc:60] Loaded TensorRT version: 7.1.3
> 2021-04-13 23:10:41.914211: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libnvinfer_plugin.so.7
> 2021-04-13 23:11:28.899038: F tensorflow/core/kernels/image/resize_bilinear_op_gpu.cu.cc:447] Non-OK-status: GpuLaunchKernel(kernel, config.block_count, config.thread_per_block, 0, d.stream(), config.virtual_thread_count, images.data(), height_scale, width_scale, batch, in_height, in_width, channels, out_height, out_width, output.data()) status: Internal: too many resources requested for launch
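
For context, the conversion step looks roughly like this (the model path and output directories are placeholders, not our exact values):

```python
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Load the Keras H5 model and re-export it as a TF SavedModel,
# which is what the TF-TRT converter expects as input.
model = tf.keras.models.load_model("detector.h5")
model.save("detector_saved_model")

# Convert the SavedModel with TF-TRT (FP16 shown here; I also tried FP32).
params = trt.DEFAULT_TRT_CONVERSION_PARAMS._replace(
    precision_mode=trt.TrtPrecisionMode.FP16)
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="detector_saved_model",
    conversion_params=params)
converter.convert()
converter.save("detector_trt")
```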
Note that I’ve tried this with both FP32 and FP16 precision modes. Usually at this point the process hangs for a while and then crashes with “Aborted (core dumped)”. I’m not quite sure where to go from here. The model is based on MobileNetV2 and is fairly small (around 2 million parameters). Any idea where things might be going wrong?
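
For completeness, the inference call that triggers the crash looks roughly like this (the input shape is illustrative; `single_class_input` is the actual signature input name from my model):

```python
import numpy as np
import tensorflow as tf

# Load the TF-TRT converted SavedModel and grab its serving signature.
loaded = tf.saved_model.load("detector_trt")
f = loaded.signatures["serving_default"]

# np.random.rand() returns float64, which produced the InvalidArgumentError
# above; casting to float32 gets past that and into the GpuLaunchKernel crash.
arr = np.random.rand(1, 300, 300, 3).astype(np.float32)
outputs = f(single_class_input=tf.constant(arr))
```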