Cuda Error in createFilterTextureFused

I am using TensorRT 4.0.0.3 with TensorFlow 1.8. My model is MobileNetV2, and my code is written with Tensorpack (a library built on TensorFlow). If I do not convert the model to a TensorRT graph, it runs successfully, but after converting to a TensorRT graph it fails with:

2018-06-26 15:26:54.048618: I tensorflow/contrib/tensorrt/convert/] starting build engine
2018-06-26 15:26:54.637062: I tensorflow/contrib/tensorrt/convert/] Built network
2018-06-26 15:26:54.637735: I tensorflow/contrib/tensorrt/convert/] Serialized engine
2018-06-26 15:26:54.640263: I tensorflow/contrib/tensorrt/convert/] finished engine bottleneck4/block3/depthconv/my_trt_op36 containing 7 nodes
2018-06-26 15:26:54.640542: I tensorflow/contrib/tensorrt/convert/] Finished op preparation
2018-06-26 15:26:54.640788: I tensorflow/contrib/tensorrt/convert/] OK finished op building for bottleneck4/block3/depthconv/my_trt_op36 on device
convert over!!!!!!!!!!!!!!!!!!!!!!!!
2018-06-26 15:26:54.660024: I tensorflow/core/common_runtime/gpu/] Found device 0 with properties:
name: GeForce GTX 1080 Ti major: 6 minor: 1 memoryClockRate(GHz): 1.582
pciBusID: 0000:04:00.0
totalMemory: 10.91GiB freeMemory: 10.53GiB
2018-06-26 15:26:54.660740: I tensorflow/core/common_runtime/gpu/] Adding visible gpu devices: 0
2018-06-26 15:26:54.660973: I tensorflow/core/common_runtime/gpu/] Device interconnect StreamExecutor with strength 1 edge matrix:
2018-06-26 15:26:54.661213: I tensorflow/core/common_runtime/gpu/]      0
2018-06-26 15:26:54.661410: I tensorflow/core/common_runtime/gpu/] 0:   N
2018-06-26 15:26:54.661836: I tensorflow/core/common_runtime/gpu/] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 3351 MB memory) -> physical GPU (device: 0, name: GeForce GTX 1080 Ti, pci bus id: 0000:04:00.0, compute capability: 6.1)
2018-06-26 15:26:56.636093: E tensorflow/contrib/tensorrt/log/] DefaultLogger cudnnFusedConvActLayer.cpp (64) - Cuda Error in createFilterTextureFused: 11
terminate called after throwing an instance of 'nvinfer1::CudaError'
  what():  std::exception
Aborted (core dumped)

The log shows that 37 subgraphs were converted to TensorRT successfully, but then I get a CUDA error, and I cannot find any report of the same problem.
Here is the code I use to apply TensorRT:

import tensorflow as tf
import tensorflow.contrib.tensorrt as trt
from tensorflow.python.platform import gfile

graph_def = tf.GraphDef()
with gfile.FastGFile("model.pb", 'rb') as f:
    graph_def.ParseFromString(f.read())  # load the frozen graph
trt_graph = trt.create_inference_graph(graph_def, OUTPUT_NAMES,
                                       precision_mode="FP32")  # get optimized graph
print("convert over!!!!!!!!!!!!!!!!!!!!!!!!")
g = tf.Graph()
with g.as_default():
    gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.30)
    with tf.Session(graph=g, config=tf.ConfigProto(gpu_options=gpu_options)) as sess:
        datasets = tf.data.Dataset.from_generator(...,  # generator elided here
                                                  (tf.uint8),
                                                  (tf.TensorShape([1, cfg.img_size, cfg.img_size, 3])))
        # import the TensorRT-optimized graph and fetch the output tensors
        out = tf.import_graph_def(trt_graph, return_elements=OUTPUT_NAMES)
        for r in out:
            ...
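One detail I noticed while debugging (not claiming it is the cause of the error): the "3351 MB" the log reports for the created TensorFlow device is exactly the per_process_gpu_memory_fraction=0.30 cap from my GPUOptions applied to the card's 10.91 GiB, so the TensorRT engines only have about 3.3 GB to work with. A quick sanity check of that arithmetic:

```python
# Values taken from the log above and the script's GPUOptions.
total_mib = 10.91 * 1024          # totalMemory: 10.91GiB, in MiB
fraction = 0.30                   # per_process_gpu_memory_fraction
print(int(total_mib * fraction))  # matches the 3351 MB in the log
```

So the device-creation line is consistent with my config; whether 3.3 GB is enough for the fused-convolution filter textures is what I am unsure about.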

Then I run it with a sess.run call.