onnx version - 1.11.0 onnxsimplifier version - 0.3.7 onnxruntime version - 1.10.0 Onnx Runtime device - GPU onnx model IR version - 7 onnx model producer name - onnx model producer version - Model inputs: [name: "input" type { tensor_type { elem_type: 1 shape { dim { dim_param: "batch" } dim { dim_param: "channels" } dim { dim_param: "rows" } dim { dim_param: "cols" } } } } ] CAUTION!!! - Tensor input name - input , dimension - batch , set its value to 1 for Onnx simplify operation CAUTION!!! - Tensor input name - input , dimension - channels , set its value to 1 for Onnx simplify operation CAUTION!!! - Tensor input name - input , dimension - rows , set its value to 1 for Onnx simplify operation CAUTION!!! - Tensor input name - input , dimension - cols , set its value to 1 for Onnx simplify operation Model outputs: [name: "mask" type { tensor_type { elem_type: 1 shape { dim { dim_param: "Castmask_dim_0" } dim { dim_param: "Castmask_dim_1" } } } } , name: "score_map" type { tensor_type { elem_type: 1 shape { dim { dim_param: "Reshapescore_map_dim_0" } dim { dim_param: "Reshapescore_map_dim_1" } } } } , name: "dense_feat_map" type { tensor_type { elem_type: 1 shape { dim { dim_param: "batch" } dim { dim_value: 128 } dim { dim_param: "unk__59" } dim { dim_param: "unk__60" } } } } ] trt version - 8.4.0.6 pandas version - 1.1.5 INSTALLED VERSIONS ------------------ commit : b5958ee1999e9aead1938c0bba2b674378807b3d python : 3.6.8.final.0 python-bits : 64 OS : Windows OS-release : 10 Version : 10.0.19041 machine : AMD64 processor : Intel64 Family 6 Model 165 Stepping 5, GenuineIntel byteorder : little LC_ALL : None LANG : None LOCALE : None.None pandas : 1.1.5 numpy : 1.19.5 pytz : 2021.3 dateutil : 2.8.2 pip : 18.1 setuptools : 40.6.2 Cython : None pytest : None hypothesis : None sphinx : None blosc : None feather : None xlsxwriter : None lxml.etree : None html5lib : None pymysql : None psycopg2 : None jinja2 : None IPython : None pandas_datareader: None bs4 : None bottleneck : None fsspec : None fastparquet : None gcsfs : None matplotlib : 3.3.4 numexpr : None odfpy : None openpyxl : None pandas_gbq : None pyarrow : None pytables : None pyxlsb : None s3fs : None scipy : 1.5.4 sqlalchemy : None tables : None tabulate : None xarray : None xlrd : None xlwt : None numba : None cuda version - (11, 4, 0) pycuda curand version - (10, 20, 5) pycuda driver version - 11060 Detected 1 CUDA Capable device(s) Device 0: NVIDIA GeForce RTX 3090 Compute Capability: 8.6 Total Memory: 24575 megabytes TRT - INFO [MemUsageChange] Init CUDA: CPU +394, GPU +0, now: CPU 10419, GPU 2363 (MiB) TRT - INFO [MemUsageSnapshot] Begin constructing builder kernel library: CPU 10622 MiB, GPU 2363 MiB TRT - INFO [MemUsageSnapshot] End constructing builder kernel library: CPU 11018 MiB, GPU 2487 MiB TRT - INFO [MemUsageChange] Init CUDA: CPU +0, GPU +0, now: CPU 10862, GPU 2487 (MiB) TRT - INFO [MemUsageChange] Init CUDA: CPU +0, GPU +0, now: CPU 10862, GPU 2487 (MiB) TRT - VERBOSE Registered plugin creator - ::GridAnchor_TRT version 1 TRT - VERBOSE Registered plugin creator - ::GridAnchorRect_TRT version 1 TRT - VERBOSE Registered plugin creator - ::NMS_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Reorg_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Region_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Clip_TRT version 1 TRT - VERBOSE Registered plugin creator - ::LReLU_TRT version 1 TRT - VERBOSE Registered plugin creator - ::PriorBox_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Normalize_TRT version 1 TRT - VERBOSE Registered plugin creator - ::ScatterND version 1 TRT - VERBOSE Registered plugin creator - ::RPROI_TRT version 1 TRT - VERBOSE Registered plugin creator - ::BatchedNMS_TRT version 1 TRT - VERBOSE Registered plugin creator - ::BatchedNMSDynamic_TRT version 1 TRT - VERBOSE Registered plugin creator - ::FlattenConcat_TRT version 1 TRT - VERBOSE Registered plugin creator - ::CropAndResize version 1 TRT - VERBOSE Registered plugin creator - ::DetectionLayer_TRT version 1 TRT - VERBOSE Registered plugin creator - ::EfficientNMS_TRT version 1 TRT - VERBOSE Registered plugin creator - ::EfficientNMS_ONNX_TRT version 1 TRT - VERBOSE Registered plugin creator - ::EfficientNMS_Explicit_TF_TRT version 1 TRT - VERBOSE Registered plugin creator - ::EfficientNMS_Implicit_TF_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Proposal version 1 TRT - VERBOSE Registered plugin creator - ::ProposalLayer_TRT version 1 TRT - VERBOSE Registered plugin creator - ::PyramidROIAlign_TRT version 1 TRT - VERBOSE Registered plugin creator - ::ResizeNearest_TRT version 1 TRT - VERBOSE Registered plugin creator - ::Split version 1 TRT - VERBOSE Registered plugin creator - ::SpecialSlice_TRT version 1 TRT - VERBOSE Registered plugin creator - ::InstanceNormalization_TRT version 1 TRT - VERBOSE Registered plugin creator - ::InstanceNormalization_TRT version 2 TRT - VERBOSE Adding network input: input with dtype: float32, dimensions: (-1, -1, -1, -1) TRT - VERBOSE Registering tensor: input for ONNX tensor: input TRT - VERBOSE Importing initializer: 35 TRT - WARNING onnx2trt_utils.cpp:365: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32. TRT - VERBOSE Importing initializer: 38 TRT - VERBOSE Importing initializer: 487 TRT - VERBOSE Importing initializer: 488 TRT - VERBOSE Importing initializer: descriptor_map.layer_01.3.weight TRT - VERBOSE Importing initializer: 46 TRT - VERBOSE Importing initializer: 47 TRT - VERBOSE Importing initializer: descriptor_map.layer_23.0.running_mean TRT - VERBOSE Importing initializer: descriptor_map.layer_23.0.running_var TRT - VERBOSE Importing initializer: 490 TRT - VERBOSE Importing initializer: 491 TRT - VERBOSE Importing initializer: descriptor_map.layer_23.5.weight TRT - VERBOSE Importing initializer: 56 TRT - VERBOSE Importing initializer: 57 TRT - VERBOSE Importing initializer: descriptor_map.layer_45.0.running_mean TRT - VERBOSE Importing initializer: descriptor_map.layer_45.0.running_var TRT - VERBOSE Importing initializer: 493 TRT - VERBOSE Importing initializer: 494 TRT - VERBOSE Importing initializer: 496 TRT - VERBOSE Importing initializer: 497 TRT - VERBOSE Importing initializer: 499 TRT - VERBOSE Importing initializer: 500 TRT - VERBOSE Importing initializer: 502 TRT - VERBOSE Importing initializer: 503 TRT - VERBOSE Importing initializer: descriptor_map.layer_678.6.weight TRT - VERBOSE Importing initializer: 82 TRT - VERBOSE Importing initializer: 85 TRT - VERBOSE Importing initializer: 89 TRT - VERBOSE Importing initializer: 91 TRT - VERBOSE Importing initializer: 504 TRT - VERBOSE Importing initializer: 95 TRT - VERBOSE Importing initializer: 98 TRT - VERBOSE Importing initializer: 104 TRT - VERBOSE Importing initializer: 107 TRT - VERBOSE Importing initializer: 102 TRT - VERBOSE Importing initializer: 111 TRT - VERBOSE Importing initializer: 115 TRT - VERBOSE Importing initializer: 117 TRT - VERBOSE Importing initializer: 505 TRT - VERBOSE Importing initializer: 142 TRT - VERBOSE Importing initializer: 144 TRT - VERBOSE Importing initializer: 146 TRT - VERBOSE Importing initializer: 150 TRT - VERBOSE Importing initializer: 152 TRT - VERBOSE Importing initializer: 149 TRT - VERBOSE Importing initializer: 163 TRT - VERBOSE Importing initializer: 165 TRT - VERBOSE Importing initializer: 170 TRT - VERBOSE Importing initializer: 171 TRT - VERBOSE Importing initializer: 169 TRT - VERBOSE Importing initializer: 178 TRT - VERBOSE Importing initializer: 181 TRT - VERBOSE Importing initializer: 184 TRT - VERBOSE Importing initializer: 190 TRT - VERBOSE Importing initializer: 193 TRT - VERBOSE Importing initializer: 188 TRT - VERBOSE Importing initializer: 197 TRT - VERBOSE Importing initializer: 201 TRT - VERBOSE Importing initializer: 203 TRT - VERBOSE Importing initializer: 511 TRT - VERBOSE Importing initializer: 228 TRT - VERBOSE Importing initializer: 231 TRT - VERBOSE Importing initializer: 235 TRT - VERBOSE Importing initializer: 237 TRT - VERBOSE Importing initializer: 234 TRT - VERBOSE Importing initializer: 248 TRT - VERBOSE Importing initializer: 250 TRT - VERBOSE Importing initializer: 255 TRT - VERBOSE Importing initializer: 256 TRT - VERBOSE Importing initializer: 254 TRT - VERBOSE Importing initializer: 263 TRT - VERBOSE Importing initializer: 266 TRT - VERBOSE Importing initializer: 269 TRT - VERBOSE Importing initializer: 275 TRT - VERBOSE Importing initializer: 278 TRT - VERBOSE Importing initializer: 273 TRT - VERBOSE Importing initializer: 282 TRT - VERBOSE Importing initializer: 286 TRT - VERBOSE Importing initializer: 288 TRT - VERBOSE Importing initializer: 517 TRT - VERBOSE Importing initializer: 313 TRT - VERBOSE Importing initializer: 316 TRT - VERBOSE Importing initializer: 320 TRT - VERBOSE Importing initializer: 322 TRT - VERBOSE Importing initializer: 319 TRT - VERBOSE Importing initializer: 333 TRT - VERBOSE Importing initializer: 335 TRT - VERBOSE Importing initializer: 340 TRT - VERBOSE Importing initializer: 341 TRT - VERBOSE Importing initializer: 339 TRT - VERBOSE Importing initializer: 348 TRT - VERBOSE Importing initializer: 352 TRT - VERBOSE Importing initializer: 355 TRT - VERBOSE Importing initializer: 357 TRT - VERBOSE Importing initializer: 365 TRT - VERBOSE Importing initializer: 367 TRT - VERBOSE Importing initializer: 375 TRT - VERBOSE Importing initializer: 377 TRT - VERBOSE Importing initializer: 523 TRT - VERBOSE Importing initializer: 524 TRT - VERBOSE Importing initializer: 403 TRT - VERBOSE Importing initializer: 404 TRT - VERBOSE Importing initializer: 432 TRT - VERBOSE Importing initializer: 433 TRT - VERBOSE Importing initializer: 435 TRT - VERBOSE Importing initializer: 437 TRT - VERBOSE Importing initializer: 439 TRT - VERBOSE Importing initializer: 445 TRT - VERBOSE Importing initializer: 448 TRT - VERBOSE Importing initializer: 450 TRT - VERBOSE Importing initializer: 462 TRT - VERBOSE Importing initializer: 465 TRT - VERBOSE Importing initializer: 467 TRT - VERBOSE Importing initializer: 469 TRT - VERBOSE Importing initializer: 474 TRT - VERBOSE Importing initializer: 477 TRT - VERBOSE Importing initializer: 479 TRT - VERBOSE Importing initializer: 481 TRT - VERBOSE Parsing node: Shape_0 [Shape] TRT - VERBOSE Searching for input: input TRT - VERBOSE Shape_0 [Shape] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_0 for ONNX node: Shape_0 TRT - VERBOSE Registering tensor: 34 for ONNX tensor: 34 TRT - VERBOSE Shape_0 [Shape] outputs: [34 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_2 [Gather] TRT - VERBOSE Searching for input: 34 TRT - VERBOSE Searching for input: 35 TRT - VERBOSE Gather_2 [Gather] inputs: [34 -> (4)[INT32]], [35 -> ()[INT32]], TRT - VERBOSE Registering layer: 35 for ONNX node: 35 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_2 for ONNX node: Gather_2 TRT - VERBOSE Registering tensor: 36 for ONNX tensor: 36 TRT - VERBOSE Gather_2 [Gather] outputs: [36 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_3 [Shape] TRT - VERBOSE Searching for input: input TRT - VERBOSE Shape_3 [Shape] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_3 for ONNX node: Shape_3 TRT - VERBOSE Registering tensor: 37 for ONNX tensor: 37 TRT - VERBOSE Shape_3 [Shape] outputs: [37 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_5 [Gather] TRT - VERBOSE Searching for input: 37 TRT - VERBOSE Searching for input: 38 TRT - VERBOSE Gather_5 [Gather] inputs: [37 -> (4)[INT32]], [38 -> ()[INT32]], TRT - VERBOSE Registering layer: 38 for ONNX node: 38 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_5 for ONNX node: Gather_5 TRT - VERBOSE Registering tensor: 39 for ONNX tensor: 39 TRT - VERBOSE Gather_5 [Gather] outputs: [39 -> ()[INT32]], TRT - VERBOSE Parsing node: Conv_6 [Conv] TRT - VERBOSE Searching for input: input TRT - VERBOSE Searching for input: 487 TRT - VERBOSE Searching for input: 488 TRT - VERBOSE Conv_6 [Conv] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], [487 -> (32, 3, 3, 3)[FLOAT]], [488 -> (32)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, -1, -1, -1) TRT - VERBOSE Registering layer: Conv_6 for ONNX node: Conv_6 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 32 TRT - VERBOSE Convolution output dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering tensor: 486 for ONNX tensor: 486 TRT - VERBOSE Conv_6 [Conv] outputs: [486 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_7 [Relu] TRT - VERBOSE Searching for input: 486 TRT - VERBOSE Relu_7 [Relu] inputs: [486 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_7 for ONNX node: Relu_7 TRT - VERBOSE Registering tensor: 44 for ONNX tensor: 44 TRT - VERBOSE Relu_7 [Relu] outputs: [44 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_8 [Conv] TRT - VERBOSE Searching for input: 44 TRT - VERBOSE Searching for input: descriptor_map.layer_01.3.weight TRT - VERBOSE Conv_8 [Conv] inputs: [44 -> (-1, 32, -1, -1)[FLOAT]], [descriptor_map.layer_01.3.weight -> (32, 32, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering layer: Conv_8 for ONNX node: Conv_8 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 32 TRT - VERBOSE Convolution output dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering tensor: 45 for ONNX tensor: 45 TRT - VERBOSE Conv_8 [Conv] outputs: [45 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: BatchNormalization_11 [BatchNormalization] TRT - VERBOSE Searching for input: 45 TRT - VERBOSE Searching for input: 46 TRT - VERBOSE Searching for input: 47 TRT - VERBOSE Searching for input: descriptor_map.layer_23.0.running_mean TRT - VERBOSE Searching for input: descriptor_map.layer_23.0.running_var TRT - VERBOSE BatchNormalization_11 [BatchNormalization] inputs: [45 -> (-1, 32, -1, -1)[FLOAT]], [46 -> (32)[FLOAT]], [47 -> (32)[FLOAT]], [descriptor_map.layer_23.0.running_mean -> (32)[FLOAT]], [descriptor_map.layer_23.0.running_var -> (32)[FLOAT]], TRT - VERBOSE Registering layer: BatchNormalization_11 for ONNX node: BatchNormalization_11 TRT - VERBOSE Registering tensor: 48 for ONNX tensor: 48 TRT - VERBOSE BatchNormalization_11 [BatchNormalization] outputs: [48 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_12 [Relu] TRT - VERBOSE Searching for input: 48 TRT - VERBOSE Relu_12 [Relu] inputs: [48 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_12 for ONNX node: Relu_12 TRT - VERBOSE Registering tensor: 49 for ONNX tensor: 49 TRT - VERBOSE Relu_12 [Relu] outputs: [49 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_13 [Conv] TRT - VERBOSE Searching for input: 49 TRT - VERBOSE Searching for input: 490 TRT - VERBOSE Searching for input: 491 TRT - VERBOSE Conv_13 [Conv] inputs: [49 -> (-1, 32, -1, -1)[FLOAT]], [490 -> (64, 32, 3, 3)[FLOAT]], [491 -> (64)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering layer: Conv_13 for ONNX node: Conv_13 TRT - VERBOSE Using kernel: (3, 3), strides: (2, 2), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 64 TRT - VERBOSE Convolution output dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering tensor: 489 for ONNX tensor: 489 TRT - VERBOSE Conv_13 [Conv] outputs: [489 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_14 [Relu] TRT - VERBOSE Searching for input: 489 TRT - VERBOSE Relu_14 [Relu] inputs: [489 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_14 for ONNX node: Relu_14 TRT - VERBOSE Registering tensor: 54 for ONNX tensor: 54 TRT - VERBOSE Relu_14 [Relu] outputs: [54 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_15 [Conv] TRT - VERBOSE Searching for input: 54 TRT - VERBOSE Searching for input: descriptor_map.layer_23.5.weight TRT - VERBOSE Conv_15 [Conv] inputs: [54 -> (-1, 64, -1, -1)[FLOAT]], [descriptor_map.layer_23.5.weight -> (64, 64, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering layer: Conv_15 for ONNX node: Conv_15 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 64 TRT - VERBOSE Convolution output dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering tensor: 55 for ONNX tensor: 55 TRT - VERBOSE Conv_15 [Conv] outputs: [55 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: BatchNormalization_18 [BatchNormalization] TRT - VERBOSE Searching for input: 55 TRT - VERBOSE Searching for input: 56 TRT - VERBOSE Searching for input: 57 TRT - VERBOSE Searching for input: descriptor_map.layer_45.0.running_mean TRT - VERBOSE Searching for input: descriptor_map.layer_45.0.running_var TRT - VERBOSE BatchNormalization_18 [BatchNormalization] inputs: [55 -> (-1, 64, -1, -1)[FLOAT]], [56 -> (64)[FLOAT]], [57 -> (64)[FLOAT]], [descriptor_map.layer_45.0.running_mean -> (64)[FLOAT]], [descriptor_map.layer_45.0.running_var -> (64)[FLOAT]], TRT - VERBOSE Registering layer: BatchNormalization_18 for ONNX node: BatchNormalization_18 TRT - VERBOSE Registering tensor: 58 for ONNX tensor: 58 TRT - VERBOSE BatchNormalization_18 [BatchNormalization] outputs: [58 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_19 [Relu] TRT - VERBOSE Searching for input: 58 TRT - VERBOSE Relu_19 [Relu] inputs: [58 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_19 for ONNX node: Relu_19 TRT - VERBOSE Registering tensor: 59 for ONNX tensor: 59 TRT - VERBOSE Relu_19 [Relu] outputs: [59 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_20 [Conv] TRT - VERBOSE Searching for input: 59 TRT - VERBOSE Searching for input: 493 TRT - VERBOSE Searching for input: 494 TRT - VERBOSE Conv_20 [Conv] inputs: [59 -> (-1, 64, -1, -1)[FLOAT]], [493 -> (128, 64, 3, 3)[FLOAT]], [494 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering layer: Conv_20 for ONNX node: Conv_20 TRT - VERBOSE Using kernel: (3, 3), strides: (2, 2), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 492 for ONNX tensor: 492 TRT - VERBOSE Conv_20 [Conv] outputs: [492 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_21 [Relu] TRT - VERBOSE Searching for input: 492 TRT - VERBOSE Relu_21 [Relu] inputs: [492 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_21 for ONNX node: Relu_21 TRT - VERBOSE Registering tensor: 64 for ONNX tensor: 64 TRT - VERBOSE Relu_21 [Relu] outputs: [64 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_22 [Conv] TRT - VERBOSE Searching for input: 64 TRT - VERBOSE Searching for input: 496 TRT - VERBOSE Searching for input: 497 TRT - VERBOSE Conv_22 [Conv] inputs: [64 -> (-1, 128, -1, -1)[FLOAT]], [496 -> (128, 128, 3, 3)[FLOAT]], [497 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_22 for ONNX node: Conv_22 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 495 for ONNX tensor: 495 TRT - VERBOSE Conv_22 [Conv] outputs: [495 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_23 [Relu] TRT - VERBOSE Searching for input: 495 TRT - VERBOSE Relu_23 [Relu] inputs: [495 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_23 for ONNX node: Relu_23 TRT - VERBOSE Registering tensor: 69 for ONNX tensor: 69 TRT - VERBOSE Relu_23 [Relu] outputs: [69 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_24 [Conv] TRT - VERBOSE Searching for input: 69 TRT - VERBOSE Searching for input: 499 TRT - VERBOSE Searching for input: 500 TRT - VERBOSE Conv_24 [Conv] inputs: [69 -> (-1, 128, -1, -1)[FLOAT]], [499 -> (128, 128, 3, 3)[FLOAT]], [500 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_24 for ONNX node: Conv_24 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 498 for ONNX tensor: 498 TRT - VERBOSE Conv_24 [Conv] outputs: [498 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_25 [Relu] TRT - VERBOSE Searching for input: 498 TRT - VERBOSE Relu_25 [Relu] inputs: [498 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_25 for ONNX node: Relu_25 TRT - VERBOSE Registering tensor: 74 for ONNX tensor: 74 TRT - VERBOSE Relu_25 [Relu] outputs: [74 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_26 [Conv] TRT - VERBOSE Searching for input: 74 TRT - VERBOSE Searching for input: 502 TRT - VERBOSE Searching for input: 503 TRT - VERBOSE Conv_26 [Conv] inputs: [74 -> (-1, 128, -1, -1)[FLOAT]], [502 -> (128, 128, 3, 3)[FLOAT]], [503 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_26 for ONNX node: Conv_26 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 501 for ONNX tensor: 501 TRT - VERBOSE Conv_26 [Conv] outputs: [501 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_27 [Relu] TRT - VERBOSE Searching for input: 501 TRT - VERBOSE Relu_27 [Relu] inputs: [501 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_27 for ONNX node: Relu_27 TRT - VERBOSE Registering tensor: 79 for ONNX tensor: 79 TRT - VERBOSE Relu_27 [Relu] outputs: [79 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_28 [Conv] TRT - VERBOSE Searching for input: 79 TRT - VERBOSE Searching for input: descriptor_map.layer_678.6.weight TRT - VERBOSE Conv_28 [Conv] inputs: [79 -> (-1, 128, -1, -1)[FLOAT]], [descriptor_map.layer_678.6.weight -> (128, 128, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_28 for ONNX node: Conv_28 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: dense_feat_map_82 for ONNX tensor: dense_feat_map TRT - VERBOSE Conv_28 [Conv] outputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_29 [Shape] TRT - VERBOSE Searching for input: dense_feat_map TRT - VERBOSE Shape_29 [Shape] inputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_29 for ONNX node: Shape_29 TRT - VERBOSE Registering tensor: 81 for ONNX tensor: 81 TRT - VERBOSE Shape_29 [Shape] outputs: [81 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_31 [Gather] TRT - VERBOSE Searching for input: 81 TRT - VERBOSE Searching for input: 82 TRT - VERBOSE Gather_31 [Gather] inputs: [81 -> (4)[INT32]], [82 -> ()[INT32]], TRT - VERBOSE Registering layer: 82 for ONNX node: 82 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_31 for ONNX node: Gather_31 TRT - VERBOSE Registering tensor: 83 for ONNX tensor: 83 TRT - VERBOSE Gather_31 [Gather] outputs: [83 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_33 [Unsqueeze] TRT - VERBOSE Searching for input: 83 TRT - VERBOSE Searching for input: 85 TRT - VERBOSE Unsqueeze_33 [Unsqueeze] inputs: [83 -> ()[INT32]], [85 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_33 for ONNX node: Unsqueeze_33 TRT - VERBOSE Registering tensor: 86 for ONNX tensor: 86 TRT - VERBOSE Unsqueeze_33 [Unsqueeze] outputs: [86 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_35 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 89 TRT - VERBOSE Unsqueeze_35 [Unsqueeze] inputs: [36 -> ()[INT32]], [89 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_35 for ONNX node: Unsqueeze_35 TRT - VERBOSE Registering tensor: 90 for ONNX tensor: 90 TRT - VERBOSE Unsqueeze_35 [Unsqueeze] outputs: [90 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_37 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 91 TRT - VERBOSE Unsqueeze_37 [Unsqueeze] inputs: [39 -> ()[INT32]], [91 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_37 for ONNX node: Unsqueeze_37 TRT - VERBOSE Registering tensor: 92 for ONNX tensor: 92 TRT - VERBOSE Unsqueeze_37 [Unsqueeze] outputs: [92 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_38 [Concat] TRT - VERBOSE Searching for input: 86 TRT - VERBOSE Searching for input: 504 TRT - VERBOSE Searching for input: 90 TRT - VERBOSE Searching for input: 92 TRT - VERBOSE Concat_38 [Concat] inputs: [86 -> (1)[INT32]], [504 -> (1)[INT32]], [90 -> (1)[INT32]], [92 -> (1)[INT32]], TRT - VERBOSE Registering layer: 504 for ONNX node: 504 TRT - VERBOSE Registering layer: Concat_38 for ONNX node: Concat_38 TRT - VERBOSE Registering tensor: 93 for ONNX tensor: 93 TRT - VERBOSE Concat_38 [Concat] outputs: [93 -> (4)[INT32]], TRT - VERBOSE Parsing node: ConstantOfShape_39 [ConstantOfShape] TRT - VERBOSE Searching for input: 93 TRT - VERBOSE ConstantOfShape_39 [ConstantOfShape] inputs: [93 -> (4)[INT32]], TRT - VERBOSE Registering layer: ConstantOfShape_39 for ONNX node: ConstantOfShape_39 TRT - VERBOSE Registering tensor: 94 for ONNX tensor: 94 TRT - VERBOSE ConstantOfShape_39 [ConstantOfShape] outputs: [94 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_41 [Div] TRT - VERBOSE Searching for input: 45 TRT - VERBOSE Searching for input: 95 TRT - VERBOSE Div_41 [Div] inputs: [45 -> (-1, 32, -1, -1)[FLOAT]], [95 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 95 for ONNX node: 95 TRT - VERBOSE Registering layer: Div_41 for ONNX node: Div_41 TRT - VERBOSE Registering tensor: 96 for ONNX tensor: 96 TRT - VERBOSE Div_41 [Div] outputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_42 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_42 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_42 for ONNX node: Shape_42 TRT - VERBOSE Registering tensor: 97 for ONNX tensor: 97 TRT - VERBOSE Shape_42 [Shape] outputs: [97 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_44 [Gather] TRT - VERBOSE Searching for input: 97 TRT - VERBOSE Searching for input: 98 TRT - VERBOSE Gather_44 [Gather] inputs: [97 -> (4)[INT32]], [98 -> ()[INT32]], TRT - VERBOSE Registering layer: 98 for ONNX node: 98 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_44 for ONNX node: Gather_44 TRT - VERBOSE Registering tensor: 99 for ONNX tensor: 99 TRT - VERBOSE Gather_44 [Gather] outputs: [99 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_48 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_48 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_48 for ONNX node: Shape_48 TRT - VERBOSE Registering tensor: 103 for ONNX tensor: 103 TRT - VERBOSE Shape_48 [Shape] outputs: [103 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_50 [Gather] TRT - VERBOSE Searching for input: 103 TRT - VERBOSE Searching for input: 104 TRT - VERBOSE Gather_50 [Gather] inputs: [103 -> (4)[INT32]], [104 -> ()[INT32]], TRT - VERBOSE Registering layer: 104 for ONNX node: 104 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_50 for ONNX node: Gather_50 TRT - VERBOSE Registering tensor: 105 for ONNX tensor: 105 TRT - VERBOSE Gather_50 [Gather] outputs: [105 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_51 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_51 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_51 for ONNX node: Shape_51 TRT - VERBOSE Registering tensor: 106 for ONNX tensor: 106 TRT - VERBOSE Shape_51 [Shape] outputs: [106 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_53 [Gather] TRT - VERBOSE Searching for input: 106 TRT - VERBOSE Searching for input: 107 TRT - VERBOSE Gather_53 [Gather] inputs: [106 -> (4)[INT32]], [107 -> ()[INT32]], TRT - VERBOSE Registering layer: 107 for ONNX node: 107 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_53 for ONNX node: Gather_53 TRT - VERBOSE Registering tensor: 108 for ONNX tensor: 108 TRT - VERBOSE Gather_53 [Gather] outputs: [108 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_54 [Mul] TRT - VERBOSE Searching for input: 99 TRT - VERBOSE Searching for input: 102 TRT - VERBOSE Mul_54 [Mul] inputs: [99 -> ()[INT32]], [102 -> ()[INT32]], TRT - VERBOSE Registering layer: 102 for ONNX node: 102 TRT - VERBOSE Registering layer: Mul_54 for ONNX node: Mul_54 TRT - VERBOSE Registering tensor: 109 for ONNX tensor: 109 TRT - VERBOSE Mul_54 [Mul] outputs: [109 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_56 [Unsqueeze] TRT - VERBOSE Searching for input: 109 TRT - VERBOSE Searching for input: 111 TRT - VERBOSE Unsqueeze_56 [Unsqueeze] inputs: [109 -> ()[INT32]], [111 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_56 for ONNX node: Unsqueeze_56 TRT - VERBOSE Registering tensor: 112 for ONNX tensor: 112 TRT - VERBOSE Unsqueeze_56 [Unsqueeze] outputs: [112 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_58 [Unsqueeze] TRT - VERBOSE Searching for input: 105 TRT - VERBOSE Searching for input: 115 TRT - VERBOSE Unsqueeze_58 [Unsqueeze] inputs: [105 -> ()[INT32]], [115 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_58 for ONNX node: Unsqueeze_58 TRT - VERBOSE Registering tensor: 116 for ONNX tensor: 116 TRT - VERBOSE Unsqueeze_58 [Unsqueeze] outputs: [116 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_60 [Unsqueeze] TRT - VERBOSE Searching for input: 108 TRT - VERBOSE Searching for input: 117 TRT - VERBOSE Unsqueeze_60 [Unsqueeze] inputs: [108 -> ()[INT32]], [117 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_60 for ONNX node: Unsqueeze_60 TRT - VERBOSE Registering tensor: 118 for ONNX tensor: 118 TRT - VERBOSE Unsqueeze_60 [Unsqueeze] outputs: [118 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_61 [Concat] TRT - VERBOSE Searching for input: 112 TRT - VERBOSE Searching for input: 505 TRT - VERBOSE Searching for input: 116 TRT - VERBOSE Searching for input: 118 TRT - VERBOSE Concat_61 [Concat] inputs: [112 -> (1)[INT32]], [505 -> (1)[INT32]], [116 -> (1)[INT32]], [118 -> (1)[INT32]], TRT - VERBOSE Registering layer: 505 for ONNX node: 505 TRT - VERBOSE Registering layer: Concat_61 for ONNX node: Concat_61 TRT - VERBOSE Registering tensor: 119 for ONNX tensor: 119 TRT - VERBOSE Concat_61 [Concat] outputs: [119 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_62 [Reshape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 119 TRT - VERBOSE Reshape_62 [Reshape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [119 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_62 for ONNX node: Reshape_62 TRT - VERBOSE Registering tensor: 120 for ONNX tensor: 120 TRT - VERBOSE Reshape_62 [Reshape] outputs: [120 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_76 [Pad] TRT - VERBOSE Searching for input: 120 TRT - VERBOSE Searching for input: 142 TRT - VERBOSE Pad_76 [Pad] inputs: [120 -> (-1, 1, -1, -1)[FLOAT]], [142 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_76 for ONNX node: Pad_76 TRT - VERBOSE Registering tensor: 143 for ONNX tensor: 143 TRT - VERBOSE Pad_76 [Pad] outputs: [143 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_78 [Conv] TRT - VERBOSE Searching for input: 143 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_78 [Conv] inputs: [143 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_78 for ONNX node: Conv_78 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 145 for ONNX tensor: 145 TRT - VERBOSE Conv_78 [Conv] outputs: [145 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_80 [Unsqueeze] TRT - VERBOSE Searching for input: 99 TRT - VERBOSE Searching for input: 146 TRT - VERBOSE Unsqueeze_80 [Unsqueeze] inputs: [99 -> ()[INT32]], [146 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_80 for ONNX node: Unsqueeze_80 TRT - VERBOSE Registering tensor: 147 for ONNX tensor: 147 TRT - VERBOSE Unsqueeze_80 [Unsqueeze] outputs: [147 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_84 [Unsqueeze] TRT - VERBOSE Searching for input: 105 TRT - VERBOSE Searching for input: 150 TRT - VERBOSE Unsqueeze_84 [Unsqueeze] inputs: [105 -> ()[INT32]], [150 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_84 for ONNX node: Unsqueeze_84 TRT - VERBOSE Registering tensor: 151 for ONNX tensor: 151 TRT - VERBOSE Unsqueeze_84 [Unsqueeze] outputs: [151 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_86 [Unsqueeze] TRT - VERBOSE Searching for input: 108 TRT - VERBOSE Searching for input: 152 TRT - VERBOSE Unsqueeze_86 [Unsqueeze] inputs: [108 -> ()[INT32]], [152 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_86 for ONNX node: Unsqueeze_86 TRT - VERBOSE Registering tensor: 153 for ONNX tensor: 153 TRT - VERBOSE Unsqueeze_86 [Unsqueeze] outputs: [153 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_87 [Concat] TRT - VERBOSE Searching for input: 147 TRT - VERBOSE Searching for input: 149 TRT - VERBOSE Searching for input: 151 TRT - VERBOSE Searching for input: 153 TRT - VERBOSE Concat_87 [Concat] inputs: [147 -> (1)[INT32]], [149 -> (1)[INT32]], [151 -> (1)[INT32]], [153 -> (1)[INT32]], TRT - VERBOSE Registering layer: 149 for ONNX node: 149 TRT - VERBOSE Registering layer: Concat_87 for ONNX node: Concat_87 TRT - VERBOSE Registering tensor: 154 for ONNX tensor: 154 TRT - VERBOSE Concat_87 [Concat] outputs: [154 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_88 [Reshape] TRT - VERBOSE Searching for input: 145 TRT - VERBOSE Searching for input: 154 TRT - VERBOSE Reshape_88 [Reshape] inputs: [145 -> (-1, 1, -1, -1)[FLOAT]], [154 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_88 for ONNX node: Reshape_88 TRT - VERBOSE Registering tensor: 155 for ONNX tensor: 155 TRT - VERBOSE Reshape_88 [Reshape] outputs: [155 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_89 [ReduceMean] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE ReduceMean_89 [ReduceMean] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_89 for ONNX node: ReduceMean_89 TRT - VERBOSE Registering tensor: 156 for ONNX tensor: 156 TRT - VERBOSE ReduceMean_89 [ReduceMean] outputs: [156 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_90 [Sub] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 155 TRT - VERBOSE Sub_90 [Sub] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [155 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_90 for ONNX node: Sub_90 TRT - VERBOSE Registering tensor: 157 for ONNX tensor: 157 TRT - VERBOSE Sub_90 [Sub] outputs: [157 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_91 [Softplus] TRT - VERBOSE Searching for input: 157 TRT - VERBOSE Softplus_91 [Softplus] inputs: [157 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_91 for ONNX node: Softplus_91 TRT - VERBOSE Registering tensor: 158 for ONNX tensor: 158 TRT - VERBOSE Softplus_91 [Softplus] outputs: [158 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_92 [Sub] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 156 TRT - VERBOSE Sub_92 [Sub] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [156 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_92 for ONNX node: Sub_92 TRT - VERBOSE Registering tensor: 159 for ONNX tensor: 159 TRT - VERBOSE Sub_92 [Sub] outputs: [159 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_93 [Softplus] TRT - VERBOSE Searching for input: 159 TRT - VERBOSE Softplus_93 [Softplus] inputs: [159 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_93 for ONNX node: Softplus_93 TRT - VERBOSE Registering tensor: 160 for ONNX tensor: 160 TRT - VERBOSE Softplus_93 [Softplus] outputs: [160 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_94 [Mul] TRT - VERBOSE Searching for input: 158 TRT - VERBOSE Searching for input: 160 TRT - VERBOSE Mul_94 [Mul] inputs: [158 -> (-1, 32, -1, -1)[FLOAT]], [160 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_94 for ONNX node: Mul_94 TRT - VERBOSE Registering tensor: 161 for ONNX tensor: 161 TRT - VERBOSE Mul_94 [Mul] outputs: [161 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_95 [ReduceMax] TRT - VERBOSE Searching for input: 161 TRT - VERBOSE ReduceMax_95 [ReduceMax] inputs: [161 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_95 for ONNX node: ReduceMax_95 TRT - VERBOSE Registering tensor: 162 for ONNX tensor: 162 TRT - VERBOSE ReduceMax_95 [ReduceMax] outputs: [162 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_97 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 163 TRT - VERBOSE Unsqueeze_97 [Unsqueeze] inputs: [36 -> ()[INT32]], [163 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_97 for ONNX node: Unsqueeze_97 TRT - VERBOSE Registering tensor: 164 for ONNX tensor: 164 TRT - VERBOSE Unsqueeze_97 [Unsqueeze] outputs: [164 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_99 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 165 TRT - VERBOSE Unsqueeze_99 [Unsqueeze] inputs: [39 -> ()[INT32]], [165 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_99 for ONNX node: Unsqueeze_99 TRT - VERBOSE Registering tensor: 166 for ONNX tensor: 166 TRT - VERBOSE Unsqueeze_99 [Unsqueeze] outputs: [166 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_100 [Concat] TRT - VERBOSE Searching for input: 164 TRT - VERBOSE Searching for input: 166 TRT - VERBOSE Concat_100 [Concat] inputs: [164 -> (1)[INT32]], [166 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_100 for ONNX node: Concat_100 TRT - VERBOSE Registering tensor: 167 for ONNX tensor: 167 TRT - VERBOSE Concat_100 [Concat] outputs: [167 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_101 [Shape] TRT - VERBOSE Searching for input: 162 TRT - VERBOSE Shape_101 [Shape] inputs: [162 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_101 for ONNX node: Shape_101 TRT - VERBOSE Registering tensor: 168 for ONNX tensor: 168 TRT - VERBOSE Shape_101 [Shape] outputs: [168 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_105 [Slice] TRT - VERBOSE Searching for input: 168 TRT - VERBOSE Searching for input: 170 TRT - VERBOSE Searching for input: 171 TRT - VERBOSE Searching for input: 169 TRT - VERBOSE Slice_105 [Slice] inputs: [168 -> (4)[INT32]], [170 -> (1)[INT32]], [171 -> (1)[INT32]], [169 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_105 for ONNX node: Slice_105 TRT - VERBOSE Registering tensor: 172 for ONNX tensor: 172 TRT - VERBOSE Slice_105 [Slice] outputs: [172 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_106 [Cast] TRT - VERBOSE Searching for input: 167 TRT - VERBOSE Cast_106 [Cast] inputs: [167 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_106 for ONNX node: Cast_106 TRT - VERBOSE Registering tensor: 173 for ONNX tensor: 173 TRT - VERBOSE Cast_106 [Cast] outputs: [173 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_107 [Concat] TRT - VERBOSE Searching for input: 172 TRT - VERBOSE Searching for input: 173 TRT - VERBOSE Concat_107 [Concat] inputs: [172 -> (2)[INT32]], [173 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_107 for ONNX node: Concat_107 TRT - VERBOSE Registering tensor: 174 for ONNX tensor: 174 TRT - VERBOSE Concat_107 [Concat] outputs: [174 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_108 [Resize] TRT - VERBOSE Searching for input: 162 TRT - VERBOSE Searching for input: 174 TRT - VERBOSE Resize_108 [Resize] inputs: [162 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [174 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_108 for ONNX node: Resize_108 TRT - VERBOSE Registering tensor: 177 for ONNX tensor: 177 TRT - VERBOSE Resize_108 [Resize] outputs: [177 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_110 [Mul] TRT - VERBOSE Searching for input: 178 TRT - VERBOSE Searching for input: 177 TRT - VERBOSE Mul_110 [Mul] inputs: [178 -> ()[FLOAT]], [177 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 178 for ONNX node: 178 TRT - VERBOSE Registering layer: Mul_110 for ONNX node: Mul_110 TRT - VERBOSE Registering tensor: 179 for ONNX tensor: 179 TRT - VERBOSE Mul_110 [Mul] outputs: [179 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_111 [Add] TRT - VERBOSE Searching for input: 94 TRT - VERBOSE Searching for input: 179 TRT - VERBOSE Add_111 [Add] inputs: [94 -> (-1, 1, -1, -1)[FLOAT]], [179 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_111 for ONNX node: Add_111 TRT - VERBOSE Registering tensor: 180 for ONNX tensor: 180 TRT - VERBOSE Add_111 [Add] outputs: [180 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_113 [Div] TRT - VERBOSE Searching for input: 55 TRT - VERBOSE Searching for input: 181 TRT - VERBOSE Div_113 [Div] inputs: [55 -> (-1, 64, -1, -1)[FLOAT]], [181 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 181 for ONNX node: 181 TRT - VERBOSE Registering layer: Div_113 for ONNX node: Div_113 TRT - VERBOSE Registering tensor: 182 for ONNX tensor: 182 TRT - VERBOSE Div_113 [Div] outputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_114 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_114 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_114 for ONNX node: Shape_114 TRT - VERBOSE Registering tensor: 183 for ONNX tensor: 183 TRT - VERBOSE Shape_114 [Shape] outputs: [183 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_116 [Gather] TRT - VERBOSE Searching for input: 183 TRT - VERBOSE Searching for input: 184 TRT - VERBOSE Gather_116 [Gather] inputs: [183 -> (4)[INT32]], [184 -> ()[INT32]], TRT - VERBOSE Registering layer: 184 for ONNX node: 184 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_116 for ONNX node: Gather_116 TRT - VERBOSE Registering tensor: 185 for ONNX tensor: 185 TRT - VERBOSE Gather_116 [Gather] outputs: [185 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_120 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_120 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_120 for ONNX node: Shape_120 TRT - VERBOSE Registering tensor: 189 for ONNX tensor: 189 TRT - VERBOSE Shape_120 [Shape] outputs: [189 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_122 [Gather] TRT - VERBOSE Searching for input: 189 TRT - VERBOSE Searching for input: 190 TRT - VERBOSE Gather_122 [Gather] inputs: [189 -> (4)[INT32]], [190 -> ()[INT32]], TRT - VERBOSE Registering layer: 190 for ONNX node: 190 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_122 for ONNX node: Gather_122 TRT - VERBOSE Registering tensor: 191 for ONNX tensor: 191 TRT - VERBOSE Gather_122 [Gather] outputs: [191 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_123 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_123 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_123 for ONNX node: Shape_123 TRT - VERBOSE Registering tensor: 192 for ONNX tensor: 192 TRT - VERBOSE Shape_123 [Shape] outputs: [192 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_125 [Gather] TRT - VERBOSE Searching for input: 192 TRT - VERBOSE Searching for input: 193 TRT - VERBOSE Gather_125 [Gather] inputs: [192 -> (4)[INT32]], [193 -> ()[INT32]], TRT - VERBOSE Registering layer: 193 for ONNX node: 193 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_125 for ONNX node: Gather_125 TRT - VERBOSE Registering tensor: 194 for ONNX tensor: 194 TRT - VERBOSE Gather_125 [Gather] outputs: [194 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_126 [Mul] TRT - VERBOSE Searching for input: 185 TRT - VERBOSE Searching for input: 188 TRT - VERBOSE Mul_126 [Mul] inputs: [185 -> ()[INT32]], [188 -> ()[INT32]], TRT - VERBOSE Registering layer: 188 for ONNX node: 188 TRT - VERBOSE Registering layer: Mul_126 for ONNX node: Mul_126 TRT - VERBOSE Registering tensor: 195 for ONNX tensor: 195 TRT - VERBOSE Mul_126 [Mul] outputs: [195 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_128 [Unsqueeze] TRT - VERBOSE Searching for input: 195 TRT - VERBOSE Searching for input: 197 TRT - VERBOSE Unsqueeze_128 [Unsqueeze] inputs: [195 -> ()[INT32]], [197 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_128 for ONNX node: Unsqueeze_128 TRT - VERBOSE Registering tensor: 198 for ONNX tensor: 198 TRT - VERBOSE Unsqueeze_128 [Unsqueeze] outputs: [198 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_130 [Unsqueeze] TRT - VERBOSE Searching for input: 191 TRT - VERBOSE Searching for input: 201 TRT - VERBOSE Unsqueeze_130 [Unsqueeze] inputs: [191 -> ()[INT32]], [201 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_130 for ONNX node: Unsqueeze_130 TRT - VERBOSE Registering tensor: 202 for ONNX tensor: 202 TRT - VERBOSE Unsqueeze_130 [Unsqueeze] outputs: [202 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_132 [Unsqueeze] TRT - VERBOSE Searching for input: 194 TRT - VERBOSE Searching for input: 203 TRT - VERBOSE Unsqueeze_132 [Unsqueeze] inputs: [194 -> ()[INT32]], [203 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_132 for ONNX node: Unsqueeze_132 TRT - VERBOSE Registering tensor: 204 for ONNX tensor: 204 TRT - VERBOSE Unsqueeze_132 [Unsqueeze] outputs: [204 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_133 [Concat] TRT - VERBOSE Searching for input: 198 TRT - VERBOSE Searching for input: 511 TRT - VERBOSE Searching for input: 202 TRT - VERBOSE Searching for input: 204 TRT - VERBOSE Concat_133 [Concat] inputs: [198 -> (1)[INT32]], [511 -> (1)[INT32]], [202 -> (1)[INT32]], [204 -> (1)[INT32]], TRT - VERBOSE Registering layer: 511 for ONNX node: 511 TRT - VERBOSE Registering layer: Concat_133 for ONNX node: Concat_133 TRT - VERBOSE Registering tensor: 205 for ONNX tensor: 205 TRT - VERBOSE Concat_133 [Concat] outputs: [205 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_134 [Reshape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 205 TRT - VERBOSE Reshape_134 [Reshape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [205 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_134 for ONNX node: Reshape_134 TRT - VERBOSE Registering tensor: 206 for ONNX tensor: 206 TRT - VERBOSE Reshape_134 [Reshape] outputs: [206 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_148 [Pad] TRT - VERBOSE Searching for input: 206 TRT - VERBOSE Searching for input: 228 TRT - VERBOSE Pad_148 [Pad] inputs: [206 -> (-1, 1, -1, -1)[FLOAT]], [228 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_148 for ONNX node: Pad_148 TRT - VERBOSE Registering tensor: 229 for ONNX tensor: 229 TRT - VERBOSE Pad_148 [Pad] outputs: [229 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_149 [Conv] TRT - VERBOSE Searching for input: 229 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_149 [Conv] inputs: [229 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_149 for ONNX node: Conv_149 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (2, 2), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 230 for ONNX tensor: 230 TRT - VERBOSE Conv_149 [Conv] outputs: [230 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_151 [Unsqueeze] TRT - VERBOSE Searching for input: 185 TRT - VERBOSE Searching for input: 231 TRT - VERBOSE Unsqueeze_151 [Unsqueeze] inputs: [185 -> ()[INT32]], [231 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_151 for ONNX node: Unsqueeze_151 TRT - VERBOSE Registering tensor: 232 for ONNX tensor: 232 TRT - VERBOSE Unsqueeze_151 [Unsqueeze] outputs: [232 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_155 [Unsqueeze] TRT - VERBOSE Searching for input: 191 TRT - VERBOSE Searching for input: 235 TRT - VERBOSE Unsqueeze_155 [Unsqueeze] inputs: [191 -> ()[INT32]], [235 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_155 for ONNX node: Unsqueeze_155 TRT - VERBOSE Registering tensor: 236 for ONNX tensor: 236 TRT - VERBOSE Unsqueeze_155 [Unsqueeze] outputs: [236 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_157 [Unsqueeze] TRT - VERBOSE Searching for input: 194 TRT - VERBOSE Searching for input: 237 TRT - VERBOSE Unsqueeze_157 [Unsqueeze] inputs: [194 -> ()[INT32]], [237 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_157 for ONNX node: Unsqueeze_157 TRT - VERBOSE Registering tensor: 238 for ONNX tensor: 238 TRT - VERBOSE Unsqueeze_157 [Unsqueeze] outputs: [238 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_158 [Concat] TRT - VERBOSE Searching for input: 232 TRT - VERBOSE Searching for input: 234 TRT - VERBOSE Searching for input: 236 TRT - VERBOSE Searching for input: 238 TRT - VERBOSE Concat_158 [Concat] inputs: [232 -> (1)[INT32]], [234 -> (1)[INT32]], [236 -> (1)[INT32]], [238 -> (1)[INT32]], TRT - VERBOSE Registering layer: 234 for ONNX node: 234 TRT - VERBOSE Registering layer: Concat_158 for ONNX node: Concat_158 TRT - VERBOSE Registering tensor: 239 for ONNX tensor: 239 TRT - VERBOSE Concat_158 [Concat] outputs: [239 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_159 [Reshape] TRT - VERBOSE Searching for input: 230 TRT - VERBOSE Searching for input: 239 TRT - VERBOSE Reshape_159 [Reshape] inputs: [230 -> (-1, 1, -1, -1)[FLOAT]], [239 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_159 for ONNX node: Reshape_159 TRT - VERBOSE Registering tensor: 240 for ONNX tensor: 240 TRT - VERBOSE Reshape_159 [Reshape] outputs: [240 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_160 [ReduceMean] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE ReduceMean_160 [ReduceMean] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_160 for ONNX node: ReduceMean_160 TRT - VERBOSE Registering tensor: 241 for ONNX tensor: 241 TRT - VERBOSE ReduceMean_160 [ReduceMean] outputs: [241 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_161 [Sub] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 240 TRT - VERBOSE Sub_161 [Sub] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [240 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_161 for ONNX node: Sub_161 TRT - VERBOSE Registering tensor: 242 for ONNX tensor: 242 TRT - VERBOSE Sub_161 [Sub] outputs: [242 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_162 [Softplus] TRT - VERBOSE Searching for input: 242 TRT - VERBOSE Softplus_162 [Softplus] inputs: [242 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_162 for ONNX node: Softplus_162 TRT - VERBOSE Registering tensor: 243 for ONNX tensor: 243 TRT - VERBOSE Softplus_162 [Softplus] outputs: [243 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_163 [Sub] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 241 TRT - VERBOSE Sub_163 [Sub] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [241 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_163 for ONNX node: Sub_163 TRT - VERBOSE Registering tensor: 244 for ONNX tensor: 244 TRT - VERBOSE Sub_163 [Sub] outputs: [244 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_164 [Softplus] TRT - VERBOSE Searching for input: 244 TRT - VERBOSE Softplus_164 [Softplus] inputs: [244 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_164 for ONNX node: Softplus_164 TRT - VERBOSE Registering tensor: 245 for ONNX tensor: 245 TRT - VERBOSE Softplus_164 [Softplus] outputs: [245 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_165 [Mul] TRT - VERBOSE Searching for input: 243 TRT - VERBOSE Searching for input: 245 TRT - VERBOSE Mul_165 [Mul] inputs: [243 -> (-1, 64, -1, -1)[FLOAT]], [245 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_165 for ONNX node: Mul_165 TRT - VERBOSE Registering tensor: 246 for ONNX tensor: 246 TRT - VERBOSE Mul_165 [Mul] outputs: [246 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_166 [ReduceMax] TRT - VERBOSE Searching for input: 246 TRT - VERBOSE ReduceMax_166 [ReduceMax] inputs: [246 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_166 for ONNX node: ReduceMax_166 TRT - VERBOSE Registering tensor: 247 for ONNX tensor: 247 TRT - VERBOSE ReduceMax_166 [ReduceMax] outputs: [247 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_168 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 248 TRT - VERBOSE Unsqueeze_168 [Unsqueeze] inputs: [36 -> ()[INT32]], [248 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_168 for ONNX node: Unsqueeze_168 TRT - VERBOSE Registering tensor: 249 for ONNX tensor: 249 TRT - VERBOSE Unsqueeze_168 [Unsqueeze] outputs: [249 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_170 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 250 TRT - VERBOSE Unsqueeze_170 [Unsqueeze] inputs: [39 -> ()[INT32]], [250 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_170 for ONNX node: Unsqueeze_170 TRT - VERBOSE Registering tensor: 251 for ONNX tensor: 251 TRT - VERBOSE Unsqueeze_170 [Unsqueeze] outputs: [251 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_171 [Concat] TRT - VERBOSE Searching for input: 249 TRT - VERBOSE Searching for input: 251 TRT - VERBOSE Concat_171 [Concat] inputs: [249 -> (1)[INT32]], [251 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_171 for ONNX node: Concat_171 TRT - VERBOSE Registering tensor: 252 for ONNX tensor: 252 TRT - VERBOSE Concat_171 [Concat] outputs: [252 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_172 [Shape] TRT - VERBOSE Searching for input: 247 TRT - VERBOSE Shape_172 [Shape] inputs: [247 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_172 for ONNX node: Shape_172 TRT - VERBOSE Registering tensor: 253 for ONNX tensor: 253 TRT - VERBOSE Shape_172 [Shape] outputs: [253 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_176 [Slice] TRT - VERBOSE Searching for input: 253 TRT - VERBOSE Searching for input: 255 TRT - VERBOSE Searching for input: 256 TRT - VERBOSE Searching for input: 254 TRT - VERBOSE Slice_176 [Slice] inputs: [253 -> (4)[INT32]], [255 -> (1)[INT32]], [256 -> (1)[INT32]], [254 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_176 for ONNX node: Slice_176 TRT - VERBOSE Registering tensor: 257 for ONNX tensor: 257 TRT - VERBOSE Slice_176 [Slice] outputs: [257 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_177 [Cast] TRT - VERBOSE Searching for input: 252 TRT - VERBOSE Cast_177 [Cast] inputs: [252 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_177 for ONNX node: Cast_177 TRT - VERBOSE Registering tensor: 258 for ONNX tensor: 258 TRT - VERBOSE Cast_177 [Cast] outputs: [258 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_178 [Concat] TRT - VERBOSE Searching for input: 257 TRT - VERBOSE Searching for input: 258 TRT - VERBOSE Concat_178 [Concat] inputs: [257 -> (2)[INT32]], [258 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_178 for ONNX node: Concat_178 TRT - VERBOSE Registering tensor: 259 for ONNX tensor: 259 TRT - VERBOSE Concat_178 [Concat] outputs: [259 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_179 [Resize] TRT - VERBOSE Searching for input: 247 TRT - VERBOSE Searching for input: 259 TRT - VERBOSE Resize_179 [Resize] inputs: [247 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [259 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_179 for ONNX node: Resize_179 TRT - VERBOSE Registering tensor: 262 for ONNX tensor: 262 TRT - VERBOSE Resize_179 [Resize] outputs: [262 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_181 [Mul] TRT - VERBOSE Searching for input: 263 TRT - VERBOSE Searching for input: 262 TRT - VERBOSE Mul_181 [Mul] inputs: [263 -> ()[FLOAT]], [262 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 263 for ONNX node: 263 TRT - VERBOSE Registering layer: Mul_181 for ONNX node: Mul_181 TRT - VERBOSE Registering tensor: 264 for ONNX tensor: 264 TRT - VERBOSE Mul_181 [Mul] outputs: [264 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_182 [Add] TRT - VERBOSE Searching for input: 180 TRT - VERBOSE Searching for input: 264 TRT - VERBOSE Add_182 [Add] inputs: [180 -> (-1, 1, -1, -1)[FLOAT]], [264 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_182 for ONNX node: Add_182 TRT - VERBOSE Registering tensor: 265 for ONNX tensor: 265 TRT - VERBOSE Add_182 [Add] outputs: [265 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_184 [Div] TRT - VERBOSE Searching for input: dense_feat_map TRT - VERBOSE Searching for input: 266 TRT - VERBOSE Div_184 [Div] inputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], [266 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 266 for ONNX node: 266 TRT - VERBOSE Registering layer: Div_184 for ONNX node: Div_184 TRT - VERBOSE Registering tensor: 267 for ONNX tensor: 267 TRT - VERBOSE Div_184 [Div] outputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_185 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_185 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_185 for ONNX node: Shape_185 TRT - VERBOSE Registering tensor: 268 for ONNX tensor: 268 TRT - VERBOSE Shape_185 [Shape] outputs: [268 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_187 [Gather] TRT - VERBOSE Searching for input: 268 TRT - VERBOSE Searching for input: 269 TRT - VERBOSE Gather_187 [Gather] inputs: [268 -> (4)[INT32]], [269 -> ()[INT32]], TRT - VERBOSE Registering layer: 269 for ONNX node: 269 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_187 for ONNX node: Gather_187 TRT - VERBOSE Registering tensor: 270 for ONNX tensor: 270 TRT - VERBOSE Gather_187 [Gather] outputs: [270 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_191 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_191 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_191 for ONNX node: Shape_191 TRT - VERBOSE Registering tensor: 274 for ONNX tensor: 274 TRT - VERBOSE Shape_191 [Shape] outputs: [274 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_193 [Gather] TRT - VERBOSE Searching for input: 274 TRT - VERBOSE Searching for input: 275 TRT - VERBOSE Gather_193 [Gather] inputs: [274 -> (4)[INT32]], [275 -> ()[INT32]], TRT - VERBOSE Registering layer: 275 for ONNX node: 275 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_193 for ONNX node: Gather_193 TRT - VERBOSE Registering tensor: 276 for ONNX tensor: 276 TRT - VERBOSE Gather_193 [Gather] outputs: [276 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_194 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_194 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_194 for ONNX node: Shape_194 TRT - VERBOSE Registering tensor: 277 for ONNX tensor: 277 TRT - VERBOSE Shape_194 [Shape] outputs: [277 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_196 [Gather] TRT - VERBOSE Searching for input: 277 TRT - VERBOSE Searching for input: 278 TRT - VERBOSE Gather_196 [Gather] inputs: [277 -> (4)[INT32]], [278 -> ()[INT32]], TRT - VERBOSE Registering layer: 278 for ONNX node: 278 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_196 for ONNX node: Gather_196 TRT - VERBOSE Registering tensor: 279 for ONNX tensor: 279 TRT - VERBOSE Gather_196 [Gather] outputs: [279 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_197 [Mul] TRT - VERBOSE Searching for input: 270 TRT - VERBOSE Searching for input: 273 TRT - VERBOSE Mul_197 [Mul] inputs: [270 -> ()[INT32]], [273 -> ()[INT32]], TRT - VERBOSE Registering layer: 273 for ONNX node: 273 TRT - VERBOSE Registering layer: Mul_197 for ONNX node: Mul_197 TRT - VERBOSE Registering tensor: 280 for ONNX tensor: 280 TRT - VERBOSE Mul_197 [Mul] outputs: [280 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_199 [Unsqueeze] TRT - VERBOSE Searching for input: 280 TRT - VERBOSE Searching for input: 282 TRT - VERBOSE Unsqueeze_199 [Unsqueeze] inputs: [280 -> ()[INT32]], [282 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_199 for ONNX node: Unsqueeze_199 TRT - VERBOSE Registering tensor: 283 for ONNX tensor: 283 TRT - VERBOSE Unsqueeze_199 [Unsqueeze] outputs: [283 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_201 [Unsqueeze] TRT - VERBOSE Searching for input: 276 TRT - VERBOSE Searching for input: 286 TRT - VERBOSE Unsqueeze_201 [Unsqueeze] inputs: [276 -> ()[INT32]], [286 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_201 for ONNX node: Unsqueeze_201 TRT - VERBOSE Registering tensor: 287 for ONNX tensor: 287 TRT - VERBOSE Unsqueeze_201 [Unsqueeze] outputs: [287 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_203 [Unsqueeze] TRT - VERBOSE Searching for input: 279 TRT - VERBOSE Searching for input: 288 TRT - VERBOSE Unsqueeze_203 [Unsqueeze] inputs: [279 -> ()[INT32]], [288 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_203 for ONNX node: Unsqueeze_203 TRT - VERBOSE Registering tensor: 289 for ONNX tensor: 289 TRT - VERBOSE Unsqueeze_203 [Unsqueeze] outputs: [289 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_204 [Concat] TRT - VERBOSE Searching for input: 283 TRT - VERBOSE Searching for input: 517 TRT - VERBOSE Searching for input: 287 TRT - VERBOSE Searching for input: 289 TRT - VERBOSE Concat_204 [Concat] inputs: [283 -> (1)[INT32]], [517 -> (1)[INT32]], [287 -> (1)[INT32]], [289 -> (1)[INT32]], TRT - VERBOSE Registering layer: 517 for ONNX node: 517 TRT - VERBOSE Registering layer: Concat_204 for ONNX node: Concat_204 TRT - VERBOSE Registering tensor: 290 for ONNX tensor: 290 TRT - VERBOSE Concat_204 [Concat] outputs: [290 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_205 [Reshape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 290 TRT - VERBOSE Reshape_205 [Reshape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [290 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_205 for ONNX node: Reshape_205 TRT - VERBOSE Registering tensor: 291 for ONNX tensor: 291 TRT - VERBOSE Reshape_205 [Reshape] outputs: [291 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_219 [Pad] TRT - VERBOSE Searching for input: 291 TRT - VERBOSE Searching for input: 313 TRT - VERBOSE Pad_219 [Pad] inputs: [291 -> (-1, 1, -1, -1)[FLOAT]], [313 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_219 for ONNX node: Pad_219 TRT - VERBOSE Registering tensor: 314 for ONNX tensor: 314 TRT - VERBOSE Pad_219 [Pad] outputs: [314 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_220 [Conv] TRT - VERBOSE Searching for input: 314 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_220 [Conv] inputs: [314 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_220 for ONNX node: Conv_220 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (1, 1), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 315 for ONNX tensor: 315 TRT - VERBOSE Conv_220 [Conv] outputs: [315 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_222 [Unsqueeze] TRT - VERBOSE Searching for input: 270 TRT - VERBOSE Searching for input: 316 TRT - VERBOSE Unsqueeze_222 [Unsqueeze] inputs: [270 -> ()[INT32]], [316 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_222 for ONNX node: Unsqueeze_222 TRT - VERBOSE Registering tensor: 317 for ONNX tensor: 317 TRT - VERBOSE Unsqueeze_222 [Unsqueeze] outputs: [317 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_226 [Unsqueeze] TRT - VERBOSE Searching for input: 276 TRT - VERBOSE Searching for input: 320 TRT - VERBOSE Unsqueeze_226 [Unsqueeze] inputs: [276 -> ()[INT32]], [320 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_226 for ONNX node: Unsqueeze_226 TRT - VERBOSE Registering tensor: 321 for ONNX tensor: 321 TRT - VERBOSE Unsqueeze_226 [Unsqueeze] outputs: [321 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_228 [Unsqueeze] TRT - VERBOSE Searching for input: 279 TRT - VERBOSE Searching for input: 322 TRT - VERBOSE Unsqueeze_228 [Unsqueeze] inputs: [279 -> ()[INT32]], [322 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_228 for ONNX node: Unsqueeze_228 TRT - VERBOSE Registering tensor: 323 for ONNX tensor: 323 TRT - VERBOSE Unsqueeze_228 [Unsqueeze] outputs: [323 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_229 [Concat] TRT - VERBOSE Searching for input: 317 TRT - VERBOSE Searching for input: 319 TRT - VERBOSE Searching for input: 321 TRT - VERBOSE Searching for input: 323 TRT - VERBOSE Concat_229 [Concat] inputs: [317 -> (1)[INT32]], [319 -> (1)[INT32]], [321 -> (1)[INT32]], [323 -> (1)[INT32]], TRT - VERBOSE Registering layer: 319 for ONNX node: 319 TRT - VERBOSE Registering layer: Concat_229 for ONNX node: Concat_229 TRT - VERBOSE Registering tensor: 324 for ONNX tensor: 324 TRT - VERBOSE Concat_229 [Concat] outputs: [324 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_230 [Reshape] TRT - VERBOSE Searching for input: 315 TRT - VERBOSE Searching for input: 324 TRT - VERBOSE Reshape_230 [Reshape] inputs: [315 -> (-1, 1, -1, -1)[FLOAT]], [324 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_230 for ONNX node: Reshape_230 TRT - VERBOSE Registering tensor: 325 for ONNX tensor: 325 TRT - VERBOSE Reshape_230 [Reshape] outputs: [325 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_231 [ReduceMean] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE ReduceMean_231 [ReduceMean] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_231 for ONNX node: ReduceMean_231 TRT - VERBOSE Registering tensor: 326 for ONNX tensor: 326 TRT - VERBOSE ReduceMean_231 [ReduceMean] outputs: [326 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_232 [Sub] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 325 TRT - VERBOSE Sub_232 [Sub] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [325 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_232 for ONNX node: Sub_232 TRT - VERBOSE Registering tensor: 327 for ONNX tensor: 327 TRT - VERBOSE Sub_232 [Sub] outputs: [327 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_233 [Softplus] TRT - VERBOSE Searching for input: 327 TRT - VERBOSE Softplus_233 [Softplus] inputs: [327 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_233 for ONNX node: Softplus_233 TRT - VERBOSE Registering tensor: 328 for ONNX tensor: 328 TRT - VERBOSE Softplus_233 [Softplus] outputs: [328 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_234 [Sub] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 326 TRT - VERBOSE Sub_234 [Sub] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [326 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_234 for ONNX node: Sub_234 TRT - VERBOSE Registering tensor: 329 for ONNX tensor: 329 TRT - VERBOSE Sub_234 [Sub] outputs: [329 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_235 [Softplus] TRT - VERBOSE Searching for input: 329 TRT - VERBOSE Softplus_235 [Softplus] inputs: [329 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_235 for ONNX node: Softplus_235 TRT - VERBOSE Registering tensor: 330 for ONNX tensor: 330 TRT - VERBOSE Softplus_235 [Softplus] outputs: [330 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_236 [Mul] TRT - VERBOSE Searching for input: 328 TRT - VERBOSE Searching for input: 330 TRT - VERBOSE Mul_236 [Mul] inputs: [328 -> (-1, 128, -1, -1)[FLOAT]], [330 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_236 for ONNX node: Mul_236 TRT - VERBOSE Registering tensor: 331 for ONNX tensor: 331 TRT - VERBOSE Mul_236 [Mul] outputs: [331 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_237 [ReduceMax] TRT - VERBOSE Searching for input: 331 TRT - VERBOSE ReduceMax_237 [ReduceMax] inputs: [331 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_237 for ONNX node: ReduceMax_237 TRT - VERBOSE Registering tensor: 332 for ONNX tensor: 332 TRT - VERBOSE ReduceMax_237 [ReduceMax] outputs: [332 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_239 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 333 TRT - VERBOSE Unsqueeze_239 [Unsqueeze] inputs: [36 -> ()[INT32]], [333 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_239 for ONNX node: Unsqueeze_239 TRT - VERBOSE Registering tensor: 334 for ONNX tensor: 334 TRT - VERBOSE Unsqueeze_239 [Unsqueeze] outputs: [334 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_241 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 335 TRT - VERBOSE Unsqueeze_241 [Unsqueeze] inputs: [39 -> ()[INT32]], [335 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_241 for ONNX node: Unsqueeze_241 TRT - VERBOSE Registering tensor: 336 for ONNX tensor: 336 TRT - VERBOSE Unsqueeze_241 [Unsqueeze] outputs: [336 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_242 [Concat] TRT - VERBOSE Searching for input: 334 TRT - VERBOSE Searching for input: 336 TRT - VERBOSE Concat_242 [Concat] inputs: [334 -> (1)[INT32]], [336 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_242 for ONNX node: Concat_242 TRT - VERBOSE Registering tensor: 337 for ONNX tensor: 337 TRT - VERBOSE Concat_242 [Concat] outputs: [337 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_243 [Shape] TRT - VERBOSE Searching for input: 332 TRT - VERBOSE Shape_243 [Shape] inputs: [332 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_243 for ONNX node: Shape_243 TRT - VERBOSE Registering tensor: 338 for ONNX tensor: 338 TRT - VERBOSE Shape_243 [Shape] outputs: [338 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_247 [Slice] TRT - VERBOSE Searching for input: 338 TRT - VERBOSE Searching for input: 340 TRT - VERBOSE Searching for input: 341 TRT - VERBOSE Searching for input: 339 TRT - VERBOSE Slice_247 [Slice] inputs: [338 -> (4)[INT32]], [340 -> (1)[INT32]], [341 -> (1)[INT32]], [339 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_247 for ONNX node: Slice_247 TRT - VERBOSE Registering tensor: 342 for ONNX tensor: 342 TRT - VERBOSE Slice_247 [Slice] outputs: [342 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_248 [Cast] TRT - VERBOSE Searching for input: 337 TRT - VERBOSE Cast_248 [Cast] inputs: [337 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_248 for ONNX node: Cast_248 TRT - VERBOSE Registering tensor: 343 for ONNX tensor: 343 TRT - VERBOSE Cast_248 [Cast] outputs: [343 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_249 [Concat] TRT - VERBOSE Searching for input: 342 TRT - VERBOSE Searching for input: 343 TRT - VERBOSE Concat_249 [Concat] inputs: [342 -> (2)[INT32]], [343 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_249 for ONNX node: Concat_249 TRT - VERBOSE Registering tensor: 344 for ONNX tensor: 344 TRT - VERBOSE Concat_249 [Concat] outputs: [344 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_250 [Resize] TRT - VERBOSE Searching for input: 332 TRT - VERBOSE Searching for input: 344 TRT - VERBOSE Resize_250 [Resize] inputs: [332 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [344 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_250 for ONNX node: Resize_250 TRT - VERBOSE Registering tensor: 347 for ONNX tensor: 347 TRT - VERBOSE Resize_250 [Resize] outputs: [347 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_252 [Mul] TRT - VERBOSE Searching for input: 348 TRT - VERBOSE Searching for input: 347 TRT - VERBOSE Mul_252 [Mul] inputs: [348 -> ()[FLOAT]], [347 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 348 for ONNX node: 348 TRT - VERBOSE Registering layer: Mul_252 for ONNX node: Mul_252 TRT - VERBOSE Registering tensor: 349 for ONNX tensor: 349 TRT - VERBOSE Mul_252 [Mul] outputs: [349 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_253 [Add] TRT - VERBOSE Searching for input: 265 TRT - VERBOSE Searching for input: 349 TRT - VERBOSE Add_253 [Add] inputs: [265 -> (-1, 1, -1, -1)[FLOAT]], [349 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_253 for ONNX node: Add_253 TRT - VERBOSE Registering tensor: 350 for ONNX tensor: 350 TRT - VERBOSE Add_253 [Add] outputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_254 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_254 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_254 for ONNX node: Shape_254 TRT - VERBOSE Registering tensor: 351 for ONNX tensor: 351 TRT - VERBOSE Shape_254 [Shape] outputs: [351 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_256 [Gather] TRT - VERBOSE Searching for input: 351 TRT - VERBOSE Searching for input: 352 TRT - VERBOSE Gather_256 [Gather] inputs: [351 -> (4)[INT32]], [352 -> ()[INT32]], TRT - VERBOSE Registering layer: 352 for ONNX node: 352 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_256 for ONNX node: Gather_256 TRT - VERBOSE Registering tensor: 353 for ONNX tensor: 353 TRT - VERBOSE Gather_256 [Gather] outputs: [353 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_257 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_257 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_257 for ONNX node: Shape_257 TRT - VERBOSE Registering tensor: 354 for ONNX tensor: 354 TRT - VERBOSE Shape_257 [Shape] outputs: [354 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_259 [Gather] TRT - VERBOSE Searching for input: 354 TRT - VERBOSE Searching for input: 355 TRT - VERBOSE Gather_259 [Gather] inputs: [354 -> (4)[INT32]], [355 -> ()[INT32]], TRT - VERBOSE Registering layer: 355 for ONNX node: 355 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_259 for ONNX node: Gather_259 TRT - VERBOSE Registering tensor: 356 for ONNX tensor: 356 TRT - VERBOSE Gather_259 [Gather] outputs: [356 -> ()[INT32]], TRT - VERBOSE Parsing node: Greater_261 [Greater] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 357 TRT - VERBOSE Greater_261 [Greater] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [357 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 357 for ONNX node: 357 TRT - VERBOSE Registering layer: Greater_261 for ONNX node: Greater_261 TRT - VERBOSE Registering tensor: 358 for ONNX tensor: 358 TRT - VERBOSE Greater_261 [Greater] outputs: [358 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: MaxPool_262 [MaxPool] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE MaxPool_262 [MaxPool] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: MaxPool_262 for ONNX node: MaxPool_262 TRT - VERBOSE Registering tensor: 359 for ONNX tensor: 359 TRT - VERBOSE MaxPool_262 [MaxPool] outputs: [359 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Equal_263 [Equal] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 359 TRT - VERBOSE Equal_263 [Equal] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [359 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Equal_263 for ONNX node: Equal_263 TRT - VERBOSE Registering tensor: 360 for ONNX tensor: 360 TRT - VERBOSE Equal_263 [Equal] outputs: [360 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_264 [Cast] TRT - VERBOSE Searching for input: 360 TRT - VERBOSE Cast_264 [Cast] inputs: [360 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_264 for ONNX node: Cast_264 TRT - VERBOSE Registering tensor: 361 for ONNX tensor: 361 TRT - VERBOSE Cast_264 [Cast] outputs: [361 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_265 [Cast] TRT - VERBOSE Searching for input: 358 TRT - VERBOSE Cast_265 [Cast] inputs: [358 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_265 for ONNX node: Cast_265 TRT - VERBOSE Registering tensor: 362 for ONNX tensor: 362 TRT - VERBOSE Cast_265 [Cast] outputs: [362 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_266 [And] TRT - VERBOSE Searching for input: 361 TRT - VERBOSE Searching for input: 362 TRT - VERBOSE And_266 [And] inputs: [361 -> (-1, 1, -1, -1)[BOOL]], [362 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_266 for ONNX node: And_266 TRT - VERBOSE Registering tensor: 363 for ONNX tensor: 363 TRT - VERBOSE And_266 [And] outputs: [363 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_267 [Cast] TRT - VERBOSE Searching for input: 363 TRT - VERBOSE Cast_267 [Cast] inputs: [363 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_267 for ONNX node: Cast_267 TRT - VERBOSE Registering tensor: 364 for ONNX tensor: 364 TRT - VERBOSE Cast_267 [Cast] outputs: [364 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Sub_269 [Sub] TRT - VERBOSE Searching for input: 353 TRT - VERBOSE Searching for input: 365 TRT - VERBOSE Sub_269 [Sub] inputs: [353 -> ()[INT32]], [365 -> ()[INT32]], TRT - VERBOSE Registering layer: 365 for ONNX node: 365 TRT - VERBOSE Registering layer: Sub_269 for ONNX node: Sub_269 TRT - VERBOSE Registering tensor: 366 for ONNX tensor: 366 TRT - VERBOSE Sub_269 [Sub] outputs: [366 -> ()[INT32]], TRT - VERBOSE Parsing node: Sub_271 [Sub] TRT - VERBOSE Searching for input: 356 TRT - VERBOSE Searching for input: 367 TRT - VERBOSE Sub_271 [Sub] inputs: [356 -> ()[INT32]], [367 -> ()[INT32]], TRT - VERBOSE Registering layer: 367 for ONNX node: 367 TRT - VERBOSE Registering layer: Sub_271 for ONNX node: Sub_271 TRT - VERBOSE Registering tensor: 368 for ONNX tensor: 368 TRT - VERBOSE Sub_271 [Sub] outputs: [368 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_273 [Unsqueeze] TRT - VERBOSE Searching for input: 366 TRT - VERBOSE Searching for input: 375 TRT - VERBOSE Unsqueeze_273 [Unsqueeze] inputs: [366 -> ()[INT32]], [375 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_273 for ONNX node: Unsqueeze_273 TRT - VERBOSE Registering tensor: 376 for ONNX tensor: 376 TRT - VERBOSE Unsqueeze_273 [Unsqueeze] outputs: [376 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_275 [Unsqueeze] TRT - VERBOSE Searching for input: 368 TRT - VERBOSE Searching for input: 377 TRT - VERBOSE Unsqueeze_275 [Unsqueeze] inputs: [368 -> ()[INT32]], [377 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_275 for ONNX node: Unsqueeze_275 TRT - VERBOSE Registering tensor: 378 for ONNX tensor: 378 TRT - VERBOSE Unsqueeze_275 [Unsqueeze] outputs: [378 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_276 [Concat] TRT - VERBOSE Searching for input: 523 TRT - VERBOSE Searching for input: 524 TRT - VERBOSE Searching for input: 376 TRT - VERBOSE Searching for input: 378 TRT - VERBOSE Concat_276 [Concat] inputs: [523 -> (1)[INT32]], [524 -> (1)[INT32]], [376 -> (1)[INT32]], [378 -> (1)[INT32]], TRT - VERBOSE Registering layer: 523 for ONNX node: 523 TRT - VERBOSE Registering layer: 524 for ONNX node: 524 TRT - VERBOSE Registering layer: Concat_276 for ONNX node: Concat_276 TRT - VERBOSE Registering tensor: 379 for ONNX tensor: 379 TRT - VERBOSE Concat_276 [Concat] outputs: [379 -> (4)[INT32]], TRT - VERBOSE Parsing node: ConstantOfShape_277 [ConstantOfShape] TRT - VERBOSE Searching for input: 379 TRT - VERBOSE ConstantOfShape_277 [ConstantOfShape] inputs: [379 -> (4)[INT32]], TRT - VERBOSE Registering layer: ConstantOfShape_277 for ONNX node: ConstantOfShape_277 TRT - VERBOSE Registering tensor: 380 for ONNX tensor: 380 TRT - VERBOSE ConstantOfShape_277 [ConstantOfShape] outputs: [380 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_278 [Cast] TRT - VERBOSE Searching for input: 380 TRT - VERBOSE Cast_278 [Cast] inputs: [380 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Casting to type: float32 TRT - VERBOSE Registering layer: Cast_278 for ONNX node: Cast_278 TRT - VERBOSE Registering tensor: 381 for ONNX tensor: 381 TRT - VERBOSE Cast_278 [Cast] outputs: [381 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_293 [Pad] TRT - VERBOSE Searching for input: 381 TRT - VERBOSE Searching for input: 403 TRT - VERBOSE Searching for input: 404 TRT - VERBOSE Pad_293 [Pad] inputs: [381 -> (1, 1, -1, -1)[FLOAT]], [403 -> (8)[INT32]], [404 -> ()[FLOAT]], TRT - VERBOSE Registering layer: Pad_293 for ONNX node: Pad_293 TRT - VERBOSE Registering tensor: 405 for ONNX tensor: 405 TRT - VERBOSE Pad_293 [Pad] outputs: [405 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_294 [Cast] TRT - VERBOSE Searching for input: 405 TRT - VERBOSE Cast_294 [Cast] inputs: [405 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_294 for ONNX node: Cast_294 TRT - VERBOSE Registering tensor: 406 for ONNX tensor: 406 TRT - VERBOSE Cast_294 [Cast] outputs: [406 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_295 [Cast] TRT - VERBOSE Searching for input: 406 TRT - VERBOSE Cast_295 [Cast] inputs: [406 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_295 for ONNX node: Cast_295 TRT - VERBOSE Registering tensor: 407 for ONNX tensor: 407 TRT - VERBOSE Cast_295 [Cast] outputs: [407 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_296 [Cast] TRT - VERBOSE Searching for input: 364 TRT - VERBOSE Cast_296 [Cast] inputs: [364 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_296 for ONNX node: Cast_296 TRT - VERBOSE Registering tensor: 408 for ONNX tensor: 408 TRT - VERBOSE Cast_296 [Cast] outputs: [408 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_297 [And] TRT - VERBOSE Searching for input: 407 TRT - VERBOSE Searching for input: 408 TRT - VERBOSE And_297 [And] inputs: [407 -> (1, 1, -1, -1)[BOOL]], [408 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_297 for ONNX node: And_297 TRT - VERBOSE Registering tensor: 409 for ONNX tensor: 409 TRT - VERBOSE And_297 [And] outputs: [409 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_298 [Cast] TRT - VERBOSE Searching for input: 409 TRT - VERBOSE Cast_298 [Cast] inputs: [409 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_298 for ONNX node: Cast_298 TRT - VERBOSE Registering tensor: 410 for ONNX tensor: 410 TRT - VERBOSE Cast_298 [Cast] outputs: [410 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Pad_313 [Pad] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 432 TRT - VERBOSE Searching for input: 433 TRT - VERBOSE Pad_313 [Pad] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [432 -> (8)[INT32]], [433 -> ()[FLOAT]], TRT - VERBOSE Registering layer: Pad_313 for ONNX node: Pad_313 TRT - VERBOSE Registering tensor: 434 for ONNX tensor: 434 TRT - VERBOSE Pad_313 [Pad] outputs: [434 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_315 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 435 TRT - VERBOSE Conv_315 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [435 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_315 for ONNX node: Conv_315 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 436 for ONNX tensor: 436 TRT - VERBOSE Conv_315 [Conv] outputs: [436 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_317 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 437 TRT - VERBOSE Conv_317 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [437 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_317 for ONNX node: Conv_317 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 438 for ONNX tensor: 438 TRT - VERBOSE Conv_317 [Conv] outputs: [438 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_319 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 439 TRT - VERBOSE Conv_319 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [439 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_319 for ONNX node: Conv_319 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 440 for ONNX tensor: 440 TRT - VERBOSE Conv_319 [Conv] outputs: [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_320 [Mul] TRT - VERBOSE Searching for input: 436 TRT - VERBOSE Searching for input: 440 TRT - VERBOSE Mul_320 [Mul] inputs: [436 -> (-1, 1, -1, -1)[FLOAT]], [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_320 for ONNX node: Mul_320 TRT - VERBOSE Registering tensor: 441 for ONNX tensor: 441 TRT - VERBOSE Mul_320 [Mul] outputs: [441 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_321 [Mul] TRT - VERBOSE Searching for input: 438 TRT - VERBOSE Searching for input: 438 TRT - VERBOSE Mul_321 [Mul] inputs: [438 -> (-1, 1, -1, -1)[FLOAT]], [438 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_321 for ONNX node: Mul_321 TRT - VERBOSE Registering tensor: 442 for ONNX tensor: 442 TRT - VERBOSE Mul_321 [Mul] outputs: [442 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_322 [Sub] TRT - VERBOSE Searching for input: 441 TRT - VERBOSE Searching for input: 442 TRT - VERBOSE Sub_322 [Sub] inputs: [441 -> (-1, 1, -1, -1)[FLOAT]], [442 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_322 for ONNX node: Sub_322 TRT - VERBOSE Registering tensor: 443 for ONNX tensor: 443 TRT - VERBOSE Sub_322 [Sub] outputs: [443 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_323 [Add] TRT - VERBOSE Searching for input: 436 TRT - VERBOSE Searching for input: 440 TRT - VERBOSE Add_323 [Add] inputs: [436 -> (-1, 1, -1, -1)[FLOAT]], [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_323 for ONNX node: Add_323 TRT - VERBOSE Registering tensor: 444 for ONNX tensor: 444 TRT - VERBOSE Add_323 [Add] outputs: [444 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pow_325 [Pow] TRT - VERBOSE Searching for input: 444 TRT - VERBOSE Searching for input: 445 TRT - VERBOSE Pow_325 [Pow] inputs: [444 -> (-1, 1, -1, -1)[FLOAT]], [445 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 445 for ONNX node: 445 TRT - VERBOSE Registering layer: Pow_325 for ONNX node: Pow_325 TRT - VERBOSE Registering tensor: 446 for ONNX tensor: 446 TRT - VERBOSE Pow_325 [Pow] outputs: [446 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_326 [Div] TRT - VERBOSE Searching for input: 446 TRT - VERBOSE Searching for input: 443 TRT - VERBOSE Div_326 [Div] inputs: [446 -> (-1, 1, -1, -1)[FLOAT]], [443 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Div_326 for ONNX node: Div_326 TRT - VERBOSE Registering tensor: 447 for ONNX tensor: 447 TRT - VERBOSE Div_326 [Div] outputs: [447 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: LessOrEqual_328 [LessOrEqual] TRT - VERBOSE Searching for input: 447 TRT - VERBOSE Searching for input: 448 TRT - VERBOSE LessOrEqual_328 [LessOrEqual] inputs: [447 -> (-1, 1, -1, -1)[FLOAT]], [448 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 448 for ONNX node: 448 TRT - VERBOSE Registering layer: LessOrEqual_328 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering layer: LessOrEqual_328_97 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering layer: LessOrEqual_328_98 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering tensor: 449 for ONNX tensor: 449 TRT - VERBOSE LessOrEqual_328 [LessOrEqual] outputs: [449 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Greater_330 [Greater] TRT - VERBOSE Searching for input: 443 TRT - VERBOSE Searching for input: 450 TRT - VERBOSE Greater_330 [Greater] inputs: [443 -> (-1, 1, -1, -1)[FLOAT]], [450 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 450 for ONNX node: 450 TRT - VERBOSE Registering layer: Greater_330 for ONNX node: Greater_330 TRT - VERBOSE Registering tensor: 451 for ONNX tensor: 451 TRT - VERBOSE Greater_330 [Greater] outputs: [451 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_331 [Cast] TRT - VERBOSE Searching for input: 449 TRT - VERBOSE Cast_331 [Cast] inputs: [449 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_331 for ONNX node: Cast_331 TRT - VERBOSE Registering tensor: 452 for ONNX tensor: 452 TRT - VERBOSE Cast_331 [Cast] outputs: [452 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_332 [Cast] TRT - VERBOSE Searching for input: 451 TRT - VERBOSE Cast_332 [Cast] inputs: [451 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_332 for ONNX node: Cast_332 TRT - VERBOSE Registering tensor: 453 for ONNX tensor: 453 TRT - VERBOSE Cast_332 [Cast] outputs: [453 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_333 [And] TRT - VERBOSE Searching for input: 452 TRT - VERBOSE Searching for input: 453 TRT - VERBOSE And_333 [And] inputs: [452 -> (-1, 1, -1, -1)[BOOL]], [453 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_333 for ONNX node: And_333 TRT - VERBOSE Registering tensor: 454 for ONNX tensor: 454 TRT - VERBOSE And_333 [And] outputs: [454 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_334 [Cast] TRT - VERBOSE Searching for input: 454 TRT - VERBOSE Cast_334 [Cast] inputs: [454 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_334 for ONNX node: Cast_334 TRT - VERBOSE Registering tensor: 455 for ONNX tensor: 455 TRT - VERBOSE Cast_334 [Cast] outputs: [455 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_335 [Cast] TRT - VERBOSE Searching for input: 455 TRT - VERBOSE Cast_335 [Cast] inputs: [455 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_335 for ONNX node: Cast_335 TRT - VERBOSE Registering tensor: 456 for ONNX tensor: 456 TRT - VERBOSE Cast_335 [Cast] outputs: [456 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_336 [Cast] TRT - VERBOSE Searching for input: 410 TRT - VERBOSE Cast_336 [Cast] inputs: [410 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_336 for ONNX node: Cast_336 TRT - VERBOSE Registering tensor: 457 for ONNX tensor: 457 TRT - VERBOSE Cast_336 [Cast] outputs: [457 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_337 [And] TRT - VERBOSE Searching for input: 456 TRT - VERBOSE Searching for input: 457 TRT - VERBOSE And_337 [And] inputs: [456 -> (-1, 1, -1, -1)[BOOL]], [457 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_337 for ONNX node: And_337 TRT - VERBOSE Registering tensor: 458 for ONNX tensor: 458 TRT - VERBOSE And_337 [And] outputs: [458 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_338 [Cast] TRT - VERBOSE Searching for input: 458 TRT - VERBOSE Cast_338 [Cast] inputs: [458 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_338 for ONNX node: Cast_338 TRT - VERBOSE Registering tensor: 459 for ONNX tensor: 459 TRT - VERBOSE Cast_338 [Cast] outputs: [459 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_339 [Cast] TRT - VERBOSE Searching for input: 459 TRT - VERBOSE Cast_339 [Cast] inputs: [459 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_339 for ONNX node: Cast_339 TRT - VERBOSE Registering tensor: 460 for ONNX tensor: 460 TRT - VERBOSE Cast_339 [Cast] outputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Parsing node: Shape_340 [Shape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Shape_340 [Shape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Registering layer: Shape_340 for ONNX node: Shape_340 TRT - VERBOSE Registering tensor: 461 for ONNX tensor: 461 TRT - VERBOSE Shape_340 [Shape] outputs: [461 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_342 [Gather] TRT - VERBOSE Searching for input: 461 TRT - VERBOSE Searching for input: 462 TRT - VERBOSE Gather_342 [Gather] inputs: [461 -> (4)[INT32]], [462 -> ()[INT32]], TRT - VERBOSE Registering layer: 462 for ONNX node: 462 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_342 for ONNX node: Gather_342 TRT - VERBOSE Registering tensor: 463 for ONNX tensor: 463 TRT - VERBOSE Gather_342 [Gather] outputs: [463 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_343 [Shape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Shape_343 [Shape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Registering layer: Shape_343 for ONNX node: Shape_343 TRT - VERBOSE Registering tensor: 464 for ONNX tensor: 464 TRT - VERBOSE Shape_343 [Shape] outputs: [464 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_345 [Gather] TRT - VERBOSE Searching for input: 464 TRT - VERBOSE Searching for input: 465 TRT - VERBOSE Gather_345 [Gather] inputs: [464 -> (4)[INT32]], [465 -> ()[INT32]], TRT - VERBOSE Registering layer: 465 for ONNX node: 465 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_345 for ONNX node: Gather_345 TRT - VERBOSE Registering tensor: 466 for ONNX tensor: 466 TRT - VERBOSE Gather_345 [Gather] outputs: [466 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_347 [Unsqueeze] TRT - VERBOSE Searching for input: 463 TRT - VERBOSE Searching for input: 467 TRT - VERBOSE Unsqueeze_347 [Unsqueeze] inputs: [463 -> ()[INT32]], [467 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_347 for ONNX node: Unsqueeze_347 TRT - VERBOSE Registering tensor: 468 for ONNX tensor: 468 TRT - VERBOSE Unsqueeze_347 [Unsqueeze] outputs: [468 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_349 [Unsqueeze] TRT - VERBOSE Searching for input: 466 TRT - VERBOSE Searching for input: 469 TRT - VERBOSE Unsqueeze_349 [Unsqueeze] inputs: [466 -> ()[INT32]], [469 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_349 for ONNX node: Unsqueeze_349 TRT - VERBOSE Registering tensor: 470 for ONNX tensor: 470 TRT - VERBOSE Unsqueeze_349 [Unsqueeze] outputs: [470 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_350 [Concat] TRT - VERBOSE Searching for input: 468 TRT - VERBOSE Searching for input: 470 TRT - VERBOSE Concat_350 [Concat] inputs: [468 -> (1)[INT32]], [470 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_350 for ONNX node: Concat_350 TRT - VERBOSE Registering tensor: 471 for ONNX tensor: 471 TRT - VERBOSE Concat_350 [Concat] outputs: [471 -> (2)[INT32]], TRT - VERBOSE Parsing node: Reshape_351 [Reshape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Searching for input: 471 TRT - VERBOSE Reshape_351 [Reshape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], [471 -> (2)[INT32]], TRT - VERBOSE Registering layer: Reshape_351 for ONNX node: Reshape_351 TRT - VERBOSE Registering tensor: 472 for ONNX tensor: 472 TRT - VERBOSE Reshape_351 [Reshape] outputs: [472 -> (-1, -1)[INT32]], TRT - VERBOSE Parsing node: Shape_352 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_352 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_352 for ONNX node: Shape_352 TRT - VERBOSE Registering tensor: 473 for ONNX tensor: 473 TRT - VERBOSE Shape_352 [Shape] outputs: [473 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_354 [Gather] TRT - VERBOSE Searching for input: 473 TRT - VERBOSE Searching for input: 474 TRT - VERBOSE Gather_354 [Gather] inputs: [473 -> (4)[INT32]], [474 -> ()[INT32]], TRT - VERBOSE Registering layer: 474 for ONNX node: 474 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_354 for ONNX node: Gather_354 TRT - VERBOSE Registering tensor: 475 for ONNX tensor: 475 TRT - VERBOSE Gather_354 [Gather] outputs: [475 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_355 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_355 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_355 for ONNX node: Shape_355 TRT - VERBOSE Registering tensor: 476 for ONNX tensor: 476 TRT - VERBOSE Shape_355 [Shape] outputs: [476 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_357 [Gather] TRT - VERBOSE Searching for input: 476 TRT - VERBOSE Searching for input: 477 TRT - VERBOSE Gather_357 [Gather] inputs: [476 -> (4)[INT32]], [477 -> ()[INT32]], TRT - VERBOSE Registering layer: 477 for ONNX node: 477 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_357 for ONNX node: Gather_357 TRT - VERBOSE Registering tensor: 478 for ONNX tensor: 478 TRT - VERBOSE Gather_357 [Gather] outputs: [478 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_359 [Unsqueeze] TRT - VERBOSE Searching for input: 475 TRT - VERBOSE Searching for input: 479 TRT - VERBOSE Unsqueeze_359 [Unsqueeze] inputs: [475 -> ()[INT32]], [479 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_359 for ONNX node: Unsqueeze_359 TRT - VERBOSE Registering tensor: 480 for ONNX tensor: 480 TRT - VERBOSE Unsqueeze_359 [Unsqueeze] outputs: [480 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_361 [Unsqueeze] TRT - VERBOSE Searching for input: 478 TRT - VERBOSE Searching for input: 481 TRT - VERBOSE Unsqueeze_361 [Unsqueeze] inputs: [478 -> ()[INT32]], [481 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_361 for ONNX node: Unsqueeze_361 TRT - VERBOSE Registering tensor: 482 for ONNX tensor: 482 TRT - VERBOSE Unsqueeze_361 [Unsqueeze] outputs: [482 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_362 [Concat] TRT - VERBOSE Searching for input: 480 TRT - VERBOSE Searching for input: 482 TRT - VERBOSE Concat_362 [Concat] inputs: [480 -> (1)[INT32]], [482 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_362 for ONNX node: Concat_362 TRT - VERBOSE Registering tensor: 483 for ONNX tensor: 483 TRT - VERBOSE Concat_362 [Concat] outputs: [483 -> (2)[INT32]], TRT - VERBOSE Parsing node: Reshape_363 [Reshape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 483 TRT - VERBOSE Reshape_363 [Reshape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [483 -> (2)[INT32]], TRT - VERBOSE Registering layer: Reshape_363 for ONNX node: Reshape_363 TRT - VERBOSE Registering tensor: score_map_99 for ONNX tensor: score_map TRT - VERBOSE Reshape_363 [Reshape] outputs: [score_map -> (-1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_364 [Cast] TRT - VERBOSE Searching for input: 472 TRT - VERBOSE Cast_364 [Cast] inputs: [472 -> (-1, -1)[INT32]], TRT - VERBOSE Casting to type: float32 TRT - VERBOSE Registering layer: Cast_364 for ONNX node: Cast_364 TRT - VERBOSE Registering tensor: mask_100 for ONNX tensor: mask TRT - VERBOSE Cast_364 [Cast] outputs: [mask -> (-1, -1)[FLOAT]], TRT - VERBOSE Marking mask_100 as output: mask TRT - VERBOSE Marking score_map_99 as output: score_map TRT - VERBOSE Marking dense_feat_map_82 as output: dense_feat_map =============================================================== TensorRT model supports report: Model is fully supported by TRT and can be parsed by it! The model is fully supported by the TRT. Printing the parsed graph: subGraph index - 0 : Fully supported __________________________ Model Supported nodes: subGraphIndex NodeIndex Operator Support 0 0 0 Shape True 1 0 1 Gather True 4 0 4 Conv True 5 0 5 Relu True 7 0 7 BatchNormalization True 25 0 25 Unsqueeze True 28 0 28 Concat True 29 0 29 ConstantOfShape True 30 0 30 Div True 37 0 37 Mul True 42 0 42 Reshape True 43 0 43 Pad True 50 0 50 ReduceMean True 51 0 51 Sub True 52 0 52 Softplus True 56 0 56 ReduceMax True 61 0 61 Slice True 62 0 62 Cast True 64 0 64 Resize True 66 0 66 Add True 145 0 145 Greater True 146 0 146 MaxPool True 147 0 147 Equal True 150 0 150 And True 173 0 173 Pow True 175 0 175 LessOrEqual True Operator Unsqueeze 33 Shape 21 Cast 19 Gather 18 Concat 16 Conv 15 Mul 11 Sub 9 Reshape 8 Relu 8 Softplus 6 Pad 5 And 4 Div 4 Add 4 ReduceMax 3 ReduceMean 3 Resize 3 Slice 3 Greater 2 ConstantOfShape 2 BatchNormalization 2 LessOrEqual 1 Equal 1 Pow 1 MaxPool 1 Name: count, dtype: int64 __________________________ Model NotSupported nodes: Empty DataFrame Columns: [subGraphIndex, NodeIndex, Operator, Support, count] Index: [] Operator Unsqueeze 33 Shape 21 Cast 19 Gather 18 Concat 16 Conv 15 Mul 11 Sub 9 Reshape 8 Relu 8 Softplus 6 Pad 5 And 4 Div 4 Add 4 ReduceMax 3 ReduceMean 3 Resize 3 Slice 3 Greater 2 ConstantOfShape 2 BatchNormalization 2 LessOrEqual 1 Equal 1 Pow 1 MaxPool 1 Name: count, dtype: int64 __________________________ Model topology: subGraphIndex NodeIndex Operator Support count 0 0 0 Shape True 1 1 0 1 Gather True 1 2 0 2 Shape True 1 3 0 3 Gather True 1 4 0 4 Conv True 1 5 0 5 Relu True 1 6 0 6 Conv True 1 7 0 7 BatchNormalization True 1 8 0 8 Relu True 1 9 0 9 Conv True 1 10 0 10 Relu True 1 11 0 11 Conv True 1 12 0 12 BatchNormalization True 1 13 0 13 Relu True 1 14 0 14 Conv True 1 15 0 15 Relu True 1 16 0 16 Conv True 1 17 0 17 Relu True 1 18 0 18 Conv True 1 19 0 19 Relu True 1 20 0 20 Conv True 1 21 0 21 Relu True 1 22 0 22 Conv True 1 23 0 23 Shape True 1 24 0 24 Gather True 1 25 0 25 Unsqueeze True 1 26 0 26 Unsqueeze True 1 27 0 27 Unsqueeze True 1 28 0 28 Concat True 1 29 0 29 ConstantOfShape True 1 30 0 30 Div True 1 31 0 31 Shape True 1 32 0 32 Gather True 1 33 0 33 Shape True 1 34 0 34 Gather True 1 35 0 35 Shape True 1 36 0 36 Gather True 1 37 0 37 Mul True 1 38 0 38 Unsqueeze True 1 39 0 39 Unsqueeze True 1 40 0 40 Unsqueeze True 1 41 0 41 Concat True 1 42 0 42 Reshape True 1 43 0 43 Pad True 1 44 0 44 Conv True 1 45 0 45 Unsqueeze True 1 46 0 46 Unsqueeze True 1 47 0 47 Unsqueeze True 1 48 0 48 Concat True 1 49 0 49 Reshape True 1 50 0 50 ReduceMean True 1 51 0 51 Sub True 1 52 0 52 Softplus True 1 53 0 53 Sub True 1 54 0 54 Softplus True 1 55 0 55 Mul True 1 56 0 56 ReduceMax True 1 57 0 57 Unsqueeze True 1 58 0 58 Unsqueeze True 1 59 0 59 Concat True 1 60 0 60 Shape True 1 61 0 61 Slice True 1 62 0 62 Cast True 1 63 0 63 Concat True 1 64 0 64 Resize True 1 65 0 65 Mul True 1 66 0 66 Add True 1 67 0 67 Div True 1 68 0 68 Shape True 1 69 0 69 Gather True 1 70 0 70 Shape True 1 71 0 71 Gather True 1 72 0 72 Shape True 1 73 0 73 Gather True 1 74 0 74 Mul True 1 75 0 75 Unsqueeze True 1 76 0 76 Unsqueeze True 1 77 0 77 Unsqueeze True 1 78 0 78 Concat True 1 79 0 79 Reshape True 1 80 0 80 Pad True 1 81 0 81 Conv True 1 82 0 82 Unsqueeze True 1 83 0 83 Unsqueeze True 1 84 0 84 Unsqueeze True 1 85 0 85 Concat True 1 86 0 86 Reshape True 1 87 0 87 ReduceMean True 1 88 0 88 Sub True 1 89 0 89 Softplus True 1 90 0 90 Sub True 1 91 0 91 Softplus True 1 92 0 92 Mul True 1 93 0 93 ReduceMax True 1 94 0 94 Unsqueeze True 1 95 0 95 Unsqueeze True 1 96 0 96 Concat True 1 97 0 97 Shape True 1 98 0 98 Slice True 1 99 0 99 Cast True 1 100 0 100 Concat True 1 101 0 101 Resize True 1 102 0 102 Mul True 1 103 0 103 Add True 1 104 0 104 Div True 1 105 0 105 Shape True 1 106 0 106 Gather True 1 107 0 107 Shape True 1 108 0 108 Gather True 1 109 0 109 Shape True 1 110 0 110 Gather True 1 111 0 111 Mul True 1 112 0 112 Unsqueeze True 1 113 0 113 Unsqueeze True 1 114 0 114 Unsqueeze True 1 115 0 115 Concat True 1 116 0 116 Reshape True 1 117 0 117 Pad True 1 118 0 118 Conv True 1 119 0 119 Unsqueeze True 1 120 0 120 Unsqueeze True 1 121 0 121 Unsqueeze True 1 122 0 122 Concat True 1 123 0 123 Reshape True 1 124 0 124 ReduceMean True 1 125 0 125 Sub True 1 126 0 126 Softplus True 1 127 0 127 Sub True 1 128 0 128 Softplus True 1 129 0 129 Mul True 1 130 0 130 ReduceMax True 1 131 0 131 Unsqueeze True 1 132 0 132 Unsqueeze True 1 133 0 133 Concat True 1 134 0 134 Shape True 1 135 0 135 Slice True 1 136 0 136 Cast True 1 137 0 137 Concat True 1 138 0 138 Resize True 1 139 0 139 Mul True 1 140 0 140 Add True 1 141 0 141 Shape True 1 142 0 142 Gather True 1 143 0 143 Shape True 1 144 0 144 Gather True 1 145 0 145 Greater True 1 146 0 146 MaxPool True 1 147 0 147 Equal True 1 148 0 148 Cast True 1 149 0 149 Cast True 1 150 0 150 And True 1 151 0 151 Cast True 1 152 0 152 Sub True 1 153 0 153 Sub True 1 154 0 154 Unsqueeze True 1 155 0 155 Unsqueeze True 1 156 0 156 Concat True 1 157 0 157 ConstantOfShape True 1 158 0 158 Cast True 1 159 0 159 Pad True 1 160 0 160 Cast True 1 161 0 161 Cast True 1 162 0 162 Cast True 1 163 0 163 And True 1 164 0 164 Cast True 1 165 0 165 Pad True 1 166 0 166 Conv True 1 167 0 167 Conv True 1 168 0 168 Conv True 1 169 0 169 Mul True 1 170 0 170 Mul True 1 171 0 171 Sub True 1 172 0 172 Add True 1 173 0 173 Pow True 1 174 0 174 Div True 1 175 0 175 LessOrEqual True 1 176 0 176 Greater True 1 177 0 177 Cast True 1 178 0 178 Cast True 1 179 0 179 And True 1 180 0 180 Cast True 1 181 0 181 Cast True 1 182 0 182 Cast True 1 183 0 183 And True 1 184 0 184 Cast True 1 185 0 185 Cast True 1 186 0 186 Shape True 1 187 0 187 Gather True 1 188 0 188 Shape True 1 189 0 189 Gather True 1 190 0 190 Unsqueeze True 1 191 0 191 Unsqueeze True 1 192 0 192 Concat True 1 193 0 193 Reshape True 1 194 0 194 Shape True 1 195 0 195 Gather True 1 196 0 196 Shape True 1 197 0 197 Gather True 1 198 0 198 Unsqueeze True 1 199 0 199 Unsqueeze True 1 200 0 200 Concat True 1 201 0 201 Reshape True 1 202 0 202 Cast True 1 =============================================================== Beginning Onnx file parsing TRT - INFO ---------------------------------------------------------------- TRT - INFO Input filename: c:/AAG/HPC/Apps/OKZB/ASLtorch/model_rand_weights_folded.onnx TRT - INFO ONNX IR version: 0.0.7 TRT - INFO Opset version: 13 TRT - INFO Producer name: TRT - INFO Producer version: TRT - INFO Domain: TRT - INFO Model version: 0 TRT - INFO Doc string: TRT - INFO ---------------------------------------------------------------- TRT - VERBOSE Plugin creator already registered - ::GridAnchor_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::GridAnchorRect_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::NMS_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Reorg_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Region_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Clip_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::LReLU_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::PriorBox_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Normalize_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::ScatterND version 1 TRT - VERBOSE Plugin creator already registered - ::RPROI_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::BatchedNMS_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::BatchedNMSDynamic_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::FlattenConcat_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::CropAndResize version 1 TRT - VERBOSE Plugin creator already registered - ::DetectionLayer_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::EfficientNMS_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::EfficientNMS_ONNX_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::EfficientNMS_Explicit_TF_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::EfficientNMS_Implicit_TF_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Proposal version 1 TRT - VERBOSE Plugin creator already registered - ::ProposalLayer_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::PyramidROIAlign_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::ResizeNearest_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::Split version 1 TRT - VERBOSE Plugin creator already registered - ::SpecialSlice_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::InstanceNormalization_TRT version 1 TRT - VERBOSE Plugin creator already registered - ::InstanceNormalization_TRT version 2 TRT - VERBOSE Adding network input: input with dtype: float32, dimensions: (-1, -1, -1, -1) TRT - VERBOSE Registering tensor: input for ONNX tensor: input TRT - VERBOSE Importing initializer: 35 TRT - VERBOSE Importing initializer: 38 TRT - VERBOSE Importing initializer: 487 TRT - VERBOSE Importing initializer: 488 TRT - VERBOSE Importing initializer: descriptor_map.layer_01.3.weight TRT - VERBOSE Importing initializer: 46 TRT - VERBOSE Importing initializer: 47 TRT - VERBOSE Importing initializer: descriptor_map.layer_23.0.running_mean TRT - VERBOSE Importing initializer: descriptor_map.layer_23.0.running_var TRT - VERBOSE Importing initializer: 490 TRT - VERBOSE Importing initializer: 491 TRT - VERBOSE Importing initializer: descriptor_map.layer_23.5.weight TRT - VERBOSE Importing initializer: 56 TRT - VERBOSE Importing initializer: 57 TRT - VERBOSE Importing initializer: descriptor_map.layer_45.0.running_mean TRT - VERBOSE Importing initializer: descriptor_map.layer_45.0.running_var TRT - VERBOSE Importing initializer: 493 TRT - VERBOSE Importing initializer: 494 TRT - VERBOSE Importing initializer: 496 TRT - VERBOSE Importing initializer: 497 TRT - VERBOSE Importing initializer: 499 TRT - VERBOSE Importing initializer: 500 TRT - VERBOSE Importing initializer: 502 TRT - VERBOSE Importing initializer: 503 TRT - VERBOSE Importing initializer: descriptor_map.layer_678.6.weight TRT - VERBOSE Importing initializer: 82 TRT - VERBOSE Importing initializer: 85 TRT - VERBOSE Importing initializer: 89 TRT - VERBOSE Importing initializer: 91 TRT - VERBOSE Importing initializer: 504 TRT - VERBOSE Importing initializer: 95 TRT - VERBOSE Importing initializer: 98 TRT - VERBOSE Importing initializer: 104 TRT - VERBOSE Importing initializer: 107 TRT - VERBOSE Importing initializer: 102 TRT - VERBOSE Importing initializer: 111 TRT - VERBOSE Importing initializer: 115 TRT - VERBOSE Importing initializer: 117 TRT - VERBOSE Importing initializer: 505 TRT - VERBOSE Importing initializer: 142 TRT - VERBOSE Importing initializer: 144 TRT - VERBOSE Importing initializer: 146 TRT - VERBOSE Importing initializer: 150 TRT - VERBOSE Importing initializer: 152 TRT - VERBOSE Importing initializer: 149 TRT - VERBOSE Importing initializer: 163 TRT - VERBOSE Importing initializer: 165 TRT - VERBOSE Importing initializer: 170 TRT - VERBOSE Importing initializer: 171 TRT - VERBOSE Importing initializer: 169 TRT - VERBOSE Importing initializer: 178 TRT - VERBOSE Importing initializer: 181 TRT - VERBOSE Importing initializer: 184 TRT - VERBOSE Importing initializer: 190 TRT - VERBOSE Importing initializer: 193 TRT - VERBOSE Importing initializer: 188 TRT - VERBOSE Importing initializer: 197 TRT - VERBOSE Importing initializer: 201 TRT - VERBOSE Importing initializer: 203 TRT - VERBOSE Importing initializer: 511 TRT - VERBOSE Importing initializer: 228 TRT - VERBOSE Importing initializer: 231 TRT - VERBOSE Importing initializer: 235 TRT - VERBOSE Importing initializer: 237 TRT - VERBOSE Importing initializer: 234 TRT - VERBOSE Importing initializer: 248 TRT - VERBOSE Importing initializer: 250 TRT - VERBOSE Importing initializer: 255 TRT - VERBOSE Importing initializer: 256 TRT - VERBOSE Importing initializer: 254 TRT - VERBOSE Importing initializer: 263 TRT - VERBOSE Importing initializer: 266 TRT - VERBOSE Importing initializer: 269 TRT - VERBOSE Importing initializer: 275 TRT - VERBOSE Importing initializer: 278 TRT - VERBOSE Importing initializer: 273 TRT - VERBOSE Importing initializer: 282 TRT - VERBOSE Importing initializer: 286 TRT - VERBOSE Importing initializer: 288 TRT - VERBOSE Importing initializer: 517 TRT - VERBOSE Importing initializer: 313 TRT - VERBOSE Importing initializer: 316 TRT - VERBOSE Importing initializer: 320 TRT - VERBOSE Importing initializer: 322 TRT - VERBOSE Importing initializer: 319 TRT - VERBOSE Importing initializer: 333 TRT - VERBOSE Importing initializer: 335 TRT - VERBOSE Importing initializer: 340 TRT - VERBOSE Importing initializer: 341 TRT - VERBOSE Importing initializer: 339 TRT - VERBOSE Importing initializer: 348 TRT - VERBOSE Importing initializer: 352 TRT - VERBOSE Importing initializer: 355 TRT - VERBOSE Importing initializer: 357 TRT - VERBOSE Importing initializer: 365 TRT - VERBOSE Importing initializer: 367 TRT - VERBOSE Importing initializer: 375 TRT - VERBOSE Importing initializer: 377 TRT - VERBOSE Importing initializer: 523 TRT - VERBOSE Importing initializer: 524 TRT - VERBOSE Importing initializer: 403 TRT - VERBOSE Importing initializer: 404 TRT - VERBOSE Importing initializer: 432 TRT - VERBOSE Importing initializer: 433 TRT - VERBOSE Importing initializer: 435 TRT - VERBOSE Importing initializer: 437 TRT - VERBOSE Importing initializer: 439 TRT - VERBOSE Importing initializer: 445 TRT - VERBOSE Importing initializer: 448 TRT - VERBOSE Importing initializer: 450 TRT - VERBOSE Importing initializer: 462 TRT - VERBOSE Importing initializer: 465 TRT - VERBOSE Importing initializer: 467 TRT - VERBOSE Importing initializer: 469 TRT - VERBOSE Importing initializer: 474 TRT - VERBOSE Importing initializer: 477 TRT - VERBOSE Importing initializer: 479 TRT - VERBOSE Importing initializer: 481 TRT - VERBOSE Parsing node: Shape_0 [Shape] TRT - VERBOSE Searching for input: input TRT - VERBOSE Shape_0 [Shape] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_0 for ONNX node: Shape_0 TRT - VERBOSE Registering tensor: 34 for ONNX tensor: 34 TRT - VERBOSE Shape_0 [Shape] outputs: [34 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_2 [Gather] TRT - VERBOSE Searching for input: 34 TRT - VERBOSE Searching for input: 35 TRT - VERBOSE Gather_2 [Gather] inputs: [34 -> (4)[INT32]], [35 -> ()[INT32]], TRT - VERBOSE Registering layer: 35 for ONNX node: 35 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_2 for ONNX node: Gather_2 TRT - VERBOSE Registering tensor: 36 for ONNX tensor: 36 TRT - VERBOSE Gather_2 [Gather] outputs: [36 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_3 [Shape] TRT - VERBOSE Searching for input: input TRT - VERBOSE Shape_3 [Shape] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_3 for ONNX node: Shape_3 TRT - VERBOSE Registering tensor: 37 for ONNX tensor: 37 TRT - VERBOSE Shape_3 [Shape] outputs: [37 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_5 [Gather] TRT - VERBOSE Searching for input: 37 TRT - VERBOSE Searching for input: 38 TRT - VERBOSE Gather_5 [Gather] inputs: [37 -> (4)[INT32]], [38 -> ()[INT32]], TRT - VERBOSE Registering layer: 38 for ONNX node: 38 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_5 for ONNX node: Gather_5 TRT - VERBOSE Registering tensor: 39 for ONNX tensor: 39 TRT - VERBOSE Gather_5 [Gather] outputs: [39 -> ()[INT32]], TRT - VERBOSE Parsing node: Conv_6 [Conv] TRT - VERBOSE Searching for input: input TRT - VERBOSE Searching for input: 487 TRT - VERBOSE Searching for input: 488 TRT - VERBOSE Conv_6 [Conv] inputs: [input -> (-1, -1, -1, -1)[FLOAT]], [487 -> (32, 3, 3, 3)[FLOAT]], [488 -> (32)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, -1, -1, -1) TRT - VERBOSE Registering layer: Conv_6 for ONNX node: Conv_6 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 32 TRT - VERBOSE Convolution output dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering tensor: 486 for ONNX tensor: 486 TRT - VERBOSE Conv_6 [Conv] outputs: [486 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_7 [Relu] TRT - VERBOSE Searching for input: 486 TRT - VERBOSE Relu_7 [Relu] inputs: [486 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_7 for ONNX node: Relu_7 TRT - VERBOSE Registering tensor: 44 for ONNX tensor: 44 TRT - VERBOSE Relu_7 [Relu] outputs: [44 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_8 [Conv] TRT - VERBOSE Searching for input: 44 TRT - VERBOSE Searching for input: descriptor_map.layer_01.3.weight TRT - VERBOSE Conv_8 [Conv] inputs: [44 -> (-1, 32, -1, -1)[FLOAT]], [descriptor_map.layer_01.3.weight -> (32, 32, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering layer: Conv_8 for ONNX node: Conv_8 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 32 TRT - VERBOSE Convolution output dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering tensor: 45 for ONNX tensor: 45 TRT - VERBOSE Conv_8 [Conv] outputs: [45 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: BatchNormalization_11 [BatchNormalization] TRT - VERBOSE Searching for input: 45 TRT - VERBOSE Searching for input: 46 TRT - VERBOSE Searching for input: 47 TRT - VERBOSE Searching for input: descriptor_map.layer_23.0.running_mean TRT - VERBOSE Searching for input: descriptor_map.layer_23.0.running_var TRT - VERBOSE BatchNormalization_11 [BatchNormalization] inputs: [45 -> (-1, 32, -1, -1)[FLOAT]], [46 -> (32)[FLOAT]], [47 -> (32)[FLOAT]], [descriptor_map.layer_23.0.running_mean -> (32)[FLOAT]], [descriptor_map.layer_23.0.running_var -> (32)[FLOAT]], TRT - VERBOSE Registering layer: BatchNormalization_11 for ONNX node: BatchNormalization_11 TRT - VERBOSE Registering tensor: 48 for ONNX tensor: 48 TRT - VERBOSE BatchNormalization_11 [BatchNormalization] outputs: [48 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_12 [Relu] TRT - VERBOSE Searching for input: 48 TRT - VERBOSE Relu_12 [Relu] inputs: [48 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_12 for ONNX node: Relu_12 TRT - VERBOSE Registering tensor: 49 for ONNX tensor: 49 TRT - VERBOSE Relu_12 [Relu] outputs: [49 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_13 [Conv] TRT - VERBOSE Searching for input: 49 TRT - VERBOSE Searching for input: 490 TRT - VERBOSE Searching for input: 491 TRT - VERBOSE Conv_13 [Conv] inputs: [49 -> (-1, 32, -1, -1)[FLOAT]], [490 -> (64, 32, 3, 3)[FLOAT]], [491 -> (64)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 32, -1, -1) TRT - VERBOSE Registering layer: Conv_13 for ONNX node: Conv_13 TRT - VERBOSE Using kernel: (3, 3), strides: (2, 2), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 64 TRT - VERBOSE Convolution output dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering tensor: 489 for ONNX tensor: 489 TRT - VERBOSE Conv_13 [Conv] outputs: [489 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_14 [Relu] TRT - VERBOSE Searching for input: 489 TRT - VERBOSE Relu_14 [Relu] inputs: [489 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_14 for ONNX node: Relu_14 TRT - VERBOSE Registering tensor: 54 for ONNX tensor: 54 TRT - VERBOSE Relu_14 [Relu] outputs: [54 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_15 [Conv] TRT - VERBOSE Searching for input: 54 TRT - VERBOSE Searching for input: descriptor_map.layer_23.5.weight TRT - VERBOSE Conv_15 [Conv] inputs: [54 -> (-1, 64, -1, -1)[FLOAT]], [descriptor_map.layer_23.5.weight -> (64, 64, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering layer: Conv_15 for ONNX node: Conv_15 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 64 TRT - VERBOSE Convolution output dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering tensor: 55 for ONNX tensor: 55 TRT - VERBOSE Conv_15 [Conv] outputs: [55 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: BatchNormalization_18 [BatchNormalization] TRT - VERBOSE Searching for input: 55 TRT - VERBOSE Searching for input: 56 TRT - VERBOSE Searching for input: 57 TRT - VERBOSE Searching for input: descriptor_map.layer_45.0.running_mean TRT - VERBOSE Searching for input: descriptor_map.layer_45.0.running_var TRT - VERBOSE BatchNormalization_18 [BatchNormalization] inputs: [55 -> (-1, 64, -1, -1)[FLOAT]], [56 -> (64)[FLOAT]], [57 -> (64)[FLOAT]], [descriptor_map.layer_45.0.running_mean -> (64)[FLOAT]], [descriptor_map.layer_45.0.running_var -> (64)[FLOAT]], TRT - VERBOSE Registering layer: BatchNormalization_18 for ONNX node: BatchNormalization_18 TRT - VERBOSE Registering tensor: 58 for ONNX tensor: 58 TRT - VERBOSE BatchNormalization_18 [BatchNormalization] outputs: [58 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_19 [Relu] TRT - VERBOSE Searching for input: 58 TRT - VERBOSE Relu_19 [Relu] inputs: [58 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_19 for ONNX node: Relu_19 TRT - VERBOSE Registering tensor: 59 for ONNX tensor: 59 TRT - VERBOSE Relu_19 [Relu] outputs: [59 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_20 [Conv] TRT - VERBOSE Searching for input: 59 TRT - VERBOSE Searching for input: 493 TRT - VERBOSE Searching for input: 494 TRT - VERBOSE Conv_20 [Conv] inputs: [59 -> (-1, 64, -1, -1)[FLOAT]], [493 -> (128, 64, 3, 3)[FLOAT]], [494 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 64, -1, -1) TRT - VERBOSE Registering layer: Conv_20 for ONNX node: Conv_20 TRT - VERBOSE Using kernel: (3, 3), strides: (2, 2), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 492 for ONNX tensor: 492 TRT - VERBOSE Conv_20 [Conv] outputs: [492 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_21 [Relu] TRT - VERBOSE Searching for input: 492 TRT - VERBOSE Relu_21 [Relu] inputs: [492 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_21 for ONNX node: Relu_21 TRT - VERBOSE Registering tensor: 64 for ONNX tensor: 64 TRT - VERBOSE Relu_21 [Relu] outputs: [64 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_22 [Conv] TRT - VERBOSE Searching for input: 64 TRT - VERBOSE Searching for input: 496 TRT - VERBOSE Searching for input: 497 TRT - VERBOSE Conv_22 [Conv] inputs: [64 -> (-1, 128, -1, -1)[FLOAT]], [496 -> (128, 128, 3, 3)[FLOAT]], [497 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_22 for ONNX node: Conv_22 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 495 for ONNX tensor: 495 TRT - VERBOSE Conv_22 [Conv] outputs: [495 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_23 [Relu] TRT - VERBOSE Searching for input: 495 TRT - VERBOSE Relu_23 [Relu] inputs: [495 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_23 for ONNX node: Relu_23 TRT - VERBOSE Registering tensor: 69 for ONNX tensor: 69 TRT - VERBOSE Relu_23 [Relu] outputs: [69 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_24 [Conv] TRT - VERBOSE Searching for input: 69 TRT - VERBOSE Searching for input: 499 TRT - VERBOSE Searching for input: 500 TRT - VERBOSE Conv_24 [Conv] inputs: [69 -> (-1, 128, -1, -1)[FLOAT]], [499 -> (128, 128, 3, 3)[FLOAT]], [500 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_24 for ONNX node: Conv_24 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 498 for ONNX tensor: 498 TRT - VERBOSE Conv_24 [Conv] outputs: [498 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_25 [Relu] TRT - VERBOSE Searching for input: 498 TRT - VERBOSE Relu_25 [Relu] inputs: [498 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_25 for ONNX node: Relu_25 TRT - VERBOSE Registering tensor: 74 for ONNX tensor: 74 TRT - VERBOSE Relu_25 [Relu] outputs: [74 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_26 [Conv] TRT - VERBOSE Searching for input: 74 TRT - VERBOSE Searching for input: 502 TRT - VERBOSE Searching for input: 503 TRT - VERBOSE Conv_26 [Conv] inputs: [74 -> (-1, 128, -1, -1)[FLOAT]], [502 -> (128, 128, 3, 3)[FLOAT]], [503 -> (128)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_26 for ONNX node: Conv_26 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: 501 for ONNX tensor: 501 TRT - VERBOSE Conv_26 [Conv] outputs: [501 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Relu_27 [Relu] TRT - VERBOSE Searching for input: 501 TRT - VERBOSE Relu_27 [Relu] inputs: [501 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Relu_27 for ONNX node: Relu_27 TRT - VERBOSE Registering tensor: 79 for ONNX tensor: 79 TRT - VERBOSE Relu_27 [Relu] outputs: [79 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_28 [Conv] TRT - VERBOSE Searching for input: 79 TRT - VERBOSE Searching for input: descriptor_map.layer_678.6.weight TRT - VERBOSE Conv_28 [Conv] inputs: [79 -> (-1, 128, -1, -1)[FLOAT]], [descriptor_map.layer_678.6.weight -> (128, 128, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering layer: Conv_28 for ONNX node: Conv_28 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (1, 1), postpadding: (1, 1), dilations: (1, 1), numOutputs: 128 TRT - VERBOSE Convolution output dimensions: (-1, 128, -1, -1) TRT - VERBOSE Registering tensor: dense_feat_map_82 for ONNX tensor: dense_feat_map TRT - VERBOSE Conv_28 [Conv] outputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_29 [Shape] TRT - VERBOSE Searching for input: dense_feat_map TRT - VERBOSE Shape_29 [Shape] inputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_29 for ONNX node: Shape_29 TRT - VERBOSE Registering tensor: 81 for ONNX tensor: 81 TRT - VERBOSE Shape_29 [Shape] outputs: [81 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_31 [Gather] TRT - VERBOSE Searching for input: 81 TRT - VERBOSE Searching for input: 82 TRT - VERBOSE Gather_31 [Gather] inputs: [81 -> (4)[INT32]], [82 -> ()[INT32]], TRT - VERBOSE Registering layer: 82 for ONNX node: 82 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_31 for ONNX node: Gather_31 TRT - VERBOSE Registering tensor: 83 for ONNX tensor: 83 TRT - VERBOSE Gather_31 [Gather] outputs: [83 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_33 [Unsqueeze] TRT - VERBOSE Searching for input: 83 TRT - VERBOSE Searching for input: 85 TRT - VERBOSE Unsqueeze_33 [Unsqueeze] inputs: [83 -> ()[INT32]], [85 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_33 for ONNX node: Unsqueeze_33 TRT - VERBOSE Registering tensor: 86 for ONNX tensor: 86 TRT - VERBOSE Unsqueeze_33 [Unsqueeze] outputs: [86 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_35 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 89 TRT - VERBOSE Unsqueeze_35 [Unsqueeze] inputs: [36 -> ()[INT32]], [89 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_35 for ONNX node: Unsqueeze_35 TRT - VERBOSE Registering tensor: 90 for ONNX tensor: 90 TRT - VERBOSE Unsqueeze_35 [Unsqueeze] outputs: [90 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_37 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 91 TRT - VERBOSE Unsqueeze_37 [Unsqueeze] inputs: [39 -> ()[INT32]], [91 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_37 for ONNX node: Unsqueeze_37 TRT - VERBOSE Registering tensor: 92 for ONNX tensor: 92 TRT - VERBOSE Unsqueeze_37 [Unsqueeze] outputs: [92 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_38 [Concat] TRT - VERBOSE Searching for input: 86 TRT - VERBOSE Searching for input: 504 TRT - VERBOSE Searching for input: 90 TRT - VERBOSE Searching for input: 92 TRT - VERBOSE Concat_38 [Concat] inputs: [86 -> (1)[INT32]], [504 -> (1)[INT32]], [90 -> (1)[INT32]], [92 -> (1)[INT32]], TRT - VERBOSE Registering layer: 504 for ONNX node: 504 TRT - VERBOSE Registering layer: Concat_38 for ONNX node: Concat_38 TRT - VERBOSE Registering tensor: 93 for ONNX tensor: 93 TRT - VERBOSE Concat_38 [Concat] outputs: [93 -> (4)[INT32]], TRT - VERBOSE Parsing node: ConstantOfShape_39 [ConstantOfShape] TRT - VERBOSE Searching for input: 93 TRT - VERBOSE ConstantOfShape_39 [ConstantOfShape] inputs: [93 -> (4)[INT32]], TRT - VERBOSE Registering layer: ConstantOfShape_39 for ONNX node: ConstantOfShape_39 TRT - VERBOSE Registering tensor: 94 for ONNX tensor: 94 TRT - VERBOSE ConstantOfShape_39 [ConstantOfShape] outputs: [94 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_41 [Div] TRT - VERBOSE Searching for input: 45 TRT - VERBOSE Searching for input: 95 TRT - VERBOSE Div_41 [Div] inputs: [45 -> (-1, 32, -1, -1)[FLOAT]], [95 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 95 for ONNX node: 95 TRT - VERBOSE Registering layer: Div_41 for ONNX node: Div_41 TRT - VERBOSE Registering tensor: 96 for ONNX tensor: 96 TRT - VERBOSE Div_41 [Div] outputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_42 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_42 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_42 for ONNX node: Shape_42 TRT - VERBOSE Registering tensor: 97 for ONNX tensor: 97 TRT - VERBOSE Shape_42 [Shape] outputs: [97 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_44 [Gather] TRT - VERBOSE Searching for input: 97 TRT - VERBOSE Searching for input: 98 TRT - VERBOSE Gather_44 [Gather] inputs: [97 -> (4)[INT32]], [98 -> ()[INT32]], TRT - VERBOSE Registering layer: 98 for ONNX node: 98 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_44 for ONNX node: Gather_44 TRT - VERBOSE Registering tensor: 99 for ONNX tensor: 99 TRT - VERBOSE Gather_44 [Gather] outputs: [99 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_48 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_48 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_48 for ONNX node: Shape_48 TRT - VERBOSE Registering tensor: 103 for ONNX tensor: 103 TRT - VERBOSE Shape_48 [Shape] outputs: [103 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_50 [Gather] TRT - VERBOSE Searching for input: 103 TRT - VERBOSE Searching for input: 104 TRT - VERBOSE Gather_50 [Gather] inputs: [103 -> (4)[INT32]], [104 -> ()[INT32]], TRT - VERBOSE Registering layer: 104 for ONNX node: 104 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_50 for ONNX node: Gather_50 TRT - VERBOSE Registering tensor: 105 for ONNX tensor: 105 TRT - VERBOSE Gather_50 [Gather] outputs: [105 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_51 [Shape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Shape_51 [Shape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_51 for ONNX node: Shape_51 TRT - VERBOSE Registering tensor: 106 for ONNX tensor: 106 TRT - VERBOSE Shape_51 [Shape] outputs: [106 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_53 [Gather] TRT - VERBOSE Searching for input: 106 TRT - VERBOSE Searching for input: 107 TRT - VERBOSE Gather_53 [Gather] inputs: [106 -> (4)[INT32]], [107 -> ()[INT32]], TRT - VERBOSE Registering layer: 107 for ONNX node: 107 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_53 for ONNX node: Gather_53 TRT - VERBOSE Registering tensor: 108 for ONNX tensor: 108 TRT - VERBOSE Gather_53 [Gather] outputs: [108 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_54 [Mul] TRT - VERBOSE Searching for input: 99 TRT - VERBOSE Searching for input: 102 TRT - VERBOSE Mul_54 [Mul] inputs: [99 -> ()[INT32]], [102 -> ()[INT32]], TRT - VERBOSE Registering layer: 102 for ONNX node: 102 TRT - VERBOSE Registering layer: Mul_54 for ONNX node: Mul_54 TRT - VERBOSE Registering tensor: 109 for ONNX tensor: 109 TRT - VERBOSE Mul_54 [Mul] outputs: [109 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_56 [Unsqueeze] TRT - VERBOSE Searching for input: 109 TRT - VERBOSE Searching for input: 111 TRT - VERBOSE Unsqueeze_56 [Unsqueeze] inputs: [109 -> ()[INT32]], [111 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_56 for ONNX node: Unsqueeze_56 TRT - VERBOSE Registering tensor: 112 for ONNX tensor: 112 TRT - VERBOSE Unsqueeze_56 [Unsqueeze] outputs: [112 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_58 [Unsqueeze] TRT - VERBOSE Searching for input: 105 TRT - VERBOSE Searching for input: 115 TRT - VERBOSE Unsqueeze_58 [Unsqueeze] inputs: [105 -> ()[INT32]], [115 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_58 for ONNX node: Unsqueeze_58 TRT - VERBOSE Registering tensor: 116 for ONNX tensor: 116 TRT - VERBOSE Unsqueeze_58 [Unsqueeze] outputs: [116 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_60 [Unsqueeze] TRT - VERBOSE Searching for input: 108 TRT - VERBOSE Searching for input: 117 TRT - VERBOSE Unsqueeze_60 [Unsqueeze] inputs: [108 -> ()[INT32]], [117 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_60 for ONNX node: Unsqueeze_60 TRT - VERBOSE Registering tensor: 118 for ONNX tensor: 118 TRT - VERBOSE Unsqueeze_60 [Unsqueeze] outputs: [118 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_61 [Concat] TRT - VERBOSE Searching for input: 112 TRT - VERBOSE Searching for input: 505 TRT - VERBOSE Searching for input: 116 TRT - VERBOSE Searching for input: 118 TRT - VERBOSE Concat_61 [Concat] inputs: [112 -> (1)[INT32]], [505 -> (1)[INT32]], [116 -> (1)[INT32]], [118 -> (1)[INT32]], TRT - VERBOSE Registering layer: 505 for ONNX node: 505 TRT - VERBOSE Registering layer: Concat_61 for ONNX node: Concat_61 TRT - VERBOSE Registering tensor: 119 for ONNX tensor: 119 TRT - VERBOSE Concat_61 [Concat] outputs: [119 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_62 [Reshape] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 119 TRT - VERBOSE Reshape_62 [Reshape] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [119 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_62 for ONNX node: Reshape_62 TRT - VERBOSE Registering tensor: 120 for ONNX tensor: 120 TRT - VERBOSE Reshape_62 [Reshape] outputs: [120 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_76 [Pad] TRT - VERBOSE Searching for input: 120 TRT - VERBOSE Searching for input: 142 TRT - VERBOSE Pad_76 [Pad] inputs: [120 -> (-1, 1, -1, -1)[FLOAT]], [142 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_76 for ONNX node: Pad_76 TRT - VERBOSE Registering tensor: 143 for ONNX tensor: 143 TRT - VERBOSE Pad_76 [Pad] outputs: [143 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_78 [Conv] TRT - VERBOSE Searching for input: 143 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_78 [Conv] inputs: [143 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_78 for ONNX node: Conv_78 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 145 for ONNX tensor: 145 TRT - VERBOSE Conv_78 [Conv] outputs: [145 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_80 [Unsqueeze] TRT - VERBOSE Searching for input: 99 TRT - VERBOSE Searching for input: 146 TRT - VERBOSE Unsqueeze_80 [Unsqueeze] inputs: [99 -> ()[INT32]], [146 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_80 for ONNX node: Unsqueeze_80 TRT - VERBOSE Registering tensor: 147 for ONNX tensor: 147 TRT - VERBOSE Unsqueeze_80 [Unsqueeze] outputs: [147 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_84 [Unsqueeze] TRT - VERBOSE Searching for input: 105 TRT - VERBOSE Searching for input: 150 TRT - VERBOSE Unsqueeze_84 [Unsqueeze] inputs: [105 -> ()[INT32]], [150 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_84 for ONNX node: Unsqueeze_84 TRT - VERBOSE Registering tensor: 151 for ONNX tensor: 151 TRT - VERBOSE Unsqueeze_84 [Unsqueeze] outputs: [151 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_86 [Unsqueeze] TRT - VERBOSE Searching for input: 108 TRT - VERBOSE Searching for input: 152 TRT - VERBOSE Unsqueeze_86 [Unsqueeze] inputs: [108 -> ()[INT32]], [152 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_86 for ONNX node: Unsqueeze_86 TRT - VERBOSE Registering tensor: 153 for ONNX tensor: 153 TRT - VERBOSE Unsqueeze_86 [Unsqueeze] outputs: [153 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_87 [Concat] TRT - VERBOSE Searching for input: 147 TRT - VERBOSE Searching for input: 149 TRT - VERBOSE Searching for input: 151 TRT - VERBOSE Searching for input: 153 TRT - VERBOSE Concat_87 [Concat] inputs: [147 -> (1)[INT32]], [149 -> (1)[INT32]], [151 -> (1)[INT32]], [153 -> (1)[INT32]], TRT - VERBOSE Registering layer: 149 for ONNX node: 149 TRT - VERBOSE Registering layer: Concat_87 for ONNX node: Concat_87 TRT - VERBOSE Registering tensor: 154 for ONNX tensor: 154 TRT - VERBOSE Concat_87 [Concat] outputs: [154 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_88 [Reshape] TRT - VERBOSE Searching for input: 145 TRT - VERBOSE Searching for input: 154 TRT - VERBOSE Reshape_88 [Reshape] inputs: [145 -> (-1, 1, -1, -1)[FLOAT]], [154 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_88 for ONNX node: Reshape_88 TRT - VERBOSE Registering tensor: 155 for ONNX tensor: 155 TRT - VERBOSE Reshape_88 [Reshape] outputs: [155 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_89 [ReduceMean] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE ReduceMean_89 [ReduceMean] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_89 for ONNX node: ReduceMean_89 TRT - VERBOSE Registering tensor: 156 for ONNX tensor: 156 TRT - VERBOSE ReduceMean_89 [ReduceMean] outputs: [156 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_90 [Sub] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 155 TRT - VERBOSE Sub_90 [Sub] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [155 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_90 for ONNX node: Sub_90 TRT - VERBOSE Registering tensor: 157 for ONNX tensor: 157 TRT - VERBOSE Sub_90 [Sub] outputs: [157 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_91 [Softplus] TRT - VERBOSE Searching for input: 157 TRT - VERBOSE Softplus_91 [Softplus] inputs: [157 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_91 for ONNX node: Softplus_91 TRT - VERBOSE Registering tensor: 158 for ONNX tensor: 158 TRT - VERBOSE Softplus_91 [Softplus] outputs: [158 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_92 [Sub] TRT - VERBOSE Searching for input: 96 TRT - VERBOSE Searching for input: 156 TRT - VERBOSE Sub_92 [Sub] inputs: [96 -> (-1, 32, -1, -1)[FLOAT]], [156 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_92 for ONNX node: Sub_92 TRT - VERBOSE Registering tensor: 159 for ONNX tensor: 159 TRT - VERBOSE Sub_92 [Sub] outputs: [159 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_93 [Softplus] TRT - VERBOSE Searching for input: 159 TRT - VERBOSE Softplus_93 [Softplus] inputs: [159 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_93 for ONNX node: Softplus_93 TRT - VERBOSE Registering tensor: 160 for ONNX tensor: 160 TRT - VERBOSE Softplus_93 [Softplus] outputs: [160 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_94 [Mul] TRT - VERBOSE Searching for input: 158 TRT - VERBOSE Searching for input: 160 TRT - VERBOSE Mul_94 [Mul] inputs: [158 -> (-1, 32, -1, -1)[FLOAT]], [160 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_94 for ONNX node: Mul_94 TRT - VERBOSE Registering tensor: 161 for ONNX tensor: 161 TRT - VERBOSE Mul_94 [Mul] outputs: [161 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_95 [ReduceMax] TRT - VERBOSE Searching for input: 161 TRT - VERBOSE ReduceMax_95 [ReduceMax] inputs: [161 -> (-1, 32, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_95 for ONNX node: ReduceMax_95 TRT - VERBOSE Registering tensor: 162 for ONNX tensor: 162 TRT - VERBOSE ReduceMax_95 [ReduceMax] outputs: [162 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_97 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 163 TRT - VERBOSE Unsqueeze_97 [Unsqueeze] inputs: [36 -> ()[INT32]], [163 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_97 for ONNX node: Unsqueeze_97 TRT - VERBOSE Registering tensor: 164 for ONNX tensor: 164 TRT - VERBOSE Unsqueeze_97 [Unsqueeze] outputs: [164 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_99 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 165 TRT - VERBOSE Unsqueeze_99 [Unsqueeze] inputs: [39 -> ()[INT32]], [165 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_99 for ONNX node: Unsqueeze_99 TRT - VERBOSE Registering tensor: 166 for ONNX tensor: 166 TRT - VERBOSE Unsqueeze_99 [Unsqueeze] outputs: [166 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_100 [Concat] TRT - VERBOSE Searching for input: 164 TRT - VERBOSE Searching for input: 166 TRT - VERBOSE Concat_100 [Concat] inputs: [164 -> (1)[INT32]], [166 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_100 for ONNX node: Concat_100 TRT - VERBOSE Registering tensor: 167 for ONNX tensor: 167 TRT - VERBOSE Concat_100 [Concat] outputs: [167 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_101 [Shape] TRT - VERBOSE Searching for input: 162 TRT - VERBOSE Shape_101 [Shape] inputs: [162 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_101 for ONNX node: Shape_101 TRT - VERBOSE Registering tensor: 168 for ONNX tensor: 168 TRT - VERBOSE Shape_101 [Shape] outputs: [168 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_105 [Slice] TRT - VERBOSE Searching for input: 168 TRT - VERBOSE Searching for input: 170 TRT - VERBOSE Searching for input: 171 TRT - VERBOSE Searching for input: 169 TRT - VERBOSE Slice_105 [Slice] inputs: [168 -> (4)[INT32]], [170 -> (1)[INT32]], [171 -> (1)[INT32]], [169 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_105 for ONNX node: Slice_105 TRT - VERBOSE Registering tensor: 172 for ONNX tensor: 172 TRT - VERBOSE Slice_105 [Slice] outputs: [172 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_106 [Cast] TRT - VERBOSE Searching for input: 167 TRT - VERBOSE Cast_106 [Cast] inputs: [167 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_106 for ONNX node: Cast_106 TRT - VERBOSE Registering tensor: 173 for ONNX tensor: 173 TRT - VERBOSE Cast_106 [Cast] outputs: [173 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_107 [Concat] TRT - VERBOSE Searching for input: 172 TRT - VERBOSE Searching for input: 173 TRT - VERBOSE Concat_107 [Concat] inputs: [172 -> (2)[INT32]], [173 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_107 for ONNX node: Concat_107 TRT - VERBOSE Registering tensor: 174 for ONNX tensor: 174 TRT - VERBOSE Concat_107 [Concat] outputs: [174 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_108 [Resize] TRT - VERBOSE Searching for input: 162 TRT - VERBOSE Searching for input: 174 TRT - VERBOSE Resize_108 [Resize] inputs: [162 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [174 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_108 for ONNX node: Resize_108 TRT - VERBOSE Registering tensor: 177 for ONNX tensor: 177 TRT - VERBOSE Resize_108 [Resize] outputs: [177 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_110 [Mul] TRT - VERBOSE Searching for input: 178 TRT - VERBOSE Searching for input: 177 TRT - VERBOSE Mul_110 [Mul] inputs: [178 -> ()[FLOAT]], [177 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 178 for ONNX node: 178 TRT - VERBOSE Registering layer: Mul_110 for ONNX node: Mul_110 TRT - VERBOSE Registering tensor: 179 for ONNX tensor: 179 TRT - VERBOSE Mul_110 [Mul] outputs: [179 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_111 [Add] TRT - VERBOSE Searching for input: 94 TRT - VERBOSE Searching for input: 179 TRT - VERBOSE Add_111 [Add] inputs: [94 -> (-1, 1, -1, -1)[FLOAT]], [179 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_111 for ONNX node: Add_111 TRT - VERBOSE Registering tensor: 180 for ONNX tensor: 180 TRT - VERBOSE Add_111 [Add] outputs: [180 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_113 [Div] TRT - VERBOSE Searching for input: 55 TRT - VERBOSE Searching for input: 181 TRT - VERBOSE Div_113 [Div] inputs: [55 -> (-1, 64, -1, -1)[FLOAT]], [181 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 181 for ONNX node: 181 TRT - VERBOSE Registering layer: Div_113 for ONNX node: Div_113 TRT - VERBOSE Registering tensor: 182 for ONNX tensor: 182 TRT - VERBOSE Div_113 [Div] outputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_114 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_114 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_114 for ONNX node: Shape_114 TRT - VERBOSE Registering tensor: 183 for ONNX tensor: 183 TRT - VERBOSE Shape_114 [Shape] outputs: [183 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_116 [Gather] TRT - VERBOSE Searching for input: 183 TRT - VERBOSE Searching for input: 184 TRT - VERBOSE Gather_116 [Gather] inputs: [183 -> (4)[INT32]], [184 -> ()[INT32]], TRT - VERBOSE Registering layer: 184 for ONNX node: 184 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_116 for ONNX node: Gather_116 TRT - VERBOSE Registering tensor: 185 for ONNX tensor: 185 TRT - VERBOSE Gather_116 [Gather] outputs: [185 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_120 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_120 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_120 for ONNX node: Shape_120 TRT - VERBOSE Registering tensor: 189 for ONNX tensor: 189 TRT - VERBOSE Shape_120 [Shape] outputs: [189 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_122 [Gather] TRT - VERBOSE Searching for input: 189 TRT - VERBOSE Searching for input: 190 TRT - VERBOSE Gather_122 [Gather] inputs: [189 -> (4)[INT32]], [190 -> ()[INT32]], TRT - VERBOSE Registering layer: 190 for ONNX node: 190 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_122 for ONNX node: Gather_122 TRT - VERBOSE Registering tensor: 191 for ONNX tensor: 191 TRT - VERBOSE Gather_122 [Gather] outputs: [191 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_123 [Shape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Shape_123 [Shape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_123 for ONNX node: Shape_123 TRT - VERBOSE Registering tensor: 192 for ONNX tensor: 192 TRT - VERBOSE Shape_123 [Shape] outputs: [192 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_125 [Gather] TRT - VERBOSE Searching for input: 192 TRT - VERBOSE Searching for input: 193 TRT - VERBOSE Gather_125 [Gather] inputs: [192 -> (4)[INT32]], [193 -> ()[INT32]], TRT - VERBOSE Registering layer: 193 for ONNX node: 193 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_125 for ONNX node: Gather_125 TRT - VERBOSE Registering tensor: 194 for ONNX tensor: 194 TRT - VERBOSE Gather_125 [Gather] outputs: [194 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_126 [Mul] TRT - VERBOSE Searching for input: 185 TRT - VERBOSE Searching for input: 188 TRT - VERBOSE Mul_126 [Mul] inputs: [185 -> ()[INT32]], [188 -> ()[INT32]], TRT - VERBOSE Registering layer: 188 for ONNX node: 188 TRT - VERBOSE Registering layer: Mul_126 for ONNX node: Mul_126 TRT - VERBOSE Registering tensor: 195 for ONNX tensor: 195 TRT - VERBOSE Mul_126 [Mul] outputs: [195 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_128 [Unsqueeze] TRT - VERBOSE Searching for input: 195 TRT - VERBOSE Searching for input: 197 TRT - VERBOSE Unsqueeze_128 [Unsqueeze] inputs: [195 -> ()[INT32]], [197 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_128 for ONNX node: Unsqueeze_128 TRT - VERBOSE Registering tensor: 198 for ONNX tensor: 198 TRT - VERBOSE Unsqueeze_128 [Unsqueeze] outputs: [198 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_130 [Unsqueeze] TRT - VERBOSE Searching for input: 191 TRT - VERBOSE Searching for input: 201 TRT - VERBOSE Unsqueeze_130 [Unsqueeze] inputs: [191 -> ()[INT32]], [201 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_130 for ONNX node: Unsqueeze_130 TRT - VERBOSE Registering tensor: 202 for ONNX tensor: 202 TRT - VERBOSE Unsqueeze_130 [Unsqueeze] outputs: [202 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_132 [Unsqueeze] TRT - VERBOSE Searching for input: 194 TRT - VERBOSE Searching for input: 203 TRT - VERBOSE Unsqueeze_132 [Unsqueeze] inputs: [194 -> ()[INT32]], [203 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_132 for ONNX node: Unsqueeze_132 TRT - VERBOSE Registering tensor: 204 for ONNX tensor: 204 TRT - VERBOSE Unsqueeze_132 [Unsqueeze] outputs: [204 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_133 [Concat] TRT - VERBOSE Searching for input: 198 TRT - VERBOSE Searching for input: 511 TRT - VERBOSE Searching for input: 202 TRT - VERBOSE Searching for input: 204 TRT - VERBOSE Concat_133 [Concat] inputs: [198 -> (1)[INT32]], [511 -> (1)[INT32]], [202 -> (1)[INT32]], [204 -> (1)[INT32]], TRT - VERBOSE Registering layer: 511 for ONNX node: 511 TRT - VERBOSE Registering layer: Concat_133 for ONNX node: Concat_133 TRT - VERBOSE Registering tensor: 205 for ONNX tensor: 205 TRT - VERBOSE Concat_133 [Concat] outputs: [205 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_134 [Reshape] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 205 TRT - VERBOSE Reshape_134 [Reshape] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [205 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_134 for ONNX node: Reshape_134 TRT - VERBOSE Registering tensor: 206 for ONNX tensor: 206 TRT - VERBOSE Reshape_134 [Reshape] outputs: [206 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_148 [Pad] TRT - VERBOSE Searching for input: 206 TRT - VERBOSE Searching for input: 228 TRT - VERBOSE Pad_148 [Pad] inputs: [206 -> (-1, 1, -1, -1)[FLOAT]], [228 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_148 for ONNX node: Pad_148 TRT - VERBOSE Registering tensor: 229 for ONNX tensor: 229 TRT - VERBOSE Pad_148 [Pad] outputs: [229 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_149 [Conv] TRT - VERBOSE Searching for input: 229 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_149 [Conv] inputs: [229 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_149 for ONNX node: Conv_149 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (2, 2), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 230 for ONNX tensor: 230 TRT - VERBOSE Conv_149 [Conv] outputs: [230 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_151 [Unsqueeze] TRT - VERBOSE Searching for input: 185 TRT - VERBOSE Searching for input: 231 TRT - VERBOSE Unsqueeze_151 [Unsqueeze] inputs: [185 -> ()[INT32]], [231 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_151 for ONNX node: Unsqueeze_151 TRT - VERBOSE Registering tensor: 232 for ONNX tensor: 232 TRT - VERBOSE Unsqueeze_151 [Unsqueeze] outputs: [232 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_155 [Unsqueeze] TRT - VERBOSE Searching for input: 191 TRT - VERBOSE Searching for input: 235 TRT - VERBOSE Unsqueeze_155 [Unsqueeze] inputs: [191 -> ()[INT32]], [235 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_155 for ONNX node: Unsqueeze_155 TRT - VERBOSE Registering tensor: 236 for ONNX tensor: 236 TRT - VERBOSE Unsqueeze_155 [Unsqueeze] outputs: [236 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_157 [Unsqueeze] TRT - VERBOSE Searching for input: 194 TRT - VERBOSE Searching for input: 237 TRT - VERBOSE Unsqueeze_157 [Unsqueeze] inputs: [194 -> ()[INT32]], [237 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_157 for ONNX node: Unsqueeze_157 TRT - VERBOSE Registering tensor: 238 for ONNX tensor: 238 TRT - VERBOSE Unsqueeze_157 [Unsqueeze] outputs: [238 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_158 [Concat] TRT - VERBOSE Searching for input: 232 TRT - VERBOSE Searching for input: 234 TRT - VERBOSE Searching for input: 236 TRT - VERBOSE Searching for input: 238 TRT - VERBOSE Concat_158 [Concat] inputs: [232 -> (1)[INT32]], [234 -> (1)[INT32]], [236 -> (1)[INT32]], [238 -> (1)[INT32]], TRT - VERBOSE Registering layer: 234 for ONNX node: 234 TRT - VERBOSE Registering layer: Concat_158 for ONNX node: Concat_158 TRT - VERBOSE Registering tensor: 239 for ONNX tensor: 239 TRT - VERBOSE Concat_158 [Concat] outputs: [239 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_159 [Reshape] TRT - VERBOSE Searching for input: 230 TRT - VERBOSE Searching for input: 239 TRT - VERBOSE Reshape_159 [Reshape] inputs: [230 -> (-1, 1, -1, -1)[FLOAT]], [239 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_159 for ONNX node: Reshape_159 TRT - VERBOSE Registering tensor: 240 for ONNX tensor: 240 TRT - VERBOSE Reshape_159 [Reshape] outputs: [240 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_160 [ReduceMean] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE ReduceMean_160 [ReduceMean] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_160 for ONNX node: ReduceMean_160 TRT - VERBOSE Registering tensor: 241 for ONNX tensor: 241 TRT - VERBOSE ReduceMean_160 [ReduceMean] outputs: [241 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_161 [Sub] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 240 TRT - VERBOSE Sub_161 [Sub] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [240 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_161 for ONNX node: Sub_161 TRT - VERBOSE Registering tensor: 242 for ONNX tensor: 242 TRT - VERBOSE Sub_161 [Sub] outputs: [242 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_162 [Softplus] TRT - VERBOSE Searching for input: 242 TRT - VERBOSE Softplus_162 [Softplus] inputs: [242 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_162 for ONNX node: Softplus_162 TRT - VERBOSE Registering tensor: 243 for ONNX tensor: 243 TRT - VERBOSE Softplus_162 [Softplus] outputs: [243 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_163 [Sub] TRT - VERBOSE Searching for input: 182 TRT - VERBOSE Searching for input: 241 TRT - VERBOSE Sub_163 [Sub] inputs: [182 -> (-1, 64, -1, -1)[FLOAT]], [241 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_163 for ONNX node: Sub_163 TRT - VERBOSE Registering tensor: 244 for ONNX tensor: 244 TRT - VERBOSE Sub_163 [Sub] outputs: [244 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_164 [Softplus] TRT - VERBOSE Searching for input: 244 TRT - VERBOSE Softplus_164 [Softplus] inputs: [244 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_164 for ONNX node: Softplus_164 TRT - VERBOSE Registering tensor: 245 for ONNX tensor: 245 TRT - VERBOSE Softplus_164 [Softplus] outputs: [245 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_165 [Mul] TRT - VERBOSE Searching for input: 243 TRT - VERBOSE Searching for input: 245 TRT - VERBOSE Mul_165 [Mul] inputs: [243 -> (-1, 64, -1, -1)[FLOAT]], [245 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_165 for ONNX node: Mul_165 TRT - VERBOSE Registering tensor: 246 for ONNX tensor: 246 TRT - VERBOSE Mul_165 [Mul] outputs: [246 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_166 [ReduceMax] TRT - VERBOSE Searching for input: 246 TRT - VERBOSE ReduceMax_166 [ReduceMax] inputs: [246 -> (-1, 64, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_166 for ONNX node: ReduceMax_166 TRT - VERBOSE Registering tensor: 247 for ONNX tensor: 247 TRT - VERBOSE ReduceMax_166 [ReduceMax] outputs: [247 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_168 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 248 TRT - VERBOSE Unsqueeze_168 [Unsqueeze] inputs: [36 -> ()[INT32]], [248 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_168 for ONNX node: Unsqueeze_168 TRT - VERBOSE Registering tensor: 249 for ONNX tensor: 249 TRT - VERBOSE Unsqueeze_168 [Unsqueeze] outputs: [249 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_170 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 250 TRT - VERBOSE Unsqueeze_170 [Unsqueeze] inputs: [39 -> ()[INT32]], [250 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_170 for ONNX node: Unsqueeze_170 TRT - VERBOSE Registering tensor: 251 for ONNX tensor: 251 TRT - VERBOSE Unsqueeze_170 [Unsqueeze] outputs: [251 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_171 [Concat] TRT - VERBOSE Searching for input: 249 TRT - VERBOSE Searching for input: 251 TRT - VERBOSE Concat_171 [Concat] inputs: [249 -> (1)[INT32]], [251 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_171 for ONNX node: Concat_171 TRT - VERBOSE Registering tensor: 252 for ONNX tensor: 252 TRT - VERBOSE Concat_171 [Concat] outputs: [252 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_172 [Shape] TRT - VERBOSE Searching for input: 247 TRT - VERBOSE Shape_172 [Shape] inputs: [247 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_172 for ONNX node: Shape_172 TRT - VERBOSE Registering tensor: 253 for ONNX tensor: 253 TRT - VERBOSE Shape_172 [Shape] outputs: [253 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_176 [Slice] TRT - VERBOSE Searching for input: 253 TRT - VERBOSE Searching for input: 255 TRT - VERBOSE Searching for input: 256 TRT - VERBOSE Searching for input: 254 TRT - VERBOSE Slice_176 [Slice] inputs: [253 -> (4)[INT32]], [255 -> (1)[INT32]], [256 -> (1)[INT32]], [254 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_176 for ONNX node: Slice_176 TRT - VERBOSE Registering tensor: 257 for ONNX tensor: 257 TRT - VERBOSE Slice_176 [Slice] outputs: [257 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_177 [Cast] TRT - VERBOSE Searching for input: 252 TRT - VERBOSE Cast_177 [Cast] inputs: [252 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_177 for ONNX node: Cast_177 TRT - VERBOSE Registering tensor: 258 for ONNX tensor: 258 TRT - VERBOSE Cast_177 [Cast] outputs: [258 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_178 [Concat] TRT - VERBOSE Searching for input: 257 TRT - VERBOSE Searching for input: 258 TRT - VERBOSE Concat_178 [Concat] inputs: [257 -> (2)[INT32]], [258 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_178 for ONNX node: Concat_178 TRT - VERBOSE Registering tensor: 259 for ONNX tensor: 259 TRT - VERBOSE Concat_178 [Concat] outputs: [259 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_179 [Resize] TRT - VERBOSE Searching for input: 247 TRT - VERBOSE Searching for input: 259 TRT - VERBOSE Resize_179 [Resize] inputs: [247 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [259 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_179 for ONNX node: Resize_179 TRT - VERBOSE Registering tensor: 262 for ONNX tensor: 262 TRT - VERBOSE Resize_179 [Resize] outputs: [262 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_181 [Mul] TRT - VERBOSE Searching for input: 263 TRT - VERBOSE Searching for input: 262 TRT - VERBOSE Mul_181 [Mul] inputs: [263 -> ()[FLOAT]], [262 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 263 for ONNX node: 263 TRT - VERBOSE Registering layer: Mul_181 for ONNX node: Mul_181 TRT - VERBOSE Registering tensor: 264 for ONNX tensor: 264 TRT - VERBOSE Mul_181 [Mul] outputs: [264 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_182 [Add] TRT - VERBOSE Searching for input: 180 TRT - VERBOSE Searching for input: 264 TRT - VERBOSE Add_182 [Add] inputs: [180 -> (-1, 1, -1, -1)[FLOAT]], [264 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_182 for ONNX node: Add_182 TRT - VERBOSE Registering tensor: 265 for ONNX tensor: 265 TRT - VERBOSE Add_182 [Add] outputs: [265 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_184 [Div] TRT - VERBOSE Searching for input: dense_feat_map TRT - VERBOSE Searching for input: 266 TRT - VERBOSE Div_184 [Div] inputs: [dense_feat_map -> (-1, 128, -1, -1)[FLOAT]], [266 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 266 for ONNX node: 266 TRT - VERBOSE Registering layer: Div_184 for ONNX node: Div_184 TRT - VERBOSE Registering tensor: 267 for ONNX tensor: 267 TRT - VERBOSE Div_184 [Div] outputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_185 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_185 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_185 for ONNX node: Shape_185 TRT - VERBOSE Registering tensor: 268 for ONNX tensor: 268 TRT - VERBOSE Shape_185 [Shape] outputs: [268 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_187 [Gather] TRT - VERBOSE Searching for input: 268 TRT - VERBOSE Searching for input: 269 TRT - VERBOSE Gather_187 [Gather] inputs: [268 -> (4)[INT32]], [269 -> ()[INT32]], TRT - VERBOSE Registering layer: 269 for ONNX node: 269 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_187 for ONNX node: Gather_187 TRT - VERBOSE Registering tensor: 270 for ONNX tensor: 270 TRT - VERBOSE Gather_187 [Gather] outputs: [270 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_191 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_191 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_191 for ONNX node: Shape_191 TRT - VERBOSE Registering tensor: 274 for ONNX tensor: 274 TRT - VERBOSE Shape_191 [Shape] outputs: [274 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_193 [Gather] TRT - VERBOSE Searching for input: 274 TRT - VERBOSE Searching for input: 275 TRT - VERBOSE Gather_193 [Gather] inputs: [274 -> (4)[INT32]], [275 -> ()[INT32]], TRT - VERBOSE Registering layer: 275 for ONNX node: 275 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_193 for ONNX node: Gather_193 TRT - VERBOSE Registering tensor: 276 for ONNX tensor: 276 TRT - VERBOSE Gather_193 [Gather] outputs: [276 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_194 [Shape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Shape_194 [Shape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_194 for ONNX node: Shape_194 TRT - VERBOSE Registering tensor: 277 for ONNX tensor: 277 TRT - VERBOSE Shape_194 [Shape] outputs: [277 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_196 [Gather] TRT - VERBOSE Searching for input: 277 TRT - VERBOSE Searching for input: 278 TRT - VERBOSE Gather_196 [Gather] inputs: [277 -> (4)[INT32]], [278 -> ()[INT32]], TRT - VERBOSE Registering layer: 278 for ONNX node: 278 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_196 for ONNX node: Gather_196 TRT - VERBOSE Registering tensor: 279 for ONNX tensor: 279 TRT - VERBOSE Gather_196 [Gather] outputs: [279 -> ()[INT32]], TRT - VERBOSE Parsing node: Mul_197 [Mul] TRT - VERBOSE Searching for input: 270 TRT - VERBOSE Searching for input: 273 TRT - VERBOSE Mul_197 [Mul] inputs: [270 -> ()[INT32]], [273 -> ()[INT32]], TRT - VERBOSE Registering layer: 273 for ONNX node: 273 TRT - VERBOSE Registering layer: Mul_197 for ONNX node: Mul_197 TRT - VERBOSE Registering tensor: 280 for ONNX tensor: 280 TRT - VERBOSE Mul_197 [Mul] outputs: [280 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_199 [Unsqueeze] TRT - VERBOSE Searching for input: 280 TRT - VERBOSE Searching for input: 282 TRT - VERBOSE Unsqueeze_199 [Unsqueeze] inputs: [280 -> ()[INT32]], [282 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_199 for ONNX node: Unsqueeze_199 TRT - VERBOSE Registering tensor: 283 for ONNX tensor: 283 TRT - VERBOSE Unsqueeze_199 [Unsqueeze] outputs: [283 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_201 [Unsqueeze] TRT - VERBOSE Searching for input: 276 TRT - VERBOSE Searching for input: 286 TRT - VERBOSE Unsqueeze_201 [Unsqueeze] inputs: [276 -> ()[INT32]], [286 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_201 for ONNX node: Unsqueeze_201 TRT - VERBOSE Registering tensor: 287 for ONNX tensor: 287 TRT - VERBOSE Unsqueeze_201 [Unsqueeze] outputs: [287 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_203 [Unsqueeze] TRT - VERBOSE Searching for input: 279 TRT - VERBOSE Searching for input: 288 TRT - VERBOSE Unsqueeze_203 [Unsqueeze] inputs: [279 -> ()[INT32]], [288 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_203 for ONNX node: Unsqueeze_203 TRT - VERBOSE Registering tensor: 289 for ONNX tensor: 289 TRT - VERBOSE Unsqueeze_203 [Unsqueeze] outputs: [289 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_204 [Concat] TRT - VERBOSE Searching for input: 283 TRT - VERBOSE Searching for input: 517 TRT - VERBOSE Searching for input: 287 TRT - VERBOSE Searching for input: 289 TRT - VERBOSE Concat_204 [Concat] inputs: [283 -> (1)[INT32]], [517 -> (1)[INT32]], [287 -> (1)[INT32]], [289 -> (1)[INT32]], TRT - VERBOSE Registering layer: 517 for ONNX node: 517 TRT - VERBOSE Registering layer: Concat_204 for ONNX node: Concat_204 TRT - VERBOSE Registering tensor: 290 for ONNX tensor: 290 TRT - VERBOSE Concat_204 [Concat] outputs: [290 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_205 [Reshape] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 290 TRT - VERBOSE Reshape_205 [Reshape] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [290 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_205 for ONNX node: Reshape_205 TRT - VERBOSE Registering tensor: 291 for ONNX tensor: 291 TRT - VERBOSE Reshape_205 [Reshape] outputs: [291 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_219 [Pad] TRT - VERBOSE Searching for input: 291 TRT - VERBOSE Searching for input: 313 TRT - VERBOSE Pad_219 [Pad] inputs: [291 -> (-1, 1, -1, -1)[FLOAT]], [313 -> (8)[INT32]], TRT - VERBOSE Registering layer: Pad_219 for ONNX node: Pad_219 TRT - VERBOSE Registering tensor: 314 for ONNX tensor: 314 TRT - VERBOSE Pad_219 [Pad] outputs: [314 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_220 [Conv] TRT - VERBOSE Searching for input: 314 TRT - VERBOSE Searching for input: 144 TRT - VERBOSE Conv_220 [Conv] inputs: [314 -> (-1, 1, -1, -1)[FLOAT]], [144 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_220 for ONNX node: Conv_220 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (1, 1), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 315 for ONNX tensor: 315 TRT - VERBOSE Conv_220 [Conv] outputs: [315 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_222 [Unsqueeze] TRT - VERBOSE Searching for input: 270 TRT - VERBOSE Searching for input: 316 TRT - VERBOSE Unsqueeze_222 [Unsqueeze] inputs: [270 -> ()[INT32]], [316 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_222 for ONNX node: Unsqueeze_222 TRT - VERBOSE Registering tensor: 317 for ONNX tensor: 317 TRT - VERBOSE Unsqueeze_222 [Unsqueeze] outputs: [317 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_226 [Unsqueeze] TRT - VERBOSE Searching for input: 276 TRT - VERBOSE Searching for input: 320 TRT - VERBOSE Unsqueeze_226 [Unsqueeze] inputs: [276 -> ()[INT32]], [320 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_226 for ONNX node: Unsqueeze_226 TRT - VERBOSE Registering tensor: 321 for ONNX tensor: 321 TRT - VERBOSE Unsqueeze_226 [Unsqueeze] outputs: [321 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_228 [Unsqueeze] TRT - VERBOSE Searching for input: 279 TRT - VERBOSE Searching for input: 322 TRT - VERBOSE Unsqueeze_228 [Unsqueeze] inputs: [279 -> ()[INT32]], [322 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_228 for ONNX node: Unsqueeze_228 TRT - VERBOSE Registering tensor: 323 for ONNX tensor: 323 TRT - VERBOSE Unsqueeze_228 [Unsqueeze] outputs: [323 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_229 [Concat] TRT - VERBOSE Searching for input: 317 TRT - VERBOSE Searching for input: 319 TRT - VERBOSE Searching for input: 321 TRT - VERBOSE Searching for input: 323 TRT - VERBOSE Concat_229 [Concat] inputs: [317 -> (1)[INT32]], [319 -> (1)[INT32]], [321 -> (1)[INT32]], [323 -> (1)[INT32]], TRT - VERBOSE Registering layer: 319 for ONNX node: 319 TRT - VERBOSE Registering layer: Concat_229 for ONNX node: Concat_229 TRT - VERBOSE Registering tensor: 324 for ONNX tensor: 324 TRT - VERBOSE Concat_229 [Concat] outputs: [324 -> (4)[INT32]], TRT - VERBOSE Parsing node: Reshape_230 [Reshape] TRT - VERBOSE Searching for input: 315 TRT - VERBOSE Searching for input: 324 TRT - VERBOSE Reshape_230 [Reshape] inputs: [315 -> (-1, 1, -1, -1)[FLOAT]], [324 -> (4)[INT32]], TRT - VERBOSE Registering layer: Reshape_230 for ONNX node: Reshape_230 TRT - VERBOSE Registering tensor: 325 for ONNX tensor: 325 TRT - VERBOSE Reshape_230 [Reshape] outputs: [325 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMean_231 [ReduceMean] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE ReduceMean_231 [ReduceMean] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMean_231 for ONNX node: ReduceMean_231 TRT - VERBOSE Registering tensor: 326 for ONNX tensor: 326 TRT - VERBOSE ReduceMean_231 [ReduceMean] outputs: [326 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_232 [Sub] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 325 TRT - VERBOSE Sub_232 [Sub] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [325 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_232 for ONNX node: Sub_232 TRT - VERBOSE Registering tensor: 327 for ONNX tensor: 327 TRT - VERBOSE Sub_232 [Sub] outputs: [327 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_233 [Softplus] TRT - VERBOSE Searching for input: 327 TRT - VERBOSE Softplus_233 [Softplus] inputs: [327 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_233 for ONNX node: Softplus_233 TRT - VERBOSE Registering tensor: 328 for ONNX tensor: 328 TRT - VERBOSE Softplus_233 [Softplus] outputs: [328 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_234 [Sub] TRT - VERBOSE Searching for input: 267 TRT - VERBOSE Searching for input: 326 TRT - VERBOSE Sub_234 [Sub] inputs: [267 -> (-1, 128, -1, -1)[FLOAT]], [326 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_234 for ONNX node: Sub_234 TRT - VERBOSE Registering tensor: 329 for ONNX tensor: 329 TRT - VERBOSE Sub_234 [Sub] outputs: [329 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Softplus_235 [Softplus] TRT - VERBOSE Searching for input: 329 TRT - VERBOSE Softplus_235 [Softplus] inputs: [329 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Softplus_235 for ONNX node: Softplus_235 TRT - VERBOSE Registering tensor: 330 for ONNX tensor: 330 TRT - VERBOSE Softplus_235 [Softplus] outputs: [330 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_236 [Mul] TRT - VERBOSE Searching for input: 328 TRT - VERBOSE Searching for input: 330 TRT - VERBOSE Mul_236 [Mul] inputs: [328 -> (-1, 128, -1, -1)[FLOAT]], [330 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_236 for ONNX node: Mul_236 TRT - VERBOSE Registering tensor: 331 for ONNX tensor: 331 TRT - VERBOSE Mul_236 [Mul] outputs: [331 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: ReduceMax_237 [ReduceMax] TRT - VERBOSE Searching for input: 331 TRT - VERBOSE ReduceMax_237 [ReduceMax] inputs: [331 -> (-1, 128, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: ReduceMax_237 for ONNX node: ReduceMax_237 TRT - VERBOSE Registering tensor: 332 for ONNX tensor: 332 TRT - VERBOSE ReduceMax_237 [ReduceMax] outputs: [332 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Unsqueeze_239 [Unsqueeze] TRT - VERBOSE Searching for input: 36 TRT - VERBOSE Searching for input: 333 TRT - VERBOSE Unsqueeze_239 [Unsqueeze] inputs: [36 -> ()[INT32]], [333 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_239 for ONNX node: Unsqueeze_239 TRT - VERBOSE Registering tensor: 334 for ONNX tensor: 334 TRT - VERBOSE Unsqueeze_239 [Unsqueeze] outputs: [334 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_241 [Unsqueeze] TRT - VERBOSE Searching for input: 39 TRT - VERBOSE Searching for input: 335 TRT - VERBOSE Unsqueeze_241 [Unsqueeze] inputs: [39 -> ()[INT32]], [335 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_241 for ONNX node: Unsqueeze_241 TRT - VERBOSE Registering tensor: 336 for ONNX tensor: 336 TRT - VERBOSE Unsqueeze_241 [Unsqueeze] outputs: [336 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_242 [Concat] TRT - VERBOSE Searching for input: 334 TRT - VERBOSE Searching for input: 336 TRT - VERBOSE Concat_242 [Concat] inputs: [334 -> (1)[INT32]], [336 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_242 for ONNX node: Concat_242 TRT - VERBOSE Registering tensor: 337 for ONNX tensor: 337 TRT - VERBOSE Concat_242 [Concat] outputs: [337 -> (2)[INT32]], TRT - VERBOSE Parsing node: Shape_243 [Shape] TRT - VERBOSE Searching for input: 332 TRT - VERBOSE Shape_243 [Shape] inputs: [332 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_243 for ONNX node: Shape_243 TRT - VERBOSE Registering tensor: 338 for ONNX tensor: 338 TRT - VERBOSE Shape_243 [Shape] outputs: [338 -> (4)[INT32]], TRT - VERBOSE Parsing node: Slice_247 [Slice] TRT - VERBOSE Searching for input: 338 TRT - VERBOSE Searching for input: 340 TRT - VERBOSE Searching for input: 341 TRT - VERBOSE Searching for input: 339 TRT - VERBOSE Slice_247 [Slice] inputs: [338 -> (4)[INT32]], [340 -> (1)[INT32]], [341 -> (1)[INT32]], [339 -> (1)[INT32]], TRT - VERBOSE Registering layer: Slice_247 for ONNX node: Slice_247 TRT - VERBOSE Registering tensor: 342 for ONNX tensor: 342 TRT - VERBOSE Slice_247 [Slice] outputs: [342 -> (2)[INT32]], TRT - VERBOSE Parsing node: Cast_248 [Cast] TRT - VERBOSE Searching for input: 337 TRT - VERBOSE Cast_248 [Cast] inputs: [337 -> (2)[INT32]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_248 for ONNX node: Cast_248 TRT - VERBOSE Registering tensor: 343 for ONNX tensor: 343 TRT - VERBOSE Cast_248 [Cast] outputs: [343 -> (2)[INT32]], TRT - VERBOSE Parsing node: Concat_249 [Concat] TRT - VERBOSE Searching for input: 342 TRT - VERBOSE Searching for input: 343 TRT - VERBOSE Concat_249 [Concat] inputs: [342 -> (2)[INT32]], [343 -> (2)[INT32]], TRT - VERBOSE Registering layer: Concat_249 for ONNX node: Concat_249 TRT - VERBOSE Registering tensor: 344 for ONNX tensor: 344 TRT - VERBOSE Concat_249 [Concat] outputs: [344 -> (4)[INT32]], TRT - VERBOSE Parsing node: Resize_250 [Resize] TRT - VERBOSE Searching for input: 332 TRT - VERBOSE Searching for input: 344 TRT - VERBOSE Resize_250 [Resize] inputs: [332 -> (-1, 1, -1, -1)[FLOAT]], [optional input, not set], [optional input, not set], [344 -> (4)[INT32]], TRT - VERBOSE Registering layer: Resize_250 for ONNX node: Resize_250 TRT - VERBOSE Registering tensor: 347 for ONNX tensor: 347 TRT - VERBOSE Resize_250 [Resize] outputs: [347 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_252 [Mul] TRT - VERBOSE Searching for input: 348 TRT - VERBOSE Searching for input: 347 TRT - VERBOSE Mul_252 [Mul] inputs: [348 -> ()[FLOAT]], [347 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: 348 for ONNX node: 348 TRT - VERBOSE Registering layer: Mul_252 for ONNX node: Mul_252 TRT - VERBOSE Registering tensor: 349 for ONNX tensor: 349 TRT - VERBOSE Mul_252 [Mul] outputs: [349 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_253 [Add] TRT - VERBOSE Searching for input: 265 TRT - VERBOSE Searching for input: 349 TRT - VERBOSE Add_253 [Add] inputs: [265 -> (-1, 1, -1, -1)[FLOAT]], [349 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_253 for ONNX node: Add_253 TRT - VERBOSE Registering tensor: 350 for ONNX tensor: 350 TRT - VERBOSE Add_253 [Add] outputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Shape_254 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_254 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_254 for ONNX node: Shape_254 TRT - VERBOSE Registering tensor: 351 for ONNX tensor: 351 TRT - VERBOSE Shape_254 [Shape] outputs: [351 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_256 [Gather] TRT - VERBOSE Searching for input: 351 TRT - VERBOSE Searching for input: 352 TRT - VERBOSE Gather_256 [Gather] inputs: [351 -> (4)[INT32]], [352 -> ()[INT32]], TRT - VERBOSE Registering layer: 352 for ONNX node: 352 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_256 for ONNX node: Gather_256 TRT - VERBOSE Registering tensor: 353 for ONNX tensor: 353 TRT - VERBOSE Gather_256 [Gather] outputs: [353 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_257 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_257 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_257 for ONNX node: Shape_257 TRT - VERBOSE Registering tensor: 354 for ONNX tensor: 354 TRT - VERBOSE Shape_257 [Shape] outputs: [354 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_259 [Gather] TRT - VERBOSE Searching for input: 354 TRT - VERBOSE Searching for input: 355 TRT - VERBOSE Gather_259 [Gather] inputs: [354 -> (4)[INT32]], [355 -> ()[INT32]], TRT - VERBOSE Registering layer: 355 for ONNX node: 355 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_259 for ONNX node: Gather_259 TRT - VERBOSE Registering tensor: 356 for ONNX tensor: 356 TRT - VERBOSE Gather_259 [Gather] outputs: [356 -> ()[INT32]], TRT - VERBOSE Parsing node: Greater_261 [Greater] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 357 TRT - VERBOSE Greater_261 [Greater] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [357 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 357 for ONNX node: 357 TRT - VERBOSE Registering layer: Greater_261 for ONNX node: Greater_261 TRT - VERBOSE Registering tensor: 358 for ONNX tensor: 358 TRT - VERBOSE Greater_261 [Greater] outputs: [358 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: MaxPool_262 [MaxPool] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE MaxPool_262 [MaxPool] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: MaxPool_262 for ONNX node: MaxPool_262 TRT - VERBOSE Registering tensor: 359 for ONNX tensor: 359 TRT - VERBOSE MaxPool_262 [MaxPool] outputs: [359 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Equal_263 [Equal] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 359 TRT - VERBOSE Equal_263 [Equal] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [359 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Equal_263 for ONNX node: Equal_263 TRT - VERBOSE Registering tensor: 360 for ONNX tensor: 360 TRT - VERBOSE Equal_263 [Equal] outputs: [360 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_264 [Cast] TRT - VERBOSE Searching for input: 360 TRT - VERBOSE Cast_264 [Cast] inputs: [360 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_264 for ONNX node: Cast_264 TRT - VERBOSE Registering tensor: 361 for ONNX tensor: 361 TRT - VERBOSE Cast_264 [Cast] outputs: [361 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_265 [Cast] TRT - VERBOSE Searching for input: 358 TRT - VERBOSE Cast_265 [Cast] inputs: [358 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_265 for ONNX node: Cast_265 TRT - VERBOSE Registering tensor: 362 for ONNX tensor: 362 TRT - VERBOSE Cast_265 [Cast] outputs: [362 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_266 [And] TRT - VERBOSE Searching for input: 361 TRT - VERBOSE Searching for input: 362 TRT - VERBOSE And_266 [And] inputs: [361 -> (-1, 1, -1, -1)[BOOL]], [362 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_266 for ONNX node: And_266 TRT - VERBOSE Registering tensor: 363 for ONNX tensor: 363 TRT - VERBOSE And_266 [And] outputs: [363 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_267 [Cast] TRT - VERBOSE Searching for input: 363 TRT - VERBOSE Cast_267 [Cast] inputs: [363 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_267 for ONNX node: Cast_267 TRT - VERBOSE Registering tensor: 364 for ONNX tensor: 364 TRT - VERBOSE Cast_267 [Cast] outputs: [364 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Sub_269 [Sub] TRT - VERBOSE Searching for input: 353 TRT - VERBOSE Searching for input: 365 TRT - VERBOSE Sub_269 [Sub] inputs: [353 -> ()[INT32]], [365 -> ()[INT32]], TRT - VERBOSE Registering layer: 365 for ONNX node: 365 TRT - VERBOSE Registering layer: Sub_269 for ONNX node: Sub_269 TRT - VERBOSE Registering tensor: 366 for ONNX tensor: 366 TRT - VERBOSE Sub_269 [Sub] outputs: [366 -> ()[INT32]], TRT - VERBOSE Parsing node: Sub_271 [Sub] TRT - VERBOSE Searching for input: 356 TRT - VERBOSE Searching for input: 367 TRT - VERBOSE Sub_271 [Sub] inputs: [356 -> ()[INT32]], [367 -> ()[INT32]], TRT - VERBOSE Registering layer: 367 for ONNX node: 367 TRT - VERBOSE Registering layer: Sub_271 for ONNX node: Sub_271 TRT - VERBOSE Registering tensor: 368 for ONNX tensor: 368 TRT - VERBOSE Sub_271 [Sub] outputs: [368 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_273 [Unsqueeze] TRT - VERBOSE Searching for input: 366 TRT - VERBOSE Searching for input: 375 TRT - VERBOSE Unsqueeze_273 [Unsqueeze] inputs: [366 -> ()[INT32]], [375 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_273 for ONNX node: Unsqueeze_273 TRT - VERBOSE Registering tensor: 376 for ONNX tensor: 376 TRT - VERBOSE Unsqueeze_273 [Unsqueeze] outputs: [376 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_275 [Unsqueeze] TRT - VERBOSE Searching for input: 368 TRT - VERBOSE Searching for input: 377 TRT - VERBOSE Unsqueeze_275 [Unsqueeze] inputs: [368 -> ()[INT32]], [377 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_275 for ONNX node: Unsqueeze_275 TRT - VERBOSE Registering tensor: 378 for ONNX tensor: 378 TRT - VERBOSE Unsqueeze_275 [Unsqueeze] outputs: [378 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_276 [Concat] TRT - VERBOSE Searching for input: 523 TRT - VERBOSE Searching for input: 524 TRT - VERBOSE Searching for input: 376 TRT - VERBOSE Searching for input: 378 TRT - VERBOSE Concat_276 [Concat] inputs: [523 -> (1)[INT32]], [524 -> (1)[INT32]], [376 -> (1)[INT32]], [378 -> (1)[INT32]], TRT - VERBOSE Registering layer: 523 for ONNX node: 523 TRT - VERBOSE Registering layer: 524 for ONNX node: 524 TRT - VERBOSE Registering layer: Concat_276 for ONNX node: Concat_276 TRT - VERBOSE Registering tensor: 379 for ONNX tensor: 379 TRT - VERBOSE Concat_276 [Concat] outputs: [379 -> (4)[INT32]], TRT - VERBOSE Parsing node: ConstantOfShape_277 [ConstantOfShape] TRT - VERBOSE Searching for input: 379 TRT - VERBOSE ConstantOfShape_277 [ConstantOfShape] inputs: [379 -> (4)[INT32]], TRT - VERBOSE Registering layer: ConstantOfShape_277 for ONNX node: ConstantOfShape_277 TRT - VERBOSE Registering tensor: 380 for ONNX tensor: 380 TRT - VERBOSE ConstantOfShape_277 [ConstantOfShape] outputs: [380 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_278 [Cast] TRT - VERBOSE Searching for input: 380 TRT - VERBOSE Cast_278 [Cast] inputs: [380 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Casting to type: float32 TRT - VERBOSE Registering layer: Cast_278 for ONNX node: Cast_278 TRT - VERBOSE Registering tensor: 381 for ONNX tensor: 381 TRT - VERBOSE Cast_278 [Cast] outputs: [381 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pad_293 [Pad] TRT - VERBOSE Searching for input: 381 TRT - VERBOSE Searching for input: 403 TRT - VERBOSE Searching for input: 404 TRT - VERBOSE Pad_293 [Pad] inputs: [381 -> (1, 1, -1, -1)[FLOAT]], [403 -> (8)[INT32]], [404 -> ()[FLOAT]], TRT - VERBOSE Registering layer: Pad_293 for ONNX node: Pad_293 TRT - VERBOSE Registering tensor: 405 for ONNX tensor: 405 TRT - VERBOSE Pad_293 [Pad] outputs: [405 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_294 [Cast] TRT - VERBOSE Searching for input: 405 TRT - VERBOSE Cast_294 [Cast] inputs: [405 -> (1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_294 for ONNX node: Cast_294 TRT - VERBOSE Registering tensor: 406 for ONNX tensor: 406 TRT - VERBOSE Cast_294 [Cast] outputs: [406 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_295 [Cast] TRT - VERBOSE Searching for input: 406 TRT - VERBOSE Cast_295 [Cast] inputs: [406 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_295 for ONNX node: Cast_295 TRT - VERBOSE Registering tensor: 407 for ONNX tensor: 407 TRT - VERBOSE Cast_295 [Cast] outputs: [407 -> (1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_296 [Cast] TRT - VERBOSE Searching for input: 364 TRT - VERBOSE Cast_296 [Cast] inputs: [364 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_296 for ONNX node: Cast_296 TRT - VERBOSE Registering tensor: 408 for ONNX tensor: 408 TRT - VERBOSE Cast_296 [Cast] outputs: [408 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_297 [And] TRT - VERBOSE Searching for input: 407 TRT - VERBOSE Searching for input: 408 TRT - VERBOSE And_297 [And] inputs: [407 -> (1, 1, -1, -1)[BOOL]], [408 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_297 for ONNX node: And_297 TRT - VERBOSE Registering tensor: 409 for ONNX tensor: 409 TRT - VERBOSE And_297 [And] outputs: [409 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_298 [Cast] TRT - VERBOSE Searching for input: 409 TRT - VERBOSE Cast_298 [Cast] inputs: [409 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_298 for ONNX node: Cast_298 TRT - VERBOSE Registering tensor: 410 for ONNX tensor: 410 TRT - VERBOSE Cast_298 [Cast] outputs: [410 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Pad_313 [Pad] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 432 TRT - VERBOSE Searching for input: 433 TRT - VERBOSE Pad_313 [Pad] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [432 -> (8)[INT32]], [433 -> ()[FLOAT]], TRT - VERBOSE Registering layer: Pad_313 for ONNX node: Pad_313 TRT - VERBOSE Registering tensor: 434 for ONNX tensor: 434 TRT - VERBOSE Pad_313 [Pad] outputs: [434 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_315 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 435 TRT - VERBOSE Conv_315 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [435 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_315 for ONNX node: Conv_315 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 436 for ONNX tensor: 436 TRT - VERBOSE Conv_315 [Conv] outputs: [436 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_317 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 437 TRT - VERBOSE Conv_317 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [437 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_317 for ONNX node: Conv_317 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 438 for ONNX tensor: 438 TRT - VERBOSE Conv_317 [Conv] outputs: [438 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Conv_319 [Conv] TRT - VERBOSE Searching for input: 434 TRT - VERBOSE Searching for input: 439 TRT - VERBOSE Conv_319 [Conv] inputs: [434 -> (-1, 1, -1, -1)[FLOAT]], [439 -> (1, 1, 3, 3)[FLOAT]], TRT - VERBOSE Convolution input dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering layer: Conv_319 for ONNX node: Conv_319 TRT - VERBOSE Using kernel: (3, 3), strides: (1, 1), prepadding: (0, 0), postpadding: (0, 0), dilations: (3, 3), numOutputs: 1 TRT - VERBOSE Convolution output dimensions: (-1, 1, -1, -1) TRT - VERBOSE Registering tensor: 440 for ONNX tensor: 440 TRT - VERBOSE Conv_319 [Conv] outputs: [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_320 [Mul] TRT - VERBOSE Searching for input: 436 TRT - VERBOSE Searching for input: 440 TRT - VERBOSE Mul_320 [Mul] inputs: [436 -> (-1, 1, -1, -1)[FLOAT]], [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_320 for ONNX node: Mul_320 TRT - VERBOSE Registering tensor: 441 for ONNX tensor: 441 TRT - VERBOSE Mul_320 [Mul] outputs: [441 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Mul_321 [Mul] TRT - VERBOSE Searching for input: 438 TRT - VERBOSE Searching for input: 438 TRT - VERBOSE Mul_321 [Mul] inputs: [438 -> (-1, 1, -1, -1)[FLOAT]], [438 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Mul_321 for ONNX node: Mul_321 TRT - VERBOSE Registering tensor: 442 for ONNX tensor: 442 TRT - VERBOSE Mul_321 [Mul] outputs: [442 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Sub_322 [Sub] TRT - VERBOSE Searching for input: 441 TRT - VERBOSE Searching for input: 442 TRT - VERBOSE Sub_322 [Sub] inputs: [441 -> (-1, 1, -1, -1)[FLOAT]], [442 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Sub_322 for ONNX node: Sub_322 TRT - VERBOSE Registering tensor: 443 for ONNX tensor: 443 TRT - VERBOSE Sub_322 [Sub] outputs: [443 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Add_323 [Add] TRT - VERBOSE Searching for input: 436 TRT - VERBOSE Searching for input: 440 TRT - VERBOSE Add_323 [Add] inputs: [436 -> (-1, 1, -1, -1)[FLOAT]], [440 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Add_323 for ONNX node: Add_323 TRT - VERBOSE Registering tensor: 444 for ONNX tensor: 444 TRT - VERBOSE Add_323 [Add] outputs: [444 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Pow_325 [Pow] TRT - VERBOSE Searching for input: 444 TRT - VERBOSE Searching for input: 445 TRT - VERBOSE Pow_325 [Pow] inputs: [444 -> (-1, 1, -1, -1)[FLOAT]], [445 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 445 for ONNX node: 445 TRT - VERBOSE Registering layer: Pow_325 for ONNX node: Pow_325 TRT - VERBOSE Registering tensor: 446 for ONNX tensor: 446 TRT - VERBOSE Pow_325 [Pow] outputs: [446 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Div_326 [Div] TRT - VERBOSE Searching for input: 446 TRT - VERBOSE Searching for input: 443 TRT - VERBOSE Div_326 [Div] inputs: [446 -> (-1, 1, -1, -1)[FLOAT]], [443 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Div_326 for ONNX node: Div_326 TRT - VERBOSE Registering tensor: 447 for ONNX tensor: 447 TRT - VERBOSE Div_326 [Div] outputs: [447 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Parsing node: LessOrEqual_328 [LessOrEqual] TRT - VERBOSE Searching for input: 447 TRT - VERBOSE Searching for input: 448 TRT - VERBOSE LessOrEqual_328 [LessOrEqual] inputs: [447 -> (-1, 1, -1, -1)[FLOAT]], [448 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 448 for ONNX node: 448 TRT - VERBOSE Registering layer: LessOrEqual_328 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering layer: LessOrEqual_328_97 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering layer: LessOrEqual_328_98 for ONNX node: LessOrEqual_328 TRT - VERBOSE Registering tensor: 449 for ONNX tensor: 449 TRT - VERBOSE LessOrEqual_328 [LessOrEqual] outputs: [449 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Greater_330 [Greater] TRT - VERBOSE Searching for input: 443 TRT - VERBOSE Searching for input: 450 TRT - VERBOSE Greater_330 [Greater] inputs: [443 -> (-1, 1, -1, -1)[FLOAT]], [450 -> ()[FLOAT]], TRT - VERBOSE Registering layer: 450 for ONNX node: 450 TRT - VERBOSE Registering layer: Greater_330 for ONNX node: Greater_330 TRT - VERBOSE Registering tensor: 451 for ONNX tensor: 451 TRT - VERBOSE Greater_330 [Greater] outputs: [451 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_331 [Cast] TRT - VERBOSE Searching for input: 449 TRT - VERBOSE Cast_331 [Cast] inputs: [449 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_331 for ONNX node: Cast_331 TRT - VERBOSE Registering tensor: 452 for ONNX tensor: 452 TRT - VERBOSE Cast_331 [Cast] outputs: [452 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_332 [Cast] TRT - VERBOSE Searching for input: 451 TRT - VERBOSE Cast_332 [Cast] inputs: [451 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_332 for ONNX node: Cast_332 TRT - VERBOSE Registering tensor: 453 for ONNX tensor: 453 TRT - VERBOSE Cast_332 [Cast] outputs: [453 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_333 [And] TRT - VERBOSE Searching for input: 452 TRT - VERBOSE Searching for input: 453 TRT - VERBOSE And_333 [And] inputs: [452 -> (-1, 1, -1, -1)[BOOL]], [453 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_333 for ONNX node: And_333 TRT - VERBOSE Registering tensor: 454 for ONNX tensor: 454 TRT - VERBOSE And_333 [And] outputs: [454 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_334 [Cast] TRT - VERBOSE Searching for input: 454 TRT - VERBOSE Cast_334 [Cast] inputs: [454 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_334 for ONNX node: Cast_334 TRT - VERBOSE Registering tensor: 455 for ONNX tensor: 455 TRT - VERBOSE Cast_334 [Cast] outputs: [455 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_335 [Cast] TRT - VERBOSE Searching for input: 455 TRT - VERBOSE Cast_335 [Cast] inputs: [455 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_335 for ONNX node: Cast_335 TRT - VERBOSE Registering tensor: 456 for ONNX tensor: 456 TRT - VERBOSE Cast_335 [Cast] outputs: [456 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_336 [Cast] TRT - VERBOSE Searching for input: 410 TRT - VERBOSE Cast_336 [Cast] inputs: [410 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_336 for ONNX node: Cast_336 TRT - VERBOSE Registering tensor: 457 for ONNX tensor: 457 TRT - VERBOSE Cast_336 [Cast] outputs: [457 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: And_337 [And] TRT - VERBOSE Searching for input: 456 TRT - VERBOSE Searching for input: 457 TRT - VERBOSE And_337 [And] inputs: [456 -> (-1, 1, -1, -1)[BOOL]], [457 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Registering layer: And_337 for ONNX node: And_337 TRT - VERBOSE Registering tensor: 458 for ONNX tensor: 458 TRT - VERBOSE And_337 [And] outputs: [458 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_338 [Cast] TRT - VERBOSE Searching for input: 458 TRT - VERBOSE Cast_338 [Cast] inputs: [458 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: bool TRT - VERBOSE Registering layer: Cast_338 for ONNX node: Cast_338 TRT - VERBOSE Registering tensor: 459 for ONNX tensor: 459 TRT - VERBOSE Cast_338 [Cast] outputs: [459 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Parsing node: Cast_339 [Cast] TRT - VERBOSE Searching for input: 459 TRT - VERBOSE Cast_339 [Cast] inputs: [459 -> (-1, 1, -1, -1)[BOOL]], TRT - VERBOSE Casting to type: int32 TRT - VERBOSE Registering layer: Cast_339 for ONNX node: Cast_339 TRT - VERBOSE Registering tensor: 460 for ONNX tensor: 460 TRT - VERBOSE Cast_339 [Cast] outputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Parsing node: Shape_340 [Shape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Shape_340 [Shape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Registering layer: Shape_340 for ONNX node: Shape_340 TRT - VERBOSE Registering tensor: 461 for ONNX tensor: 461 TRT - VERBOSE Shape_340 [Shape] outputs: [461 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_342 [Gather] TRT - VERBOSE Searching for input: 461 TRT - VERBOSE Searching for input: 462 TRT - VERBOSE Gather_342 [Gather] inputs: [461 -> (4)[INT32]], [462 -> ()[INT32]], TRT - VERBOSE Registering layer: 462 for ONNX node: 462 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_342 for ONNX node: Gather_342 TRT - VERBOSE Registering tensor: 463 for ONNX tensor: 463 TRT - VERBOSE Gather_342 [Gather] outputs: [463 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_343 [Shape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Shape_343 [Shape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], TRT - VERBOSE Registering layer: Shape_343 for ONNX node: Shape_343 TRT - VERBOSE Registering tensor: 464 for ONNX tensor: 464 TRT - VERBOSE Shape_343 [Shape] outputs: [464 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_345 [Gather] TRT - VERBOSE Searching for input: 464 TRT - VERBOSE Searching for input: 465 TRT - VERBOSE Gather_345 [Gather] inputs: [464 -> (4)[INT32]], [465 -> ()[INT32]], TRT - VERBOSE Registering layer: 465 for ONNX node: 465 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_345 for ONNX node: Gather_345 TRT - VERBOSE Registering tensor: 466 for ONNX tensor: 466 TRT - VERBOSE Gather_345 [Gather] outputs: [466 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_347 [Unsqueeze] TRT - VERBOSE Searching for input: 463 TRT - VERBOSE Searching for input: 467 TRT - VERBOSE Unsqueeze_347 [Unsqueeze] inputs: [463 -> ()[INT32]], [467 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_347 for ONNX node: Unsqueeze_347 TRT - VERBOSE Registering tensor: 468 for ONNX tensor: 468 TRT - VERBOSE Unsqueeze_347 [Unsqueeze] outputs: [468 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_349 [Unsqueeze] TRT - VERBOSE Searching for input: 466 TRT - VERBOSE Searching for input: 469 TRT - VERBOSE Unsqueeze_349 [Unsqueeze] inputs: [466 -> ()[INT32]], [469 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_349 for ONNX node: Unsqueeze_349 TRT - VERBOSE Registering tensor: 470 for ONNX tensor: 470 TRT - VERBOSE Unsqueeze_349 [Unsqueeze] outputs: [470 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_350 [Concat] TRT - VERBOSE Searching for input: 468 TRT - VERBOSE Searching for input: 470 TRT - VERBOSE Concat_350 [Concat] inputs: [468 -> (1)[INT32]], [470 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_350 for ONNX node: Concat_350 TRT - VERBOSE Registering tensor: 471 for ONNX tensor: 471 TRT - VERBOSE Concat_350 [Concat] outputs: [471 -> (2)[INT32]], TRT - VERBOSE Parsing node: Reshape_351 [Reshape] TRT - VERBOSE Searching for input: 460 TRT - VERBOSE Searching for input: 471 TRT - VERBOSE Reshape_351 [Reshape] inputs: [460 -> (-1, 1, -1, -1)[INT32]], [471 -> (2)[INT32]], TRT - VERBOSE Registering layer: Reshape_351 for ONNX node: Reshape_351 TRT - VERBOSE Registering tensor: 472 for ONNX tensor: 472 TRT - VERBOSE Reshape_351 [Reshape] outputs: [472 -> (-1, -1)[INT32]], TRT - VERBOSE Parsing node: Shape_352 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_352 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_352 for ONNX node: Shape_352 TRT - VERBOSE Registering tensor: 473 for ONNX tensor: 473 TRT - VERBOSE Shape_352 [Shape] outputs: [473 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_354 [Gather] TRT - VERBOSE Searching for input: 473 TRT - VERBOSE Searching for input: 474 TRT - VERBOSE Gather_354 [Gather] inputs: [473 -> (4)[INT32]], [474 -> ()[INT32]], TRT - VERBOSE Registering layer: 474 for ONNX node: 474 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_354 for ONNX node: Gather_354 TRT - VERBOSE Registering tensor: 475 for ONNX tensor: 475 TRT - VERBOSE Gather_354 [Gather] outputs: [475 -> ()[INT32]], TRT - VERBOSE Parsing node: Shape_355 [Shape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Shape_355 [Shape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], TRT - VERBOSE Registering layer: Shape_355 for ONNX node: Shape_355 TRT - VERBOSE Registering tensor: 476 for ONNX tensor: 476 TRT - VERBOSE Shape_355 [Shape] outputs: [476 -> (4)[INT32]], TRT - VERBOSE Parsing node: Gather_357 [Gather] TRT - VERBOSE Searching for input: 476 TRT - VERBOSE Searching for input: 477 TRT - VERBOSE Gather_357 [Gather] inputs: [476 -> (4)[INT32]], [477 -> ()[INT32]], TRT - VERBOSE Registering layer: 477 for ONNX node: 477 TRT - VERBOSE Using Gather axis: 0 TRT - VERBOSE Registering layer: Gather_357 for ONNX node: Gather_357 TRT - VERBOSE Registering tensor: 478 for ONNX tensor: 478 TRT - VERBOSE Gather_357 [Gather] outputs: [478 -> ()[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_359 [Unsqueeze] TRT - VERBOSE Searching for input: 475 TRT - VERBOSE Searching for input: 479 TRT - VERBOSE Unsqueeze_359 [Unsqueeze] inputs: [475 -> ()[INT32]], [479 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_359 for ONNX node: Unsqueeze_359 TRT - VERBOSE Registering tensor: 480 for ONNX tensor: 480 TRT - VERBOSE Unsqueeze_359 [Unsqueeze] outputs: [480 -> (1)[INT32]], TRT - VERBOSE Parsing node: Unsqueeze_361 [Unsqueeze] TRT - VERBOSE Searching for input: 478 TRT - VERBOSE Searching for input: 481 TRT - VERBOSE Unsqueeze_361 [Unsqueeze] inputs: [478 -> ()[INT32]], [481 -> (1)[INT32]], TRT - VERBOSE Original shape: (), unsqueezing to: (1,) TRT - VERBOSE Registering layer: Unsqueeze_361 for ONNX node: Unsqueeze_361 TRT - VERBOSE Registering tensor: 482 for ONNX tensor: 482 TRT - VERBOSE Unsqueeze_361 [Unsqueeze] outputs: [482 -> (1)[INT32]], TRT - VERBOSE Parsing node: Concat_362 [Concat] TRT - VERBOSE Searching for input: 480 TRT - VERBOSE Searching for input: 482 TRT - VERBOSE Concat_362 [Concat] inputs: [480 -> (1)[INT32]], [482 -> (1)[INT32]], TRT - VERBOSE Registering layer: Concat_362 for ONNX node: Concat_362 TRT - VERBOSE Registering tensor: 483 for ONNX tensor: 483 TRT - VERBOSE Concat_362 [Concat] outputs: [483 -> (2)[INT32]], TRT - VERBOSE Parsing node: Reshape_363 [Reshape] TRT - VERBOSE Searching for input: 350 TRT - VERBOSE Searching for input: 483 TRT - VERBOSE Reshape_363 [Reshape] inputs: [350 -> (-1, 1, -1, -1)[FLOAT]], [483 -> (2)[INT32]], TRT - VERBOSE Registering layer: Reshape_363 for ONNX node: Reshape_363 TRT - VERBOSE Registering tensor: score_map_99 for ONNX tensor: score_map TRT - VERBOSE Reshape_363 [Reshape] outputs: [score_map -> (-1, -1)[FLOAT]], TRT - VERBOSE Parsing node: Cast_364 [Cast] TRT - VERBOSE Searching for input: 472 TRT - VERBOSE Cast_364 [Cast] inputs: [472 -> (-1, -1)[INT32]], TRT - VERBOSE Casting to type: float32 TRT - VERBOSE Registering layer: Cast_364 for ONNX node: Cast_364 TRT - VERBOSE Registering tensor: mask_100 for ONNX tensor: mask TRT - VERBOSE Cast_364 [Cast] outputs: [mask -> (-1, -1)[FLOAT]], TRT - VERBOSE Marking mask_100 as output: mask TRT - VERBOSE Marking score_map_99 as output: score_map TRT - VERBOSE Marking dense_feat_map_82 as output: dense_feat_map Completed Onnx file parsing TensorRT model parse report: Model parsing OK! Network Description Input 'input' with shape (-1, -1, -1, -1) and dtype DataType.FLOAT Output 'mask' with shape (-1, -1) and dtype DataType.FLOAT Output 'score_map' with shape (-1, -1) and dtype DataType.FLOAT Output 'dense_feat_map' with shape (-1, 128, -1, -1) and dtype DataType.FLOAT =============================================================== Beginning Trt engine building Start model optimization... TRT - VERBOSE Applying generic optimizations to the graph for inference. TRT - VERBOSE Original: 131 layers TRT - VERBOSE After dead-layer removal: 131 layers TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing (Unnamed Layer* 33) [Constant] with (Unnamed Layer* 34) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 95 with (Unnamed Layer* 38) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 178 with (Unnamed Layer* 85) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 181 with (Unnamed Layer* 89) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 263 with (Unnamed Layer* 136) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 266 with (Unnamed Layer* 140) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 348 with (Unnamed Layer* 187) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 357 with (Unnamed Layer* 197) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing (Unnamed Layer* 214) [Constant] with (Unnamed Layer* 215) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 445 with (Unnamed Layer* 242) [Shuffle] TRT - VERBOSE Running: ConstShuffleFusion TRT - VERBOSE ConstShuffleFusion: Fusing 450 with (Unnamed Layer* 252) [Shuffle] TRT - VERBOSE After Myelin optimization: 82 layers TRT - VERBOSE Applying ScaleNodes fusions. TRT - VERBOSE Running: ConstEltFusion TRT - VERBOSE ConstEltFusion: Fusing 178 + (Unnamed Layer* 85) [Shuffle] with Mul_110 TRT - VERBOSE Running: ConstEltFusion TRT - VERBOSE ConstEltFusion: Fusing 263 + (Unnamed Layer* 136) [Shuffle] with Mul_181 TRT - VERBOSE Running: ConstEltFusion TRT - VERBOSE ConstEltFusion: Fusing 348 + (Unnamed Layer* 187) [Shuffle] with Mul_252 TRT - VERBOSE After scale fusion: 79 layers TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_6 with Relu_7 TRT - VERBOSE Running: ScaleActivationFusion TRT - VERBOSE ScaleActivationFusion: Fusing BatchNormalization_11 with Relu_12 TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_13 with Relu_14 TRT - VERBOSE Running: ScaleActivationFusion TRT - VERBOSE ScaleActivationFusion: Fusing BatchNormalization_18 with Relu_19 TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_20 with Relu_21 TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_22 with Relu_23 TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_24 with Relu_25 TRT - VERBOSE Running: ConvReluFusion TRT - VERBOSE ConvReluFusion: Fusing Conv_26 with Relu_27 TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_93 from ACTIVATION to POINTWISE TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_91 from ACTIVATION to POINTWISE TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_164 from ACTIVATION to POINTWISE TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_162 from ACTIVATION to POINTWISE TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_235 from ACTIVATION to POINTWISE TRT - VERBOSE Running: ActivationToPointwiseConversion TRT - VERBOSE Swap the layer type of Softplus_233 from ACTIVATION to POINTWISE TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 95 + (Unnamed Layer* 38) [Shuffle] with Div_41 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_92 with PWN(Softplus_93) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_92, PWN(Softplus_93)) with Mul_94 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_90 with PWN(Softplus_91) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_90, PWN(Softplus_91)) with PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 181 + (Unnamed Layer* 89) [Shuffle] with Div_113 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_163 with PWN(Softplus_164) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_163, PWN(Softplus_164)) with Mul_165 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_161 with PWN(Softplus_162) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_161, PWN(Softplus_162)) with PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 266 + (Unnamed Layer* 140) [Shuffle] with Div_184 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_234 with PWN(Softplus_235) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_234, PWN(Softplus_235)) with Mul_236 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing Sub_232 with PWN(Softplus_233) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(Sub_232, PWN(Softplus_233)) with PWN(PWN(Sub_234, PWN(Softplus_235)), Mul_236) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 178 + (Unnamed Layer* 85) [Shuffle] + Mul_110 with Add_111 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(178 + (Unnamed Layer* 85) [Shuffle] + Mul_110, Add_111) with Add_182 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 263 + (Unnamed Layer* 136) [Shuffle] + Mul_181 with PWN(PWN(178 + (Unnamed Layer* 85) [Shuffle] + Mul_110, Add_111), Add_182) TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing PWN(263 + (Unnamed Layer* 136) [Shuffle] + Mul_181, PWN(PWN(178 + (Unnamed Layer* 85) [Shuffle] + Mul_110, Add_111), Add_182)) with Add_253 TRT - VERBOSE Running: PointWiseFusion TRT - VERBOSE PointWiseFusion: Fusing 348 + (Unnamed Layer* 187) [Shuffle] + Mul_252 with PWN(PWN(263 + (Unnamed Layer* 136) [Shuffle] + Mul_181, PWN(PWN(178 + (Unnamed Layer* 85) [Shuffle] + Mul_110, Add_111), Add_182)), Add_253) TRT - VERBOSE After vertical fusions: 51 layers TRT - VERBOSE After dupe layer removal: 51 layers TRT - VERBOSE After final dead-layer removal: 51 layers TRT - VERBOSE After tensor merging: 51 layers TRT - VERBOSE After slice removal: 51 layers TRT - VERBOSE After concat removal: 51 layers TRT - VERBOSE Graph construction and optimization completed in 0.0271288 seconds. TRT - VERBOSE Using cublasLt as a tactic source TRT - WARNING TensorRT was linked against cuBLAS/cuBLAS LT 11.8.0 but loaded cuBLAS/cuBLAS LT 11.5.2 TRT - INFO [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +8, now: CPU 10879, GPU 2495 (MiB) TRT - VERBOSE Using cuDNN as a tactic source TRT - INFO [MemUsageChange] Init cuDNN: CPU +0, GPU +8, now: CPU 10879, GPU 2503 (MiB) TRT - WARNING TensorRT was linked against cuDNN 8.3.2 but loaded cuDNN 8.2.4 TRT - INFO Local timing cache in use. Profiling results in this builder pass will not be stored. TRT - VERBOSE Constructing optimization profile number 0 [1/1]. TRT - VERBOSE Reserving memory for activation tensors. Host: 0 bytes Device: 24246144 bytes TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.009216 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009216 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.171008 TRT - VERBOSE Tactic: 0 Time: 1.51347 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.171008 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.011264 TRT - VERBOSE Tactic: 0 Time: 0.009216 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009216 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.0768 TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.183296 TRT - VERBOSE Tactic: 0 Time: 1.56877 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.183296 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.0768 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.08192 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.198656 TRT - VERBOSE Tactic: 0 Time: 1.5145 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.198656 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.084992 TRT - VERBOSE Tactic: 0 Time: 0.054272 TRT - VERBOSE Fastest Tactic: 0 Time: 0.054272 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.083968 TRT - VERBOSE Tactic: 0 Time: 0.055296 TRT - VERBOSE Fastest Tactic: 0 Time: 0.055296 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(380 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.095232 TRT - VERBOSE Tactic: 0 Time: 0.067584 TRT - VERBOSE Fastest Tactic: 0 Time: 0.067584 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) -10) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 3 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,3) where E0=(* 3 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(input -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.01536 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01536 TRT - VERBOSE *************** Autotuning Reformat: Float((* 3 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(input -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.017408 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.01536 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.34816 TRT - VERBOSE Tactic: 0 Time: 0.351232 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.34816 TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.347136 TRT - VERBOSE Tactic: 0 Time: 0.351232 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.347136 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.349184 TRT - VERBOSE Tactic: 0 Time: 1.46534 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.349184 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.3584 TRT - VERBOSE Tactic: 0 Time: 0.351232 TRT - VERBOSE Fastest Tactic: 0 Time: 0.351232 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.34816 TRT - VERBOSE Tactic: 0 Time: 1.4633 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.34816 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(44 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.352256 TRT - VERBOSE Tactic: 0 Time: 0.352256 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.352256 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.349184 TRT - VERBOSE Tactic: 0 Time: 0.350208 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.349184 TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.346112 TRT - VERBOSE Tactic: 0 Time: 0.351232 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.346112 TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.349184 TRT - VERBOSE Tactic: 0 Time: 1.82374 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.349184 TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.930816 TRT - VERBOSE Tactic: 0 Time: 1.64659 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.930816 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.350208 TRT - VERBOSE Tactic: 0 Time: 1.46944 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.350208 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.354304 TRT - VERBOSE Tactic: 0 Time: 0.350208 TRT - VERBOSE Fastest Tactic: 0 Time: 0.350208 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.357376 TRT - VERBOSE Tactic: 0 Time: 3.0249 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.357376 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.351232 TRT - VERBOSE Tactic: 0 Time: 0.109568 TRT - VERBOSE Fastest Tactic: 0 Time: 0.109568 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.038912 TRT - VERBOSE Tactic: 0 Time: 0.032768 TRT - VERBOSE Fastest Tactic: 0 Time: 0.032768 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 45) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.038912 TRT - VERBOSE Tactic: 0 Time: 0.304128 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.038912 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.039936 TRT - VERBOSE Tactic: 0 Time: 0.171008 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.039936 TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.100352 TRT - VERBOSE Tactic: 0 Time: 0.134144 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.100352 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.039936 TRT - VERBOSE Tactic: 0 Time: 0.304128 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.039936 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.04096 TRT - VERBOSE Tactic: 0 Time: 0.300032 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.04096 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.04096 TRT - VERBOSE Tactic: 0 Time: 0.106496 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.04096 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.04096 TRT - VERBOSE Tactic: 0 Time: 0.032768 TRT - VERBOSE Fastest Tactic: 0 Time: 0.032768 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.038912 TRT - VERBOSE Tactic: 0 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(45 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.944128 TRT - VERBOSE Tactic: 0 Time: 0.06656 TRT - VERBOSE Fastest Tactic: 0 Time: 0.06656 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 96) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.041984 TRT - VERBOSE Tactic: 0 Time: 0.106496 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.041984 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 96) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.039936 TRT - VERBOSE Tactic: 0 Time: 0.032768 TRT - VERBOSE Fastest Tactic: 0 Time: 0.032768 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 96) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.038912 TRT - VERBOSE Tactic: 0 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 96) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.943104 TRT - VERBOSE Tactic: 0 Time: 0.06656 TRT - VERBOSE Fastest Tactic: 0 Time: 0.06656 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(120 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.034816 TRT - VERBOSE Tactic: 0 Time: 0.031744 TRT - VERBOSE Fastest Tactic: 0 Time: 0.031744 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(120 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.943104 TRT - VERBOSE Tactic: 0 Time: 0.067584 TRT - VERBOSE Fastest Tactic: 0 Time: 0.067584 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(120 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.94208 TRT - VERBOSE Tactic: 0 Time: 0.114688 TRT - VERBOSE Fastest Tactic: 0 Time: 0.114688 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.03072 TRT - VERBOSE Fastest Tactic: 0 Time: 0.03072 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.103424 TRT - VERBOSE Tactic: 0 Time: 0.140288 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.103424 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.031744 TRT - VERBOSE Fastest Tactic: 0 Time: 0.031744 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.986112 TRT - VERBOSE Tactic: 0 Time: 0.140288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.140288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.98304 TRT - VERBOSE Tactic: 0 Time: 0.069632 TRT - VERBOSE Fastest Tactic: 0 Time: 0.069632 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(143 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.98304 TRT - VERBOSE Tactic: 0 Time: 0.069632 TRT - VERBOSE Fastest Tactic: 0 Time: 0.069632 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.031744 TRT - VERBOSE Fastest Tactic: 0 Time: 0.031744 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.100352 TRT - VERBOSE Tactic: 0 Time: 0.134144 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.100352 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.9728 TRT - VERBOSE Tactic: 0 Time: 5.05856 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.9728 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.945152 TRT - VERBOSE Tactic: 0 Time: 0.134144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.134144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.946176 TRT - VERBOSE Tactic: 0 Time: 5.05139 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.946176 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.94208 TRT - VERBOSE Tactic: 0 Time: 0.067584 TRT - VERBOSE Fastest Tactic: 0 Time: 0.067584 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.946176 TRT - VERBOSE Tactic: 0 Time: 5.05754 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.946176 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.820224 TRT - VERBOSE Tactic: 0 Time: 0.114688 TRT - VERBOSE Fastest Tactic: 0 Time: 0.114688 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(145 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.823296 TRT - VERBOSE Tactic: 0 Time: 0.219136 TRT - VERBOSE Fastest Tactic: 0 Time: 0.219136 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.021504 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.021504 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.017408 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.0256 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(54 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.021504 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.022528 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.04608 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.048128 TRT - VERBOSE Tactic: 0 Time: 0.070656 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.048128 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.022528 TRT - VERBOSE Tactic: 0 Time: 0.026624 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.022528 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.02048 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.055296 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 55) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.05632 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.045056 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.048128 TRT - VERBOSE Tactic: 0 Time: 0.070656 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.048128 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.05632 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.05632 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(55 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.41472 TRT - VERBOSE Tactic: 0 Time: 0.037888 TRT - VERBOSE Fastest Tactic: 0 Time: 0.037888 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 182) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.022528 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.022528 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 182) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.018176 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.017408 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 182) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.018432 TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 182) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.413696 TRT - VERBOSE Tactic: 0 Time: 0.03584 TRT - VERBOSE Fastest Tactic: 0 Time: 0.03584 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.007168 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.16384 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008192 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.16384 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.006784 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006784 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.166912 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.03472 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(156 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.007104 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007104 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(206 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.02048 TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Fastest Tactic: 0 Time: 0.018432 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(206 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.411648 TRT - VERBOSE Tactic: 0 Time: 0.037888 TRT - VERBOSE Fastest Tactic: 0 Time: 0.037888 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(206 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.418816 TRT - VERBOSE Tactic: 0 Time: 0.06144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.06144 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.02048 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 0 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.050176 TRT - VERBOSE Tactic: 0 Time: 0.072704 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.050176 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 0 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.436224 TRT - VERBOSE Tactic: 0 Time: 0.073728 TRT - VERBOSE Fastest Tactic: 0 Time: 0.073728 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.436224 TRT - VERBOSE Tactic: 0 Time: 0.037888 TRT - VERBOSE Fastest Tactic: 0 Time: 0.037888 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 5) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 5) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(229 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.4352 TRT - VERBOSE Tactic: 0 Time: 0.037888 TRT - VERBOSE Fastest Tactic: 0 Time: 0.037888 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.021504 TRT - VERBOSE Tactic: 0 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 0 Time: 0.019456 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.047104 TRT - VERBOSE Tactic: 0 Time: 0.069632 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.047104 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.425984 TRT - VERBOSE Tactic: 0 Time: 1.22778 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.425984 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.413696 TRT - VERBOSE Tactic: 0 Time: 0.069632 TRT - VERBOSE Fastest Tactic: 0 Time: 0.069632 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.413696 TRT - VERBOSE Tactic: 0 Time: 1.22675 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.413696 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.411648 TRT - VERBOSE Tactic: 0 Time: 0.03584 TRT - VERBOSE Fastest Tactic: 0 Time: 0.03584 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.413696 TRT - VERBOSE Tactic: 0 Time: 1.26259 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.413696 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.411648 TRT - VERBOSE Tactic: 0 Time: 0.060416 TRT - VERBOSE Fastest Tactic: 0 Time: 0.060416 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(230 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.412672 TRT - VERBOSE Tactic: 0 Time: 0.111616 TRT - VERBOSE Fastest Tactic: 0 Time: 0.111616 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.009216 TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.01536 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.013312 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.013312 TRT - VERBOSE Fastest Tactic: 0 Time: 0.013312 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.01536 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.01536 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(64 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.02048 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.02048 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(241 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.019456 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> dense_feat_map) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 0 Time: 0.014336 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> dense_feat_map) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.014336 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(dense_feat_map -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.013312 TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(dense_feat_map -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.027648 TRT - VERBOSE Tactic: 0 Time: 0.036864 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.027648 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.013312 TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.01536 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.014336 TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.026624 TRT - VERBOSE Tactic: 0 Time: 0.036864 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.026624 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.03072 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.029696 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 0 Time: 0.014336 TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 0 Time: 0.011264 TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 267) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.022528 TRT - VERBOSE Fastest Tactic: 0 Time: 0.022528 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01536 TRT - VERBOSE Tactic: 0 Time: 0.014336 TRT - VERBOSE Fastest Tactic: 0 Time: 0.014336 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.208896 TRT - VERBOSE Tactic: 0 Time: 0.02048 TRT - VERBOSE Fastest Tactic: 0 Time: 0.02048 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.03072 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.014336 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.029696 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(267 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.012288 TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.012288 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(291 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008192 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(291 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.021504 TRT - VERBOSE Fastest Tactic: 0 Time: 0.021504 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(291 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.210944 TRT - VERBOSE Tactic: 0 Time: 0.034816 TRT - VERBOSE Fastest Tactic: 0 Time: 0.034816 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013376 TRT - VERBOSE Tactic: 0 Time: 0.009216 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009216 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.029696 TRT - VERBOSE Tactic: 0 Time: 0.038912 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.029696 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008192 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.224256 TRT - VERBOSE Tactic: 0 Time: 0.038912 TRT - VERBOSE Fastest Tactic: 0 Time: 0.038912 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.222208 TRT - VERBOSE Tactic: 0 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 0 Time: 0.023552 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 3) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 3) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(314 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.223232 TRT - VERBOSE Tactic: 0 Time: 0.021504 TRT - VERBOSE Fastest Tactic: 0 Time: 0.021504 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013408 TRT - VERBOSE Tactic: 0 Time: 0.009184 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009184 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.028672 TRT - VERBOSE Tactic: 0 Time: 0.036864 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.028672 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.216064 TRT - VERBOSE Tactic: 0 Time: 0.613376 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.216064 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.210944 TRT - VERBOSE Tactic: 0 Time: 0.036864 TRT - VERBOSE Fastest Tactic: 0 Time: 0.036864 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.612352 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.20992 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 0 Time: 0.023552 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.211968 TRT - VERBOSE Tactic: 0 Time: 0.632832 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.211968 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(315 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.20992 TRT - VERBOSE Tactic: 0 Time: 0.058368 TRT - VERBOSE Fastest Tactic: 0 Time: 0.058368 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.01024 TRT - VERBOSE Tactic: 0 Time: 0.011264 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.010208 TRT - VERBOSE Fastest Tactic: 0 Time: 0.010208 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.016384 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(326 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.017408 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* 4 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0),E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) -> Float((* 128 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.034816 TRT - VERBOSE Tactic: 0 Time: 0.16384 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.034816 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.006144 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.007136 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007136 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.16384 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.165888 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.009216 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009216 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.010144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.010144 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 350) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 434) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.006208 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.006208 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat( -> 434) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.009216 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.008192 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.008192 TRT - VERBOSE Tactic: 0 Time: 0.008064 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008064 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.007168 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.034624 TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008192 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.03584 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Optimizer Reformat(434 -> ) (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.034816 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) -> Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float(1:4,(* (# 2 (SHAPE input)) (# 3 (SHAPE input))),(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE *************** Autotuning Reformat: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing reformatting costs TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: -> Int32(), Int32() *************** TRT - VERBOSE --------------- Timing Runner: [HostToDeviceCopy] (ShapeHostToDevice) TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 0 Time: 0.008192 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: ShapeHostToDevice Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: -> Float(1,1,1,1) *************** TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(1,1,1,1) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: ConstantOfShape_277 (Padding) TRT - VERBOSE Padding has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: ConstantOfShape_277 (Slice) TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E1,1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.014336 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E1,1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.008224 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.141312 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 1002 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.013312 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,1) -> Float(E1,1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.006144 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 1002 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,1) -> Float(E1,1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.141312 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.032768 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 1002 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,1) -> Float(E1,1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,1) -> Float(E1,1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.03072 TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.033792 TRT - VERBOSE Tactic: 0 Time: 0.14336 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.033792 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 1002 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1:32,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1:32,E0,1) -> Float(E1,1,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.009312 TRT - VERBOSE Fastest Tactic: 0 Time: 0.009312 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1:32,E0,1) -> Float(E1,1:4,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.031744 TRT - VERBOSE Tactic: 0 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 0 Time: 0.01024 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1:32,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) *************** TRT - VERBOSE --------------- Timing Runner: Cast_278 (Cast) TRT - VERBOSE Cast has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Cast_278 (Reformat) TRT - VERBOSE Tactic: 1002 Time: 0.032768 TRT - VERBOSE Tactic: 0 Time: 0.14848 TRT - VERBOSE Fastest Tactic: 1002 Time: 0.032768 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reformat Tactic: 1002 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E2,E2,(# 3 (SHAPE input)),1) where E0=(+ (# 3 (SHAPE input)) -10) E1=(* (+ (# 2 (SHAPE input)) -10) E0) E2=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Pad_293 (Padding) TRT - VERBOSE Tactic: 0 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 0 Time: 0.00512 TRT - VERBOSE --------------- Timing Runner: Pad_293 (Slice) TRT - VERBOSE Tactic: 0 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Padding Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 3 E0),E0,(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (FusedConvActConvolution) TRT - VERBOSE FusedConvActConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.229376 TRT - VERBOSE Tactic: 1 Time: 0.202752 TRT - VERBOSE Tactic: 2 Time: 0.308224 TRT - VERBOSE Tactic: 5 Time: 2.84467 TRT - VERBOSE Tactic: 6 Time: 0.241664 TRT - VERBOSE Tactic: 56 Time: 0.203776 TRT - VERBOSE Tactic: 57 Time: 0.169984 TRT - VERBOSE Tactic: 58 Time: 0.304128 TRT - VERBOSE Tactic: 61 Time: 2.85082 TRT - VERBOSE Tactic: 62 Time: 0.214016 TRT - VERBOSE Tactic: 112 Time: 0.202752 TRT - VERBOSE Tactic: 113 Time: 0.169984 TRT - VERBOSE Tactic: 114 Time: 0.310272 TRT - VERBOSE Tactic: 117 Time: 2.84058 TRT - VERBOSE Tactic: 118 Time: 0.21504 TRT - VERBOSE Fastest Tactic: 57 Time: 0.169984 TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (CaskConvolution) TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.279552 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.162816 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.106496 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.152576 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.264192 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.19456 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.257024 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.285696 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.116736 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.106496 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.154624 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.059392 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.274432 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.185344 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.196608 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.108544 TRT - VERBOSE Fastest Tactic: -6313876406580483184 Time: 0.059392 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -6313876406580483184 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) E0),1,E0,3) -> Float((* (# 2 (SHAPE input)) E1),1,E1,32) where E0=(* 3 (# 3 (SHAPE input))) E1=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (CaskConvolution) TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.19968 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.177152 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.859136 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.182272 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.185344 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.1792 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.325632 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.173056 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.173056 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Conv_6 + Relu_7 (CaskConvolution) TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.19968 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.177152 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.861184 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.183296 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.391168 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.185344 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.1792 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.381952 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.385024 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.326656 TRT - VERBOSE Conv_6 + Relu_7 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.173056 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.173056 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E0,(# 3 (SHAPE input)),1) -> Float(E1,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(* 32 E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_8 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_8 (FusedConvActConvolution) TRT - VERBOSE FusedConvActConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_8 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.429056 TRT - VERBOSE Tactic: 1 Time: 0.32768 TRT - VERBOSE Tactic: 2 Time: 1.35885 TRT - VERBOSE Tactic: 5 Time: 3.86867 TRT - VERBOSE Tactic: 6 Time: 0.212992 TRT - VERBOSE Tactic: 56 Time: 0.43008 TRT - VERBOSE Tactic: 57 Time: 0.328704 TRT - VERBOSE Tactic: 58 Time: 1.35782 TRT - VERBOSE Tactic: 61 Time: 3.88608 TRT - VERBOSE Tactic: 62 Time: 0.212992 TRT - VERBOSE Tactic: 112 Time: 0.429056 TRT - VERBOSE Tactic: 113 Time: 0.329728 TRT - VERBOSE Tactic: 114 Time: 1.34656 TRT - VERBOSE Tactic: 117 Time: 3.87379 TRT - VERBOSE Tactic: 118 Time: 0.214016 TRT - VERBOSE Fastest Tactic: 6 Time: 0.212992 TRT - VERBOSE --------------- Timing Runner: Conv_8 (CaskConvolution) TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 1.11206 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.626688 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.539648 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.637952 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 1.05574 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.94208 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 1.03322 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 1.11923 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.52224 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.202752 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.64512 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.321536 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 1.06598 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.519168 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.544768 TRT - VERBOSE Conv_8 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.320512 TRT - VERBOSE Fastest Tactic: -7777264329408437990 Time: 0.202752 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7777264329408437990 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,32) -> Float(E1,1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_8 (CaskConvolution) TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.34816 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.282624 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 1.31584 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.306176 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.295936 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.313344 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.523264 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.280576 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.280576 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,8) -> Float(E1,1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_8 (CaskConvolution) TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.405504 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.395264 TRT - VERBOSE Conv_8 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.397312 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.395264 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E0,(# 3 (SHAPE input)),1) -> Float(E1,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(* 32 E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.054272 TRT - VERBOSE Tactic: 1 Time: 0.054272 TRT - VERBOSE Tactic: 2 Time: 0.05632 TRT - VERBOSE Tactic: 3 Time: 0.05632 TRT - VERBOSE Tactic: 4 Time: 0.055296 TRT - VERBOSE Tactic: 5 Time: 0.05632 TRT - VERBOSE Tactic: 6 Time: 0.055296 TRT - VERBOSE Tactic: 7 Time: 0.05632 TRT - VERBOSE Tactic: 8 Time: 0.101376 TRT - VERBOSE Tactic: 9 Time: 0.094208 TRT - VERBOSE Tactic: 28 Time: 0.123904 TRT - VERBOSE Fastest Tactic: 0 Time: 0.054272 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,32) -> Float(E1,1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.131072 TRT - VERBOSE Tactic: 1 Time: 0.109568 TRT - VERBOSE Tactic: 2 Time: 0.120832 TRT - VERBOSE Tactic: 3 Time: 0.095232 TRT - VERBOSE Tactic: 4 Time: 0.114688 TRT - VERBOSE Tactic: 5 Time: 0.106496 TRT - VERBOSE Tactic: 6 Time: 0.090112 TRT - VERBOSE Tactic: 7 Time: 0.105472 TRT - VERBOSE Tactic: 8 Time: 0.110592 TRT - VERBOSE Tactic: 9 Time: 0.101376 TRT - VERBOSE Tactic: 28 Time: 0.123904 TRT - VERBOSE Fastest Tactic: 6 Time: 0.090112 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 6 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,8) -> Float(E1,1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.103424 TRT - VERBOSE Tactic: 1 Time: 0.098304 TRT - VERBOSE Tactic: 2 Time: 0.099328 TRT - VERBOSE Tactic: 3 Time: 0.096256 TRT - VERBOSE Tactic: 4 Time: 0.364544 TRT - VERBOSE Tactic: 5 Time: 0.365568 TRT - VERBOSE Tactic: 6 Time: 0.359424 TRT - VERBOSE Tactic: 7 Time: 0.365568 TRT - VERBOSE Tactic: 8 Time: 0.38912 TRT - VERBOSE Tactic: 9 Time: 0.393216 TRT - VERBOSE Tactic: 10 Time: 0.3584 TRT - VERBOSE Tactic: 11 Time: 0.3584 TRT - VERBOSE Tactic: 12 Time: 0.35328 TRT - VERBOSE Tactic: 13 Time: 0.804864 TRT - VERBOSE Tactic: 14 Time: 0.809984 TRT - VERBOSE Tactic: 15 Time: 0.807936 TRT - VERBOSE Tactic: 16 Time: 0.813056 TRT - VERBOSE Tactic: 17 Time: 0.82432 TRT - VERBOSE Tactic: 18 Time: 0.83968 TRT - VERBOSE Tactic: 19 Time: 0.804864 TRT - VERBOSE Tactic: 20 Time: 0.812032 TRT - VERBOSE Tactic: 21 Time: 0.826368 TRT - VERBOSE Tactic: 22 Time: 0.841728 TRT - VERBOSE Tactic: 23 Time: 0.836608 TRT - VERBOSE Tactic: 28 Time: 0.786432 TRT - VERBOSE Tactic: 29 Time: 0.782336 TRT - VERBOSE Tactic: 30 Time: 0.809984 TRT - VERBOSE Fastest Tactic: 3 Time: 0.096256 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 3 TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0:32,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWiseV2) TRT - VERBOSE Tactic: 24 Time: 0.791552 TRT - VERBOSE Tactic: 25 Time: 0.80384 TRT - VERBOSE Tactic: 26 Time: 0.817152 TRT - VERBOSE Tactic: 27 Time: 0.8192 TRT - VERBOSE Tactic: 31 Time: 0.799744 TRT - VERBOSE Fastest Tactic: 24 Time: 0.791552 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 24 TRT - VERBOSE *************** Autotuning format combination: Float(1:4,E0,(# 3 (SHAPE input)),1) -> Float(1:4,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: PWN(95 + (Unnamed Layer* 38) [Shuffle], Div_41) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 12.7375 TRT - VERBOSE Tactic: 1 Time: 14.6289 TRT - VERBOSE Tactic: 2 Time: 14.4886 TRT - VERBOSE Tactic: 3 Time: 16.382 TRT - VERBOSE Tactic: 4 Time: 17.5503 TRT - VERBOSE Tactic: 5 Time: 15.3108 TRT - VERBOSE Tactic: 6 Time: 19.7263 TRT - VERBOSE Tactic: 7 Time: 20.31 TRT - VERBOSE Tactic: 8 Time: 17.9128 TRT - VERBOSE Tactic: 9 Time: 20.992 TRT - VERBOSE Tactic: 10 Time: 7.37997 TRT - VERBOSE Tactic: 11 Time: 8.41011 TRT - VERBOSE Tactic: 12 Time: 7.85101 TRT - VERBOSE Tactic: 13 Time: 9.77613 TRT - VERBOSE Tactic: 14 Time: 9.52115 TRT - VERBOSE Tactic: 15 Time: 8.35891 TRT - VERBOSE Tactic: 16 Time: 12.4068 TRT - VERBOSE Tactic: 17 Time: 13.9735 TRT - VERBOSE Tactic: 18 Time: 13.2301 TRT - VERBOSE Tactic: 19 Time: 11.7094 TRT - VERBOSE Tactic: 20 Time: 6.12659 TRT - VERBOSE Tactic: 21 Time: 6.78605 TRT - VERBOSE Tactic: 22 Time: 7.92371 TRT - VERBOSE Tactic: 23 Time: 10.3066 TRT - VERBOSE Tactic: 28 Time: 3.15597 TRT - VERBOSE Tactic: 29 Time: 3.09555 TRT - VERBOSE Tactic: 30 Time: 3.20819 TRT - VERBOSE Fastest Tactic: 29 Time: 3.09555 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 29 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: ReduceMean_89 (Reduce) TRT - VERBOSE Tactic: 5 Time: 0.402432 TRT - VERBOSE Tactic: 7 Time: 0.39936 TRT - VERBOSE Tactic: 8 Time: 0.400384 TRT - VERBOSE Fastest Tactic: 7 Time: 0.39936 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reduce Tactic: 7 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_62 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.83456 TRT - VERBOSE Tactic: 1 Time: 1.58106 TRT - VERBOSE Fastest Tactic: 0 Time: 0.83456 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) E0),1,E0,32) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_62 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 3.34336 TRT - VERBOSE Tactic: 1 Time: 1.60256 TRT - VERBOSE Fastest Tactic: 1 Time: 1.60256 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_62 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 6.28019 TRT - VERBOSE Tactic: 1 Time: 2.7904 TRT - VERBOSE Fastest Tactic: 1 Time: 2.7904 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0:32,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_62 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 5.15686 TRT - VERBOSE Tactic: 1 Time: 0.884736 TRT - VERBOSE Fastest Tactic: 1 Time: 0.884736 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E0,(# 3 (SHAPE input)),1) -> Float(E1,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(* 32 E0) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_11 + Relu_12 (Scale) TRT - VERBOSE Tactic: 0 Time: 0.031744 TRT - VERBOSE Fastest Tactic: 0 Time: 0.031744 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Scale Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,32) -> Float(E1,1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_11 + Relu_12 (Scale) TRT - VERBOSE Scale has no valid tactics for this config, skipping TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,8) -> Float(E1,1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_11 + Relu_12 (Scale) TRT - VERBOSE Scale has no valid tactics for this config, skipping TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) -> Float((* 64 E2),E2,E1,1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E2=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (FusedConvActConvolution) TRT - VERBOSE Tactic: 458751 Time: 0.10752 TRT - VERBOSE Fastest Tactic: 458751 Time: 0.10752 TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.099328 TRT - VERBOSE Tactic: 1 Time: 0.116736 TRT - VERBOSE Tactic: 2 Time: 0.17408 TRT - VERBOSE Tactic: 5 Time: 2.71462 TRT - VERBOSE Tactic: 56 Time: 0.098304 TRT - VERBOSE Tactic: 57 Time: 0.124928 TRT - VERBOSE Tactic: 58 Time: 0.175104 TRT - VERBOSE Tactic: 61 Time: 2.61018 TRT - VERBOSE Tactic: 112 Time: 0.100352 TRT - VERBOSE Tactic: 113 Time: 0.10752 TRT - VERBOSE Tactic: 114 Time: 0.17408 TRT - VERBOSE Tactic: 117 Time: 2.72896 TRT - VERBOSE Fastest Tactic: 56 Time: 0.098304 TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (CaskConvolution) TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.095232 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.05632 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.053248 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.060416 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.096256 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.084992 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.090112 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.099328 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.052224 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.06144 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.064512 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.096256 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.086016 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.083968 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.054272 TRT - VERBOSE Fastest Tactic: 6767548733843469815 Time: 0.052224 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 6767548733843469815 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) E0),1,E0,32) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E1),1,E1,64) where E0=(* 32 (# 3 (SHAPE input))) E1=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) *************** TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (CaskConvolution) TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.072704 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.079872 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.219136 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.069632 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.083968 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.064512 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.1024 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.079872 TRT - VERBOSE Fastest Tactic: -7734972403155710907 Time: 0.064512 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7734972403155710907 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E1),1:4,E1,16) where E0=(* 8 (# 3 (SHAPE input))) E1=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) *************** TRT - VERBOSE --------------- Timing Runner: Conv_13 + Relu_14 (CaskConvolution) TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.064512 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.063488 TRT - VERBOSE Conv_13 + Relu_14 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.064512 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.063488 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0,(# 3 (SHAPE input)),1) -> Float(E2,E2,E1,1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(+ (# 3 (SHAPE input)) 6) E2=(* (+ (# 2 (SHAPE input)) 6) E1) *************** TRT - VERBOSE --------------- Timing Runner: Pad_76 (Padding) TRT - VERBOSE Padding has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Pad_76 (Slice) TRT - VERBOSE Tactic: 0 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.033792 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E2,E2,(# 3 (SHAPE input)),1) where E0=(+ (# 3 (SHAPE input)) 6) E1=(* (+ (# 2 (SHAPE input)) 6) E0) E2=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Conv_78 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_78 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.92672 TRT - VERBOSE Tactic: 1 Time: 6.9632 TRT - VERBOSE Tactic: 2 Time: 1.03526 TRT - VERBOSE Tactic: 56 Time: 0.90624 TRT - VERBOSE Tactic: 57 Time: 6.96013 TRT - VERBOSE Tactic: 58 Time: 1.03526 TRT - VERBOSE Tactic: 112 Time: 0.90624 TRT - VERBOSE Tactic: 113 Time: 0.29184 TRT - VERBOSE Tactic: 114 Time: 1.03424 TRT - VERBOSE Fastest Tactic: 113 Time: 0.29184 TRT - VERBOSE --------------- Timing Runner: Conv_78 (CaskConvolution) TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 1.84627 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 1.05062 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.550912 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.903168 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 1.80838 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 1.19808 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 1.71315 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 1.91898 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.627712 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: -7491730084094677098 TRT - VERBOSE Tactic: -7491730084094677098 Time: 0.251904 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.9216 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.264192 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: -6273689210331812572 TRT - VERBOSE Tactic: -6273689210331812572 Time: 1.17862 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 1.85037 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: -4337126844824617177 TRT - VERBOSE Tactic: -4337126844824617177 Time: 0.523264 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 1.05472 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 1.19194 TRT - VERBOSE Conv_78 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.556032 TRT - VERBOSE Fastest Tactic: -7491730084094677098 Time: 0.251904 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7491730084094677098 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ (# 2 (SHAPE input)) 6) E0),1,E0,1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Conv_78 (CaskConvolution) TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 1.64659 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 1.43462 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 11.8026 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 1.5104 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 1.50221 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 1.47354 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 2.74432 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 1.42336 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 1.42336 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ (# 2 (SHAPE input)) 6) E0),1:4,E0,1) -> Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) where E0=(+ (# 3 (SHAPE input)) 6) *************** TRT - VERBOSE --------------- Timing Runner: Conv_78 (CaskConvolution) TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 1.66298 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 1.44486 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 11.8456 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 1.53702 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 3.40787 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 1.51245 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 1.48787 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 3.24301 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 3.27885 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 2.74739 TRT - VERBOSE Conv_78 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 1.42131 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 1.42131 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0,(# 3 (SHAPE input)),1) -> Float((* 32 E0),E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_88 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.03072 TRT - VERBOSE Tactic: 1 Time: 0.06144 TRT - VERBOSE Fastest Tactic: 0 Time: 0.03072 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_88 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.103424 TRT - VERBOSE Tactic: 1 Time: 0.06656 TRT - VERBOSE Fastest Tactic: 1 Time: 0.06656 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float((* (# 2 (SHAPE input)) E0),1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_88 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.1792 TRT - VERBOSE Tactic: 1 Time: 0.799744 TRT - VERBOSE Fastest Tactic: 0 Time: 0.1792 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0:32,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_88 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.304128 TRT - VERBOSE Tactic: 1 Time: 0.771072 TRT - VERBOSE Fastest Tactic: 0 Time: 0.304128 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 64 E1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_15 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_15 (FusedConvActConvolution) TRT - VERBOSE Tactic: 524287 Time: 0.126976 TRT - VERBOSE Tactic: 720895 Time: 0.106496 TRT - VERBOSE Tactic: 983039 Time: 0.09216 TRT - VERBOSE Tactic: 1048575 Time: 0.103424 TRT - VERBOSE Tactic: 1703935 Time: 0.094208 TRT - VERBOSE Tactic: 1769471 Time: 0.11776 TRT - VERBOSE Tactic: 2424831 Time: 0.128 TRT - VERBOSE Tactic: 2621439 Time: 0.096256 TRT - VERBOSE Tactic: 3014655 Time: 0.100352 TRT - VERBOSE Tactic: 3604479 Time: 0.096256 TRT - VERBOSE Tactic: 5046271 Time: 0.099328 TRT - VERBOSE Tactic: 6488063 Time: 0.101376 TRT - VERBOSE Tactic: 7274495 Time: 0.106496 TRT - VERBOSE Tactic: 7864319 Time: 0.096256 TRT - VERBOSE Tactic: 8847359 Time: 0.119808 TRT - VERBOSE Tactic: 9043967 Time: 0.091136 TRT - VERBOSE Tactic: 9961471 Time: 0.120832 TRT - VERBOSE Tactic: 10027007 Time: 0.098304 TRT - VERBOSE Tactic: 10485759 Time: 0.090112 TRT - VERBOSE Tactic: 10682367 Time: 0.095232 TRT - VERBOSE Tactic: 10813439 Time: 0.091136 TRT - VERBOSE Fastest Tactic: 10485759 Time: 0.090112 TRT - VERBOSE --------------- Timing Runner: Conv_15 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.200704 TRT - VERBOSE Tactic: 1 Time: 0.120832 TRT - VERBOSE Tactic: 2 Time: 0.244736 TRT - VERBOSE Tactic: 5 Time: 1.09773 TRT - VERBOSE Tactic: 6 Time: 0.058368 TRT - VERBOSE Tactic: 56 Time: 0.208896 TRT - VERBOSE Tactic: 57 Time: 0.121856 TRT - VERBOSE Tactic: 58 Time: 0.262144 TRT - VERBOSE Tactic: 61 Time: 1.0537 TRT - VERBOSE Tactic: 62 Time: 0.058368 TRT - VERBOSE Tactic: 112 Time: 0.20992 TRT - VERBOSE Tactic: 113 Time: 0.083968 TRT - VERBOSE Tactic: 114 Time: 0.260096 TRT - VERBOSE Tactic: 117 Time: 1.06394 TRT - VERBOSE Tactic: 118 Time: 0.058368 TRT - VERBOSE Fastest Tactic: 6 Time: 0.058368 TRT - VERBOSE --------------- Timing Runner: Conv_15 (CaskConvolution) TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.152576 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.088064 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.08192 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.09728 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.14336 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.136192 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.141312 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.1536 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.08192 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.055296 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.096256 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.096256 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.146432 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.139264 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.137216 TRT - VERBOSE Conv_15 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.083968 TRT - VERBOSE Fastest Tactic: -7777264329408437990 Time: 0.055296 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7777264329408437990 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,64) -> Float(E1,1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_15 (CaskConvolution) TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.109568 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.08704 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.31744 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.105472 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.077824 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.101376 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.14848 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.084992 TRT - VERBOSE Fastest Tactic: -9217704540809507511 Time: 0.077824 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -9217704540809507511 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,16) -> Float(E1,1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_15 (CaskConvolution) TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.098304 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.094208 TRT - VERBOSE Conv_15 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.095232 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.094208 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 64 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Tactic: 1 Time: 0.017408 TRT - VERBOSE Tactic: 2 Time: 0.017408 TRT - VERBOSE Tactic: 3 Time: 0.017408 TRT - VERBOSE Tactic: 4 Time: 0.018432 TRT - VERBOSE Tactic: 5 Time: 0.019456 TRT - VERBOSE Tactic: 6 Time: 0.019456 TRT - VERBOSE Tactic: 7 Time: 0.018432 TRT - VERBOSE Tactic: 8 Time: 0.018432 TRT - VERBOSE Tactic: 9 Time: 0.017408 TRT - VERBOSE Tactic: 28 Time: 0.017408 TRT - VERBOSE Fastest Tactic: 1 Time: 0.017408 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,64) -> Float(E1,1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.016384 TRT - VERBOSE Tactic: 1 Time: 0.017408 TRT - VERBOSE Tactic: 2 Time: 0.019456 TRT - VERBOSE Tactic: 3 Time: 0.018016 TRT - VERBOSE Tactic: 4 Time: 0.018432 TRT - VERBOSE Tactic: 5 Time: 0.017408 TRT - VERBOSE Tactic: 6 Time: 0.019456 TRT - VERBOSE Tactic: 7 Time: 0.017408 TRT - VERBOSE Tactic: 8 Time: 0.018432 TRT - VERBOSE Tactic: 9 Time: 0.018432 TRT - VERBOSE Tactic: 28 Time: 0.017408 TRT - VERBOSE Fastest Tactic: 0 Time: 0.016384 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,16) -> Float(E1,1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Tactic: 1 Time: 0.017408 TRT - VERBOSE Tactic: 2 Time: 0.018432 TRT - VERBOSE Tactic: 3 Time: 0.018432 TRT - VERBOSE Tactic: 4 Time: 0.018432 TRT - VERBOSE Tactic: 5 Time: 0.017408 TRT - VERBOSE Tactic: 6 Time: 0.019392 TRT - VERBOSE Tactic: 7 Time: 0.018432 TRT - VERBOSE Tactic: 8 Time: 0.022528 TRT - VERBOSE Tactic: 9 Time: 0.019456 TRT - VERBOSE Tactic: 10 Time: 0.018432 TRT - VERBOSE Tactic: 11 Time: 0.019456 TRT - VERBOSE Tactic: 12 Time: 0.018432 TRT - VERBOSE Tactic: 13 Time: 0.018432 TRT - VERBOSE Tactic: 14 Time: 0.019456 TRT - VERBOSE Tactic: 15 Time: 0.018432 TRT - VERBOSE Tactic: 16 Time: 0.019456 TRT - VERBOSE Tactic: 17 Time: 0.018432 TRT - VERBOSE Tactic: 18 Time: 0.017408 TRT - VERBOSE Tactic: 19 Time: 0.017408 TRT - VERBOSE Tactic: 20 Time: 0.018432 TRT - VERBOSE Tactic: 21 Time: 0.018432 TRT - VERBOSE Tactic: 22 Time: 0.018432 TRT - VERBOSE Tactic: 23 Time: 0.018432 TRT - VERBOSE Tactic: 28 Time: 0.017408 TRT - VERBOSE Tactic: 29 Time: 0.017408 TRT - VERBOSE Tactic: 30 Time: 0.017408 TRT - VERBOSE Fastest Tactic: 1 Time: 0.017408 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1:32,E0,1) -> Float(E2,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 2 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWiseV2) TRT - VERBOSE Tactic: 24 Time: 0.017408 TRT - VERBOSE Tactic: 25 Time: 0.018432 TRT - VERBOSE Tactic: 26 Time: 0.018432 TRT - VERBOSE Tactic: 27 Time: 0.018432 TRT - VERBOSE Tactic: 31 Time: 0.018432 TRT - VERBOSE Fastest Tactic: 24 Time: 0.017408 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 24 TRT - VERBOSE *************** Autotuning format combination: Float(1:4,E1,E0,1) -> Float(1:4,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(181 + (Unnamed Layer* 89) [Shuffle], Div_113) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.761856 TRT - VERBOSE Tactic: 1 Time: 0.877568 TRT - VERBOSE Tactic: 2 Time: 0.87552 TRT - VERBOSE Tactic: 3 Time: 0.984064 TRT - VERBOSE Tactic: 4 Time: 1.05267 TRT - VERBOSE Tactic: 5 Time: 0.932864 TRT - VERBOSE Tactic: 6 Time: 1.21139 TRT - VERBOSE Tactic: 7 Time: 1.24314 TRT - VERBOSE Tactic: 8 Time: 1.23802 TRT - VERBOSE Tactic: 9 Time: 1.39878 TRT - VERBOSE Tactic: 10 Time: 0.530432 TRT - VERBOSE Tactic: 11 Time: 0.611328 TRT - VERBOSE Tactic: 12 Time: 0.56832 TRT - VERBOSE Tactic: 13 Time: 0.712704 TRT - VERBOSE Tactic: 14 Time: 0.692224 TRT - VERBOSE Tactic: 15 Time: 0.607232 TRT - VERBOSE Tactic: 16 Time: 0.899072 TRT - VERBOSE Tactic: 17 Time: 0.841728 TRT - VERBOSE Tactic: 18 Time: 0.794624 TRT - VERBOSE Tactic: 19 Time: 0.705536 TRT - VERBOSE Tactic: 20 Time: 0.361472 TRT - VERBOSE Tactic: 21 Time: 0.406528 TRT - VERBOSE Tactic: 22 Time: 0.477184 TRT - VERBOSE Tactic: 23 Time: 0.622592 TRT - VERBOSE Tactic: 28 Time: 0.05632 TRT - VERBOSE Tactic: 29 Time: 0.05632 TRT - VERBOSE Tactic: 30 Time: 0.057344 TRT - VERBOSE Fastest Tactic: 28 Time: 0.05632 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 28 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 64 E1),E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: ReduceMean_160 (Reduce) TRT - VERBOSE Tactic: 5 Time: 0.008192 TRT - VERBOSE Tactic: 7 Time: 0.012288 TRT - VERBOSE Tactic: 8 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 5 Time: 0.008192 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reduce Tactic: 5 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 64 E1),E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_134 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Tactic: 1 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.018432 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1,E0,64) -> Float((* E2 E3),1,E3,1) where E0=(MUL_ADD 64 E1 64) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_134 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Tactic: 1 Time: 0.39424 TRT - VERBOSE Fastest Tactic: 0 Time: 0.024576 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1:4,E0,16) -> Float((* E2 E3),1:4,E3,1) where E0=(MUL_ADD 16 E1 16) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_134 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.072704 TRT - VERBOSE Tactic: 1 Time: 0.059392 TRT - VERBOSE Fastest Tactic: 1 Time: 0.059392 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float((* 2 E1),E1:32,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_134 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 1.23706 TRT - VERBOSE Tactic: 1 Time: 0.38912 TRT - VERBOSE Fastest Tactic: 1 Time: 0.38912 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E0,(# 3 (SHAPE input)),1), Float(E1,E0,(# 3 (SHAPE input)),1), Float(E0,E0,(# 3 (SHAPE input)),1) -> Float(E1,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) E1=(* 32 E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.043008 TRT - VERBOSE Tactic: 1 Time: 0.044032 TRT - VERBOSE Tactic: 2 Time: 0.044032 TRT - VERBOSE Tactic: 3 Time: 0.043008 TRT - VERBOSE Tactic: 4 Time: 0.044032 TRT - VERBOSE Tactic: 5 Time: 0.043008 TRT - VERBOSE Tactic: 6 Time: 0.045056 TRT - VERBOSE Tactic: 7 Time: 0.044032 TRT - VERBOSE Tactic: 8 Time: 0.04608 TRT - VERBOSE Tactic: 9 Time: 0.080896 TRT - VERBOSE Tactic: 28 Time: 0.106496 TRT - VERBOSE Fastest Tactic: 0 Time: 0.043008 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,32), Float(E1,1,E0,32), Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1,(# 3 (SHAPE input)),1) -> Float(E1,1,E0,32) where E0=(* 32 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.095232 TRT - VERBOSE Tactic: 1 Time: 0.151552 TRT - VERBOSE Tactic: 2 Time: 0.134144 TRT - VERBOSE Tactic: 3 Time: 0.23552 TRT - VERBOSE Tactic: 4 Time: 0.227328 TRT - VERBOSE Tactic: 5 Time: 0.2304 TRT - VERBOSE Tactic: 6 Time: 0.474112 TRT - VERBOSE Tactic: 7 Time: 0.361472 TRT - VERBOSE Tactic: 8 Time: 0.365568 TRT - VERBOSE Tactic: 9 Time: 0.449536 TRT - VERBOSE Tactic: 28 Time: 0.149504 TRT - VERBOSE Fastest Tactic: 0 Time: 0.095232 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,8), Float(E1,1:4,E0,8), Float((* (# 2 (SHAPE input)) (# 3 (SHAPE input))),1:4,(# 3 (SHAPE input)),1) -> Float(E1,1:4,E0,8) where E0=(* 8 (# 3 (SHAPE input))) E1=(* (# 2 (SHAPE input)) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.45056 TRT - VERBOSE Tactic: 1 Time: 0.500736 TRT - VERBOSE Tactic: 2 Time: 0.9472 TRT - VERBOSE Tactic: 3 Time: 0.564224 TRT - VERBOSE Tactic: 4 Time: 0.521216 TRT - VERBOSE Tactic: 5 Time: 0.929792 TRT - VERBOSE Tactic: 6 Time: 0.70144 TRT - VERBOSE Tactic: 7 Time: 0.939008 TRT - VERBOSE Tactic: 8 Time: 0.687104 TRT - VERBOSE Tactic: 9 Time: 0.80384 TRT - VERBOSE Tactic: 10 Time: 0.546816 TRT - VERBOSE Tactic: 11 Time: 0.54784 TRT - VERBOSE Tactic: 12 Time: 0.556032 TRT - VERBOSE Tactic: 13 Time: 1.26362 TRT - VERBOSE Tactic: 14 Time: 1.25952 TRT - VERBOSE Tactic: 15 Time: 1.2585 TRT - VERBOSE Tactic: 16 Time: 1.26362 TRT - VERBOSE Tactic: 17 Time: 1.26669 TRT - VERBOSE Tactic: 18 Time: 1.2544 TRT - VERBOSE Tactic: 19 Time: 1.46227 TRT - VERBOSE Tactic: 20 Time: 1.26362 TRT - VERBOSE Tactic: 21 Time: 1.2544 TRT - VERBOSE Tactic: 22 Time: 1.25952 TRT - VERBOSE Tactic: 23 Time: 1.26464 TRT - VERBOSE Tactic: 28 Time: 1.26464 TRT - VERBOSE Tactic: 29 Time: 1.27898 TRT - VERBOSE Tactic: 30 Time: 1.26362 TRT - VERBOSE Fastest Tactic: 0 Time: 0.45056 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0:32,(# 3 (SHAPE input)),1), Float(E0,E0:32,(# 3 (SHAPE input)),1), Float(E0,E0:32,(# 3 (SHAPE input)),1) -> Float(E0,E0:32,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWiseV2) TRT - VERBOSE Tactic: 24 Time: 1.32403 TRT - VERBOSE Tactic: 25 Time: 1.31584 TRT - VERBOSE Tactic: 26 Time: 1.31482 TRT - VERBOSE Tactic: 27 Time: 1.32198 TRT - VERBOSE Tactic: 31 Time: 1.32198 TRT - VERBOSE Fastest Tactic: 26 Time: 1.31482 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 26 TRT - VERBOSE *************** Autotuning format combination: Float(1:4,E0,(# 3 (SHAPE input)),1), Float(1:4,E0,(# 3 (SHAPE input)),1), Float(1:4,E0,(# 3 (SHAPE input)),1) -> Float(1:4,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_90, PWN(Softplus_91)), PWN(PWN(Sub_92, PWN(Softplus_93)), Mul_94)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 16.4557 TRT - VERBOSE Tactic: 1 Time: 18.7709 TRT - VERBOSE Tactic: 2 Time: 31.6385 TRT - VERBOSE Tactic: 3 Time: 13.7114 TRT - VERBOSE Tactic: 4 Time: 33.5749 TRT - VERBOSE Tactic: 5 Time: 16.3041 TRT - VERBOSE Tactic: 6 Time: 9.77203 TRT - VERBOSE Tactic: 7 Time: 15.9304 TRT - VERBOSE Tactic: 8 Time: 9.64198 TRT - VERBOSE Tactic: 9 Time: 10.9916 TRT - VERBOSE Tactic: 10 Time: 3.48672 TRT - VERBOSE Tactic: 11 Time: 4.096 TRT - VERBOSE Tactic: 12 Time: 6.72973 TRT - VERBOSE Tactic: 13 Time: 8.32307 TRT - VERBOSE Tactic: 14 Time: 12.5123 TRT - VERBOSE Tactic: 15 Time: 12.2696 TRT - VERBOSE Tactic: 16 Time: 12.117 TRT - VERBOSE Tactic: 17 Time: 18.0879 TRT - VERBOSE Tactic: 18 Time: 19.0198 TRT - VERBOSE Tactic: 19 Time: 9.96762 TRT - VERBOSE Tactic: 20 Time: 5.50912 TRT - VERBOSE Tactic: 21 Time: 6.4553 TRT - VERBOSE Tactic: 22 Time: 8.02304 TRT - VERBOSE Tactic: 23 Time: 11.6009 TRT - VERBOSE Tactic: 28 Time: 5.18963 TRT - VERBOSE Tactic: 29 Time: 5.2265 TRT - VERBOSE Tactic: 30 Time: 4.9664 TRT - VERBOSE Fastest Tactic: 10 Time: 3.48672 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 10 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 32 E0),E0,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: ReduceMax_95 (Reduce) TRT - VERBOSE Tactic: 5 Time: 0.402432 TRT - VERBOSE Tactic: 7 Time: 0.400384 TRT - VERBOSE Tactic: 8 Time: 0.395264 TRT - VERBOSE Fastest Tactic: 8 Time: 0.395264 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reduce Tactic: 8 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 64 E1) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_18 + Relu_19 (Scale) TRT - VERBOSE Tactic: 0 Time: 0.391168 TRT - VERBOSE Fastest Tactic: 0 Time: 0.391168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Scale Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,64) -> Float(E1,1,E0,64) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_18 + Relu_19 (Scale) TRT - VERBOSE Scale has no valid tactics for this config, skipping TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,16) -> Float(E1,1:4,E0,16) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: BatchNormalization_18 + Relu_19 (Scale) TRT - VERBOSE Scale has no valid tactics for this config, skipping TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 64 E1),E1,E0,1) -> Float((* 128 E3),E3,E2,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E3=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E2) *************** TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (FusedConvActConvolution) TRT - VERBOSE Tactic: 458751 Time: 0.526336 TRT - VERBOSE Fastest Tactic: 458751 Time: 0.526336 TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 1.02912 TRT - VERBOSE Tactic: 1 Time: 0.858112 TRT - VERBOSE Tactic: 2 Time: 1.88006 TRT - VERBOSE Tactic: 5 Time: 42.8544 TRT - VERBOSE Tactic: 56 Time: 1.05677 TRT - VERBOSE Tactic: 57 Time: 0.826368 TRT - VERBOSE Tactic: 58 Time: 1.8729 TRT - VERBOSE Tactic: 61 Time: 1.82272 TRT - VERBOSE Tactic: 112 Time: 0.12288 TRT - VERBOSE Tactic: 113 Time: 0.140288 TRT - VERBOSE Tactic: 114 Time: 0.145408 TRT - VERBOSE Tactic: 117 Time: 1.7879 TRT - VERBOSE Fastest Tactic: 112 Time: 0.12288 TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (CaskConvolution) TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.057344 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.062464 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.065536 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.05632 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.160768 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.080896 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.08192 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.053248 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.067584 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.057344 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.098304 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.164864 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.08704 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.080832 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.068608 TRT - VERBOSE Fastest Tactic: 6532229230713743445 Time: 0.053248 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 6532229230713743445 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1,E0,64) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E1),1,E1,128) where E0=(MUL_ADD 64 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 64) E1=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) *************** TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (CaskConvolution) TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.055296 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.050176 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.09728 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.0512 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.078848 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.0512 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.050176 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.049152 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.049152 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0),1:4,E0,16) -> Float((* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E1),1:4,E1,32) where E0=(MUL_ADD 16 (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 16) E1=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) *************** TRT - VERBOSE --------------- Timing Runner: Conv_20 + Relu_21 (CaskConvolution) TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.057344 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.055296 TRT - VERBOSE Conv_20 + Relu_21 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.05632 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.055296 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E0,E0,(# 3 (SHAPE input)),1) -> Float(E0,E0,(# 3 (SHAPE input)),1) where E0=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Resize_108 (Resize) TRT - VERBOSE Tactic: 1 Time: 0.00512 TRT - VERBOSE Fastest Tactic: 1 Time: 0.00512 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Resize Tactic: 1 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E5,E5,E4,1) where E0=(+ E2 1) E1=(* (+ E3 1) E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E3=(CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) E4=(+ E2 5) E5=(* (+ E3 5) E4) *************** TRT - VERBOSE --------------- Timing Runner: Pad_148 (Padding) TRT - VERBOSE Padding has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Pad_148 (Slice) TRT - VERBOSE Tactic: 0 Time: 0.02048 TRT - VERBOSE Fastest Tactic: 0 Time: 0.02048 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E5,E5,E4,1) where E0=(+ E2 5) E1=(* (+ E3 5) E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E3=(CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) E4=(+ E2 1) E5=(* (+ E3 1) E4) *************** TRT - VERBOSE --------------- Timing Runner: Conv_149 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_149 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.457728 TRT - VERBOSE Tactic: 1 Time: 3.50403 TRT - VERBOSE Tactic: 2 Time: 0.524288 TRT - VERBOSE Tactic: 56 Time: 0.457728 TRT - VERBOSE Tactic: 57 Time: 3.5031 TRT - VERBOSE Tactic: 58 Time: 0.523264 TRT - VERBOSE Tactic: 112 Time: 0.456704 TRT - VERBOSE Tactic: 113 Time: 0.16896 TRT - VERBOSE Tactic: 114 Time: 0.523264 TRT - VERBOSE Fastest Tactic: 113 Time: 0.16896 TRT - VERBOSE --------------- Timing Runner: Conv_149 (CaskConvolution) TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 1.07213 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.608256 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.319488 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.51712 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 1.04038 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.694272 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.992256 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 1.10899 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.369664 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: -7491730084094677098 TRT - VERBOSE Tactic: -7491730084094677098 Time: 0.150528 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.538624 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.156672 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x128_relu_interior_nn_v1 Tactic: -6273689210331812572 TRT - VERBOSE Tactic: -6273689210331812572 Time: 0.68608 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 1.0752 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x64_relu_interior_nn_v1 Tactic: -4337126844824617177 TRT - VERBOSE Tactic: -4337126844824617177 Time: 0.306176 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.615424 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.695296 TRT - VERBOSE Conv_149 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.324608 TRT - VERBOSE Fastest Tactic: -7491730084094677098 Time: 0.150528 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7491730084094677098 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ E2 5) E0),1,E0,1) -> Float((* (+ E2 1) E3),1,E3,1) where E0=(+ E1 5) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_149 (CaskConvolution) TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.933888 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.8192 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 5.90131 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.758784 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.755712 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.74752 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 1.38957 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.719872 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.719872 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float((* (+ E2 5) E0),1:4,E0,1) -> Float((* (+ E2 1) E3),1:4,E3,1) where E0=(+ E1 5) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_149 (CaskConvolution) TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.83968 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.730112 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 5.94534 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 1.18144 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 1.67629 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.764928 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.749568 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 1.61997 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 1.63021 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 1.37728 TRT - VERBOSE Conv_149 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.713728 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.713728 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float((* 64 E1),E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_159 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Tactic: 1 Time: 0.033792 TRT - VERBOSE Fastest Tactic: 0 Time: 0.018432 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1,E0,1) -> Float((* E2 E3),1,E3,64) where E0=(+ E1 1) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E3=(MUL_ADD 64 E1 64) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_159 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.055296 TRT - VERBOSE Tactic: 1 Time: 0.029696 TRT - VERBOSE Fastest Tactic: 1 Time: 0.029696 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1:4,E0,1) -> Float((* E2 E3),1:4,E3,16) where E0=(+ E1 1) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E3=(MUL_ADD 16 E1 16) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_159 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.068608 TRT - VERBOSE Tactic: 1 Time: 0.401408 TRT - VERBOSE Fastest Tactic: 0 Time: 0.068608 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1:32,E0,1) -> Float((* 2 E1),E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_159 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.075776 TRT - VERBOSE Tactic: 1 Time: 0.372736 TRT - VERBOSE Fastest Tactic: 0 Time: 0.075776 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 128 E1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (FusedConvActConvolution) TRT - VERBOSE Tactic: 524287 Time: 0.104448 TRT - VERBOSE Tactic: 720895 Time: 0.101376 TRT - VERBOSE Tactic: 983039 Time: 0.093184 TRT - VERBOSE Tactic: 1048575 Time: 0.09728 TRT - VERBOSE Tactic: 1703935 Time: 0.08704 TRT - VERBOSE Tactic: 1769471 Time: 0.09728 TRT - VERBOSE Tactic: 2424831 Time: 0.105472 TRT - VERBOSE Tactic: 2621439 Time: 0.090112 TRT - VERBOSE Tactic: 3014655 Time: 0.082944 TRT - VERBOSE Tactic: 3604479 Time: 0.086016 TRT - VERBOSE Tactic: 5046271 Time: 0.095232 TRT - VERBOSE Tactic: 6488063 Time: 0.094208 TRT - VERBOSE Tactic: 7274495 Time: 0.10752 TRT - VERBOSE Tactic: 7864319 Time: 0.09216 TRT - VERBOSE Tactic: 8847359 Time: 0.106496 TRT - VERBOSE Tactic: 9043967 Time: 0.086016 TRT - VERBOSE Tactic: 9961471 Time: 0.101376 TRT - VERBOSE Tactic: 10027007 Time: 0.093184 TRT - VERBOSE Tactic: 10485759 Time: 0.086016 TRT - VERBOSE Tactic: 10682367 Time: 0.088064 TRT - VERBOSE Tactic: 10813439 Time: 0.079872 TRT - VERBOSE Fastest Tactic: 10813439 Time: 0.079872 TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.183296 TRT - VERBOSE Tactic: 1 Time: 0.119808 TRT - VERBOSE Tactic: 2 Time: 0.186368 TRT - VERBOSE Tactic: 4 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 5 Time: 1.03322 TRT - VERBOSE Tactic: 6 Time: 0.067584 TRT - VERBOSE Tactic: 56 Time: 0.182272 TRT - VERBOSE Tactic: 57 Time: 0.118784 TRT - VERBOSE Tactic: 58 Time: 0.181248 TRT - VERBOSE Tactic: 60 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 61 Time: 1.03117 TRT - VERBOSE Tactic: 62 Time: 0.0656 TRT - VERBOSE Tactic: 112 Time: 0.182272 TRT - VERBOSE Tactic: 113 Time: 0.21504 TRT - VERBOSE Tactic: 114 Time: 0.18432 TRT - VERBOSE Tactic: 116 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 117 Time: 1.0199 TRT - VERBOSE Tactic: 118 Time: 0.065536 TRT - INFO Some tactics do not have sufficient workspace memory to run. Increasing workspace size will enable more tactics, please check verbose output for requested sizes. TRT - VERBOSE Fastest Tactic: 118 Time: 0.065536 TRT - VERBOSE Setting workspace to 8725266432enables more tactics for profiling TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (CaskConvolution) TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.082944 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.088064 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.1024 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.093184 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.259072 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.131072 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.13312 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.083968 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.105472 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.050176 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.09216 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.132096 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.262144 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.136192 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.131072 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.110592 TRT - VERBOSE Fastest Tactic: -7777264329408437990 Time: 0.050176 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7777264329408437990 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,128) -> Float(E1,1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (CaskConvolution) TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.093184 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.0768 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.146432 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.09216 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.119808 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.093184 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.0768 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.075776 TRT - VERBOSE Fastest Tactic: -824800713406371346 Time: 0.075776 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -824800713406371346 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,32) -> Float(E1,1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_22 + Relu_23 (CaskConvolution) TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.090112 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.08704 TRT - VERBOSE Conv_22 + Relu_23 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.088064 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.08704 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1), Float(E2,E1,E0,1), Float(E1,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 64 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Tactic: 1 Time: 0.024576 TRT - VERBOSE Tactic: 2 Time: 0.022528 TRT - VERBOSE Tactic: 3 Time: 0.024576 TRT - VERBOSE Tactic: 4 Time: 0.024576 TRT - VERBOSE Tactic: 5 Time: 0.023552 TRT - VERBOSE Tactic: 6 Time: 0.024576 TRT - VERBOSE Tactic: 7 Time: 0.024576 TRT - VERBOSE Tactic: 8 Time: 0.023552 TRT - VERBOSE Tactic: 9 Time: 0.024576 TRT - VERBOSE Tactic: 28 Time: 0.023552 TRT - VERBOSE Fastest Tactic: 2 Time: 0.022528 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 2 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,64), Float(E1,1,E0,64), Float((* E3 E4),1,E4,1) -> Float(E1,1,E0,64) where E0=(MUL_ADD 64 E2 64) E1=(* E3 E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E3=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E4=(+ E2 1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Tactic: 1 Time: 0.024576 TRT - VERBOSE Tactic: 2 Time: 0.024576 TRT - VERBOSE Tactic: 3 Time: 0.024576 TRT - VERBOSE Tactic: 4 Time: 0.024576 TRT - VERBOSE Tactic: 5 Time: 0.023552 TRT - VERBOSE Tactic: 6 Time: 0.0256 TRT - VERBOSE Tactic: 7 Time: 0.024576 TRT - VERBOSE Tactic: 8 Time: 0.024576 TRT - VERBOSE Tactic: 9 Time: 0.024576 TRT - VERBOSE Tactic: 28 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 5 Time: 0.023552 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 5 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,16), Float(E1,1:4,E0,16), Float((* E3 E4),1:4,E4,1) -> Float(E1,1:4,E0,16) where E0=(MUL_ADD 16 E2 16) E1=(* E3 E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) E3=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E4=(+ E2 1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.024576 TRT - VERBOSE Tactic: 1 Time: 0.024576 TRT - VERBOSE Tactic: 2 Time: 0.034816 TRT - VERBOSE Tactic: 3 Time: 0.024576 TRT - VERBOSE Tactic: 4 Time: 0.049152 TRT - VERBOSE Tactic: 5 Time: 0.050176 TRT - VERBOSE Tactic: 6 Time: 0.027648 TRT - VERBOSE Tactic: 7 Time: 0.048128 TRT - VERBOSE Tactic: 8 Time: 0.033792 TRT - VERBOSE Tactic: 9 Time: 0.032768 TRT - VERBOSE Tactic: 10 Time: 0.0256 TRT - VERBOSE Tactic: 11 Time: 0.0256 TRT - VERBOSE Tactic: 12 Time: 0.026624 TRT - VERBOSE Tactic: 13 Time: 0.024576 TRT - VERBOSE Tactic: 14 Time: 0.0256 TRT - VERBOSE Tactic: 15 Time: 0.024576 TRT - VERBOSE Tactic: 16 Time: 0.0256 TRT - VERBOSE Tactic: 17 Time: 0.0256 TRT - VERBOSE Tactic: 18 Time: 0.028672 TRT - VERBOSE Tactic: 19 Time: 0.029696 TRT - VERBOSE Tactic: 20 Time: 0.024576 TRT - VERBOSE Tactic: 21 Time: 0.0256 TRT - VERBOSE Tactic: 22 Time: 0.024576 TRT - VERBOSE Tactic: 23 Time: 0.024576 TRT - VERBOSE Tactic: 28 Time: 0.023552 TRT - VERBOSE Tactic: 29 Time: 0.024576 TRT - VERBOSE Tactic: 30 Time: 0.024576 TRT - VERBOSE Fastest Tactic: 28 Time: 0.023552 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 28 TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1:32,E0,1), Float(E2,E1:32,E0,1), Float(E1,E1:32,E0,1) -> Float(E2,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* 2 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWiseV2) TRT - VERBOSE Tactic: 24 Time: 0.0256 TRT - VERBOSE Tactic: 25 Time: 0.0256 TRT - VERBOSE Tactic: 26 Time: 0.0256 TRT - VERBOSE Tactic: 27 Time: 0.0256 TRT - VERBOSE Tactic: 31 Time: 0.0256 TRT - VERBOSE Fastest Tactic: 24 Time: 0.0256 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 24 TRT - VERBOSE *************** Autotuning format combination: Float(1:4,E1,E0,1), Float(1:4,E1,E0,1), Float(1:4,E1,E0,1) -> Float(1:4,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(PWN(Sub_161, PWN(Softplus_162)), PWN(PWN(Sub_163, PWN(Softplus_164)), Mul_165)) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.98304 TRT - VERBOSE Tactic: 1 Time: 1.1223 TRT - VERBOSE Tactic: 2 Time: 1.8944 TRT - VERBOSE Tactic: 3 Time: 1.29741 TRT - VERBOSE Tactic: 4 Time: 1.16326 TRT - VERBOSE Tactic: 5 Time: 1.2073 TRT - VERBOSE Tactic: 6 Time: 1.64352 TRT - VERBOSE Tactic: 7 Time: 1.27488 TRT - VERBOSE Tactic: 8 Time: 1.37114 TRT - VERBOSE Tactic: 9 Time: 1.56877 TRT - VERBOSE Tactic: 10 Time: 0.569344 TRT - VERBOSE Tactic: 11 Time: 0.679936 TRT - VERBOSE Tactic: 12 Time: 0.744448 TRT - VERBOSE Tactic: 13 Time: 0.787456 TRT - VERBOSE Tactic: 14 Time: 1.11411 TRT - VERBOSE Tactic: 15 Time: 1.07315 TRT - VERBOSE Tactic: 16 Time: 1.00557 TRT - VERBOSE Tactic: 17 Time: 1.43872 TRT - VERBOSE Tactic: 18 Time: 1.58003 TRT - VERBOSE Tactic: 19 Time: 0.723968 TRT - VERBOSE Tactic: 20 Time: 0.431104 TRT - VERBOSE Tactic: 21 Time: 0.574464 TRT - VERBOSE Tactic: 22 Time: 0.729088 TRT - VERBOSE Tactic: 23 Time: 1.05472 TRT - VERBOSE Tactic: 28 Time: 0.080896 TRT - VERBOSE Tactic: 29 Time: 0.080896 TRT - VERBOSE Tactic: 30 Time: 0.082944 TRT - VERBOSE Fastest Tactic: 28 Time: 0.080896 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 28 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 64 E1),E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: ReduceMax_166 (Reduce) TRT - VERBOSE Tactic: 5 Time: 0.012288 TRT - VERBOSE Tactic: 7 Time: 0.012288 TRT - VERBOSE Tactic: 8 Time: 0.013312 TRT - VERBOSE Fastest Tactic: 5 Time: 0.012288 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reduce Tactic: 5 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 128 E1) *************** TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,128) -> Float(E1,1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,32) -> Float(E1,1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E2,E2,(# 3 (SHAPE input)),1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -2) 2) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -2) 2) 1) E0) E2=(* (# 2 (SHAPE input)) (# 3 (SHAPE input))) *************** TRT - VERBOSE --------------- Timing Runner: Resize_179 (Resize) TRT - VERBOSE Tactic: 1 Time: 0.006144 TRT - VERBOSE Fastest Tactic: 1 Time: 0.006144 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Resize Tactic: 1 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 128 E1) *************** TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,128) -> Float(E1,1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,32) -> Float(E1,1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 128 E1) *************** TRT - VERBOSE --------------- Timing Runner: Conv_28 (CudaDepthwiseConvolution) TRT - VERBOSE CudaDepthwiseConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_28 (FusedConvActConvolution) TRT - VERBOSE Tactic: 524287 Time: 0.105472 TRT - VERBOSE Tactic: 720895 Time: 0.1024 TRT - VERBOSE Tactic: 983039 Time: 0.096256 TRT - VERBOSE Tactic: 1048575 Time: 0.098304 TRT - VERBOSE Tactic: 1703935 Time: 0.088064 TRT - VERBOSE Tactic: 1769471 Time: 0.096256 TRT - VERBOSE Tactic: 2424831 Time: 0.106496 TRT - VERBOSE Tactic: 2621439 Time: 0.089088 TRT - VERBOSE Tactic: 3014655 Time: 0.084992 TRT - VERBOSE Tactic: 3604479 Time: 0.089088 TRT - VERBOSE Tactic: 5046271 Time: 0.096256 TRT - VERBOSE Tactic: 6488063 Time: 0.095232 TRT - VERBOSE Tactic: 7274495 Time: 0.10752 TRT - VERBOSE Tactic: 7864319 Time: 0.093184 TRT - VERBOSE Tactic: 8847359 Time: 0.105472 TRT - VERBOSE Tactic: 9043967 Time: 0.08704 TRT - VERBOSE Tactic: 9961471 Time: 0.1024 TRT - VERBOSE Tactic: 10027007 Time: 0.094208 TRT - VERBOSE Tactic: 10485759 Time: 0.084992 TRT - VERBOSE Tactic: 10682367 Time: 0.088064 TRT - VERBOSE Tactic: 10813439 Time: 0.079872 TRT - VERBOSE Fastest Tactic: 10813439 Time: 0.079872 TRT - VERBOSE --------------- Timing Runner: Conv_28 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.171008 TRT - VERBOSE Tactic: 1 Time: 0.109568 TRT - VERBOSE Tactic: 2 Time: 0.176128 TRT - VERBOSE Tactic: 4 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 5 Time: 1.01069 TRT - VERBOSE Tactic: 6 Time: 0.055296 TRT - VERBOSE Tactic: 56 Time: 0.171008 TRT - VERBOSE Tactic: 57 Time: 0.109568 TRT - VERBOSE Tactic: 58 Time: 0.173056 TRT - VERBOSE Tactic: 60 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 61 Time: 1.00864 TRT - VERBOSE Tactic: 62 Time: 0.05632 TRT - VERBOSE Tactic: 112 Time: 0.172032 TRT - VERBOSE Tactic: 113 Time: 0.198656 TRT - VERBOSE Tactic: 114 Time: 0.176128 TRT - VERBOSE Tactic: 116 skipped. Scratch requested: 8725266432, available: 1073741824 TRT - VERBOSE Tactic: 117 Time: 1.00147 TRT - VERBOSE Tactic: 118 Time: 0.055296 TRT - VERBOSE Fastest Tactic: 6 Time: 0.055296 TRT - VERBOSE Setting workspace to 8725266432enables more tactics for profiling TRT - VERBOSE --------------- Timing Runner: Conv_28 (CaskConvolution) TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.08192 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.088064 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.104448 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.093184 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.258048 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.131072 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.13312 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.083968 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.105472 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.0512 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.093184 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x32_relu_small_nn_v1 Tactic: -6313876406580483184 TRT - VERBOSE Tactic: -6313876406580483184 Time: 0.132096 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_aligna4_alignc4 Tactic: -4932505327461806800 TRT - VERBOSE Tactic: -4932505327461806800 Time: 0.26112 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -2870780723387717246 TRT - VERBOSE Tactic: -2870780723387717246 Time: 0.136192 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x128_relu_medium_nn_v1 Tactic: -1123676555321336786 TRT - VERBOSE Tactic: -1123676555321336786 Time: 0.131072 TRT - VERBOSE Conv_28 Set Tactic Name: ampere_scudnn_128x64_relu_medium_nn_v1 Tactic: -701551393537224327 TRT - VERBOSE Tactic: -701551393537224327 Time: 0.110592 TRT - VERBOSE Fastest Tactic: -7777264329408437990 Time: 0.0512 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7777264329408437990 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,128) -> Float(E1,1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_28 (CaskConvolution) TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 1852818056038333088 TRT - VERBOSE Tactic: 1852818056038333088 Time: 0.094208 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 2137271960938803767 TRT - VERBOSE Tactic: 2137271960938803767 Time: 0.075776 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x256x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 4543151729465212056 TRT - VERBOSE Tactic: 4543151729465212056 Time: 0.144384 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 7098622778606028509 TRT - VERBOSE Tactic: 7098622778606028509 Time: 0.090112 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize256x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: -9217704540809507511 TRT - VERBOSE Tactic: -9217704540809507511 Time: 0.118784 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -7734972403155710907 TRT - VERBOSE Tactic: -7734972403155710907 Time: 0.094208 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -3360299822783115645 TRT - VERBOSE Tactic: -3360299822783115645 Time: 0.0768 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nhwckrsc_nhwc_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: -824800713406371346 TRT - VERBOSE Tactic: -824800713406371346 Time: 0.075776 TRT - VERBOSE Fastest Tactic: 2137271960938803767 Time: 0.075776 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: 2137271960938803767 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,32) -> Float(E1,1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Conv_28 (CaskConvolution) TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_indexed_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: 7342025736444949634 TRT - VERBOSE Tactic: 7342025736444949634 Time: 0.089088 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8_t1r3s3 Tactic: -7377458734869418330 TRT - VERBOSE Tactic: -7377458734869418330 Time: 0.08704 TRT - VERBOSE Conv_28 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_tf32f32_f32_nhwckrsc_nhwc_tilesize128x128x16_stage4_warpsize2x2x1_g1_tensor16x8x8 Tactic: -5457304872213719461 TRT - VERBOSE Tactic: -5457304872213719461 Time: 0.088064 TRT - VERBOSE Fastest Tactic: -7377458734869418330 Time: 0.08704 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: CaskConvolution Tactic: -7377458734869418330 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1,E0,1) -> Float(E2,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 128 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.008192 TRT - VERBOSE Tactic: 1 Time: 0.01024 TRT - VERBOSE Tactic: 2 Time: 0.009216 TRT - VERBOSE Tactic: 3 Time: 0.008192 TRT - VERBOSE Tactic: 4 Time: 0.011264 TRT - VERBOSE Tactic: 5 Time: 0.008192 TRT - VERBOSE Tactic: 6 Time: 0.007168 TRT - VERBOSE Tactic: 7 Time: 0.01024 TRT - VERBOSE Tactic: 8 Time: 0.0072 TRT - VERBOSE Tactic: 9 Time: 0.007168 TRT - VERBOSE Tactic: 28 Time: 0.009152 TRT - VERBOSE Fastest Tactic: 6 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 6 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1,E0,128) -> Float(E1,1,E0,128) where E0=(MUL_ADD 128 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 128) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.009216 TRT - VERBOSE Tactic: 1 Time: 0.009216 TRT - VERBOSE Tactic: 2 Time: 0.007168 TRT - VERBOSE Tactic: 3 Time: 0.008128 TRT - VERBOSE Tactic: 4 Time: 0.008192 TRT - VERBOSE Tactic: 5 Time: 0.008192 TRT - VERBOSE Tactic: 6 Time: 0.008192 TRT - VERBOSE Tactic: 7 Time: 0.008192 TRT - VERBOSE Tactic: 8 Time: 0.01024 TRT - VERBOSE Tactic: 9 Time: 0.009216 TRT - VERBOSE Tactic: 28 Time: 0.01024 TRT - VERBOSE Fastest Tactic: 2 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 2 TRT - VERBOSE *************** Autotuning format combination: Float(E1,1:4,E0,32) -> Float(E1,1:4,E0,32) where E0=(MUL_ADD 32 (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 32) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Tactic: 1 Time: 0.011264 TRT - VERBOSE Tactic: 2 Time: 0.008192 TRT - VERBOSE Tactic: 3 Time: 0.009216 TRT - VERBOSE Tactic: 4 Time: 0.009216 TRT - VERBOSE Tactic: 5 Time: 0.01024 TRT - VERBOSE Tactic: 6 Time: 0.013312 TRT - VERBOSE Tactic: 7 Time: 0.011264 TRT - VERBOSE Tactic: 8 Time: 0.017408 TRT - VERBOSE Tactic: 9 Time: 0.013312 TRT - VERBOSE Tactic: 10 Time: 0.01024 TRT - VERBOSE Tactic: 11 Time: 0.008192 TRT - VERBOSE Tactic: 12 Time: 0.01024 TRT - VERBOSE Tactic: 13 Time: 0.011264 TRT - VERBOSE Tactic: 14 Time: 0.008192 TRT - VERBOSE Tactic: 15 Time: 0.008192 TRT - VERBOSE Tactic: 16 Time: 0.012288 TRT - VERBOSE Tactic: 17 Time: 0.01024 TRT - VERBOSE Tactic: 18 Time: 0.009216 TRT - VERBOSE Tactic: 19 Time: 0.009216 TRT - VERBOSE Tactic: 20 Time: 0.009216 TRT - VERBOSE Tactic: 21 Time: 0.009216 TRT - VERBOSE Tactic: 22 Time: 0.007168 TRT - VERBOSE Tactic: 23 Time: 0.008192 TRT - VERBOSE Tactic: 28 Time: 0.009216 TRT - VERBOSE Tactic: 29 Time: 0.01024 TRT - VERBOSE Tactic: 30 Time: 0.009216 TRT - VERBOSE Fastest Tactic: 22 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 22 TRT - VERBOSE *************** Autotuning format combination: Float(E2,E1:32,E0,1) -> Float(E2,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) E2=(* 4 E1) *************** TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWise) TRT - VERBOSE PointWise has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWiseV2) TRT - VERBOSE Tactic: 24 Time: 0.008192 TRT - VERBOSE Tactic: 25 Time: 0.008192 TRT - VERBOSE Tactic: 26 Time: 0.009216 TRT - VERBOSE Tactic: 27 Time: 0.011264 TRT - VERBOSE Tactic: 31 Time: 0.008192 TRT - VERBOSE Fastest Tactic: 24 Time: 0.008192 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 24 TRT - VERBOSE *************** Autotuning format combination: Float(1:4,E1,E0,1) -> Float(1:4,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: PWN(266 + (Unnamed Layer* 140) [Shuffle], Div_184) (PointWiseV2) TRT - VERBOSE Tactic: 0 Time: 0.382976 TRT - VERBOSE Tactic: 1 Time: 0.44032 TRT - VERBOSE Tactic: 2 Time: 0.438272 TRT - VERBOSE Tactic: 3 Time: 0.492544 TRT - VERBOSE Tactic: 4 Time: 0.528384 TRT - VERBOSE Tactic: 5 Time: 0.467968 TRT - VERBOSE Tactic: 6 Time: 0.606208 TRT - VERBOSE Tactic: 7 Time: 0.622592 TRT - VERBOSE Tactic: 8 Time: 0.618496 TRT - VERBOSE Tactic: 9 Time: 0.700416 TRT - VERBOSE Tactic: 10 Time: 0.267264 TRT - VERBOSE Tactic: 11 Time: 0.3072 TRT - VERBOSE Tactic: 12 Time: 0.28672 TRT - VERBOSE Tactic: 13 Time: 0.3584 TRT - VERBOSE Tactic: 14 Time: 0.347104 TRT - VERBOSE Tactic: 15 Time: 0.306176 TRT - VERBOSE Tactic: 16 Time: 0.451584 TRT - VERBOSE Tactic: 17 Time: 0.422912 TRT - VERBOSE Tactic: 18 Time: 0.39936 TRT - VERBOSE Tactic: 19 Time: 0.354304 TRT - VERBOSE Tactic: 20 Time: 0.182272 TRT - VERBOSE Tactic: 21 Time: 0.2048 TRT - VERBOSE Tactic: 22 Time: 0.24064 TRT - VERBOSE Tactic: 23 Time: 0.313344 TRT - VERBOSE Tactic: 28 Time: 0.03072 TRT - VERBOSE Tactic: 29 Time: 0.03072 TRT - VERBOSE Tactic: 30 Time: 0.031744 TRT - VERBOSE Fastest Tactic: 28 Time: 0.03072 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: PointWiseV2 Tactic: 28 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 128 E1),E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: ReduceMean_231 (Reduce) TRT - VERBOSE Tactic: 5 Time: 0.007168 TRT - VERBOSE Tactic: 7 Time: 0.017408 TRT - VERBOSE Tactic: 8 Time: 0.017408 TRT - VERBOSE Fastest Tactic: 5 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Reduce Tactic: 5 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float((* 128 E1),E1,E0,1) -> Float(E1,E1,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_205 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.007168 TRT - VERBOSE Tactic: 1 Time: 0.019456 TRT - VERBOSE Fastest Tactic: 0 Time: 0.007168 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1,E0,128) -> Float((* E2 E3),1,E3,1) where E0=(MUL_ADD 128 E1 128) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_205 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.013312 TRT - VERBOSE Tactic: 1 Time: 0.202752 TRT - VERBOSE Fastest Tactic: 0 Time: 0.013312 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 0 TRT - VERBOSE *************** Autotuning format combination: Float((* E2 E0),1:4,E0,32) -> Float((* E2 E3),1:4,E3,1) where E0=(MUL_ADD 32 E1 32) E1=(CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) E2=(+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E3=(+ E1 1) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_205 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.036864 TRT - VERBOSE Tactic: 1 Time: 0.034816 TRT - VERBOSE Fastest Tactic: 1 Time: 0.034816 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE *************** Autotuning format combination: Float((* 4 E1),E1:32,E0,1) -> Float(E1,E1:32,E0,1) where E0=(+ (CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) 1) E1=(* (+ (CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) 1) E0) *************** TRT - VERBOSE --------------- Timing Runner: Reshape_205 (Shuffle) TRT - VERBOSE Tactic: 0 Time: 0.60928 TRT - VERBOSE Tactic: 1 Time: 0.200704 TRT - VERBOSE Fastest Tactic: 1 Time: 0.200704 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Shuffle Tactic: 1 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E5,E5,E4,1) where E0=(+ E2 1) E1=(* (+ E3 1) E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) E3=(CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) E4=(+ E2 3) E5=(* (+ E3 3) E4) *************** TRT - VERBOSE --------------- Timing Runner: Pad_219 (Padding) TRT - VERBOSE Padding has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Pad_219 (Slice) TRT - VERBOSE Tactic: 0 Time: 0.012288 TRT - VERBOSE Fastest Tactic: 0 Time: 0.012288 TRT - VERBOSE >>>>>>>>>>>>>>> Chose Runner Type: Slice Tactic: 0 TRT - VERBOSE =============== Computing costs for TRT - VERBOSE *************** Autotuning format combination: Float(E1,E1,E0,1) -> Float(E5,E5,E4,1) where E0=(+ E2 3) E1=(* (+ E3 3) E0) E2=(CEIL_DIV (+ (# 3 (SHAPE input)) -4) 4) E3=(CEIL_DIV (+ (# 2 (SHAPE input)) -4) 4) E4=(+ E2 1) E5=(* (+ E3 1) E4) *************** TRT - VERBOSE --------------- Timing Runner: Conv_220 (CudaDepthwiseConvolution) TRT - VERBOSE Tactic: -1 Time: 0.016384 TRT - VERBOSE Fastest Tactic: -1 Time: 0.016384 TRT - VERBOSE --------------- Timing Runner: Conv_220 (FusedConvActConvolution) TRT - VERBOSE FusedConvActConvolution has no valid tactics for this config, skipping TRT - VERBOSE --------------- Timing Runner: Conv_220 (CudnnConvolution) TRT - VERBOSE Tactic: 0 Time: 0.018432 TRT - VERBOSE Tactic: 1 Time: 1.54726 TRT - VERBOSE Tactic: 2 Time: 0.24576 TRT - VERBOSE Tactic: 4 Time: 0.25088 TRT - VERBOSE Tactic: 5 Time: 0.178176 TRT - VERBOSE Tactic: 6 Time: 0.218112 TRT - VERBOSE Tactic: 56 Time: 0.02048 TRT - VERBOSE Tactic: 57 Time: 1.54522 TRT - VERBOSE Tactic: 58 Time: 0.244736 TRT - VERBOSE Tactic: 60 Time: 0.227328 TRT - VERBOSE Tactic: 61 Time: 0.171008 TRT - VERBOSE Tactic: 62 Time: 0.216064 TRT - VERBOSE Tactic: 112 Time: 0.019456 TRT - VERBOSE Tactic: 113 Time: 0.038912 TRT - VERBOSE Tactic: 114 Time: 0.25088 TRT - VERBOSE Tactic: 116 Time: 0.227328 TRT - VERBOSE Tactic: 117 Time: 0.16896 TRT - VERBOSE Tactic: 118 Time: 0.216064 TRT - VERBOSE Fastest Tactic: 0 Time: 0.018432 TRT - VERBOSE --------------- Timing Runner: Conv_220 (CaskConvolution) TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 130477820174479366 TRT - VERBOSE Tactic: 130477820174479366 Time: 0.466944 TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize128x64x8_stage3_warpsize2x2x1_g1_ffma_aligna4_alignc4 Tactic: 1358952225285826925 TRT - VERBOSE Tactic: 1358952225285826925 Time: 0.269312 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_128x64_relu_small_nn_v1 Tactic: 4549827808004681195 TRT - VERBOSE Tactic: 4549827808004681195 Time: 0.144384 TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5127140674766136213 TRT - VERBOSE Tactic: 5127140674766136213 Time: 0.227328 TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize256x128x8_stage3_warpsize4x2x1_g1_ffma_t1r3s3_aligna4_alignc4 Tactic: 5691674214365884252 TRT - VERBOSE Tactic: 5691674214365884252 Time: 0.461824 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_128x128_relu_small_nn_v1 Tactic: 5779835512569528575 TRT - VERBOSE Tactic: 5779835512569528575 Time: 0.305152 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_128x128_relu_xregs_large_nn_v1 Tactic: 6053873026024413720 TRT - VERBOSE Tactic: 6053873026024413720 Time: 0.431104 TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x128x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: 6532229230713743445 TRT - VERBOSE Tactic: 6532229230713743445 Time: 0.485376 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_128x64_relu_xregs_large_nn_v1 Tactic: 6767548733843469815 TRT - VERBOSE Tactic: 6767548733843469815 Time: 0.166912 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile148t_nt_v1 Tactic: -7777264329408437990 TRT - VERBOSE Tactic: -7777264329408437990 Time: 0.206848 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_128x32_relu_interior_nn_v1 Tactic: -7491730084094677098 TRT - VERBOSE Tactic: -7491730084094677098 Time: 0.069632 TRT - VERBOSE Conv_220 Set Tactic Name: sm80_xmma_fprop_implicit_gemm_f32f32_f32f32_f32_nchwkcrs_nchw_tilesize64x64x8_stage3_warpsize1x4x1_g1_ffma_aligna4_alignc4 Tactic: -6693149634808211969 TRT - VERBOSE Tactic: -6693149634808211969 Time: 0.236544 TRT - VERBOSE Conv_220 Set Tactic Name: ampere_scudnn_winograd_128x128_ldg1_ldg4_relu_tile442t_nt_v1 Tactic: -6664441261382767776 TRT - VERBOSE Deleting timing cache: 294 entries, served 335 hits since creation. TRT - ERROR 1: [convolutionRunner.cpp::nvinfer1::rt::task::CaskConvolutionRunner::onShapeChange::153] Error Code 1: Cask ( Failed to update runtime arguments.)