Questions regarding keywords in tensorRT log file

joseph398 · August 11, 2020, 9:55am

I was going over the TensorRT log produced while building an engine and came across the following keywords:
CaskConvolution
CudaConvolution
CudaDepthwiseConvolution
FusedConvActConvolution
LegacySASSConvolution

They appeared in lines like this:
[TensorRT] VERBOSE: --------------- Timing Runner: FeatureExtractor/MobilenetV2/Conv/Conv2D + FeatureExtractor/MobilenetV2/Conv/Relu6 (LegacySASSConvolution)

What exactly do they mean? What are the differences between them?

AakankshaS · August 11, 2020, 7:41pm

Hi @joseph398,
These entries are part of the profiling TensorRT does at build time.
During the build phase, all possible tactics are tried and timed. Profiling this part of the execution will not give any meaningful performance measurements, because it includes every candidate kernel, not just the ones actually selected for inference.
The link below might help in understanding this:
https://docs.nvidia.com/deeplearning/tensorrt/best-practices/index.html#nvprof
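If it helps to see which of these names were timed for each layer, here is a minimal sketch that groups the "Timing Runner" lines of a verbose build log by layer. It uses only Python's standard library and is not part of any TensorRT API. The regular expression is an assumption based solely on the single log line quoted in the question, so other TensorRT versions may need a different pattern. The function name `runners_by_layer` and the second sample line are made up for illustration.

```python
import re
from collections import defaultdict

# Assumed format, inferred from the one line quoted in the question:
#   ... Timing Runner: <layer name> (<runner name>)
TIMING_RUNNER = re.compile(
    r"Timing Runner:\s*(?P<layer>.+?)\s*\((?P<runner>[^()]+)\)\s*$"
)


def runners_by_layer(log_lines):
    """Map each layer name to the runner names timed for it, in log order."""
    seen = defaultdict(list)
    for line in log_lines:
        match = TIMING_RUNNER.search(line)
        if match is None:
            continue  # not a "Timing Runner" line
        layer = match.group("layer")
        runner = match.group("runner")
        if runner not in seen[layer]:
            seen[layer].append(runner)
    return dict(seen)


# First line is copied from the question; the second is a hypothetical
# line for the same layer, added only to show the grouping.
sample = [
    "[TensorRT] VERBOSE: --------------- Timing Runner: "
    "FeatureExtractor/MobilenetV2/Conv/Conv2D + "
    "FeatureExtractor/MobilenetV2/Conv/Relu6 (LegacySASSConvolution)",
    "[TensorRT] VERBOSE: --------------- Timing Runner: "
    "FeatureExtractor/MobilenetV2/Conv/Conv2D + "
    "FeatureExtractor/MobilenetV2/Conv/Relu6 (CudaConvolution)",
]

for layer, runners in runners_by_layer(sample).items():
    print(layer, "->", ", ".join(runners))
```

Keep in mind that this lists everything that was tried during the build, not what was finally chosen for inference.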

Thanks!
