Why is TensorRT's performance poor after adding a custom op?

Hi, NV experts:
I have a custom op that is not supported by TensorRT, so I added it to TensorRT as a plugin.
I found that the total inference time increased by about 10 ms.
My tests were as follows:

  1. I removed this custom op from my ONNX file and exported it as a .plan file through trtexec; the cost of the whole network was about 50 ms.
  2. I added this custom op (it just does a cudaMemcpy of a little data; see the sketch after this list) to my ONNX file and exported it as a .plan file through trtexec; the cost of the whole network was about 60 ms.
  3. I made my code return directly in the enqueue function, and the cost of the whole network was still about 60 ms. The code looks like this:
int MyPluginDynamic::enqueue(const nvinfer1::PluginTensorDesc* inputDesc,
                             const nvinfer1::PluginTensorDesc* outputDesc,
                             const void* const* inputs, void* const* outputs,
                             void* workspace, cudaStream_t stream) TRT_NOEXCEPT {
    return 0;  // return immediately without launching any work
}
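
For reference, this is roughly what the enqueue from step 2 does. It is a minimal sketch, not the exact code: it assumes one input, one output, and FP32 data, and the byte count is just the product of the input dimensions.

// Sketch of the step-2 enqueue: an async device-to-device copy of the first
// input to the first output, on the stream TensorRT provides, so no extra
// synchronization point is introduced. One input/one output and FP32 are
// assumptions; the real op may differ. (Requires NvInfer.h and cuda_runtime_api.h.)
int MyPluginDynamic::enqueue(const nvinfer1::PluginTensorDesc* inputDesc,
                             const nvinfer1::PluginTensorDesc* outputDesc,
                             const void* const* inputs, void* const* outputs,
                             void* workspace, cudaStream_t stream) TRT_NOEXCEPT {
    // Element count = product of the (concrete, at enqueue time) input dims.
    size_t count = 1;
    for (int i = 0; i < inputDesc[0].dims.nbDims; ++i)
        count *= inputDesc[0].dims.d[i];
    cudaError_t err = cudaMemcpyAsync(outputs[0], inputs[0],
                                      count * sizeof(float),
                                      cudaMemcpyDeviceToDevice, stream);
    return err == cudaSuccess ? 0 : -1;
}

Even when this body is replaced by the bare return 0; shown in step 3, the ~10 ms gap remains, so the copy itself does not seem to be the cost.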

I don't know why TRT's performance is poor after I add such a small custom op. My guesses:

  1. there is some secret about TRT that I don't know;
  2. my op introduces extra overhead that I don't know about (one way to check this is per-layer profiling with trtexec, sketched below).

So, is there anyone who would like to teach me this secret?
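
A minimal sketch of that per-layer check, assuming the engine file is named model.plan and the plugin is built as libmy_plugin.so (both names are placeholders):

trtexec --loadEngine=model.plan --plugins=libmy_plugin.so --dumpProfile --separateProfileRun

--dumpProfile prints the time spent in each layer, so the plugin layer, and any reformat layers TRT may insert around it, would show their individual cost.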

Nobody would like to help me?