Triton Inference Server with TensorRT: Error Code 1: Cask (Cask convolution execution)


I ran into a problem.

I set up a Triton Inference Server. My ONNX model works in Triton Inference Server, but my TensorRT model does not work on the server.

Here is my problem; I have attached two pictures.

First, this is my NVIDIA driver, toolkit, and CUDA version:

Second, this is the error in Triton Inference Server:

Please bear with me, as my English is weak. :(

Thank you for reading my topic.


TensorRT Version: 8.6.16
GPU Type: NVIDIA GeForce RTX 3060
Nvidia Driver Version: 536.40
CUDA Version: 12.2
CUDNN Version: 12.1
Operating System + Version: Windows 11
Python Version (if applicable): 3.9
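A frequent cause of `Error Code 1: Cask (Cask convolution execution)` is a serialized engine (`model.plan`) that was built with a different TensorRT version, or on a different GPU, than the one Triton runs it with. One way to rule this out is to rebuild the engine with the `trtexec` binary shipped inside the same Triton container, so the build-time and runtime TensorRT versions match exactly. A minimal sketch, assuming the ONNX model sits in the current directory; the container tag and file names are placeholders, not taken from the original post:

```shell
# Rebuild the TensorRT engine inside the Triton container itself, so the
# engine is serialized with the exact TensorRT version Triton loads it with.
# <xx.yy> is a placeholder for the Triton release tag actually in use;
# model.onnx / model.plan are placeholder file names.
docker run --rm --gpus all -v "%cd%":/workspace \
  nvcr.io/nvidia/tritonserver:<xx.yy>-py3 \
  /usr/src/tensorrt/bin/trtexec \
    --onnx=/workspace/model.onnx \
    --saveEngine=/workspace/model.plan
```

A serialized engine is only valid for the GPU architecture it was built on, so it should be generated on the same RTX 3060 that will serve it.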

We recommend you raise this query in the Triton Inference Server GitHub repository's issues section.