thank you, I fixed the issue and made it a lib GitHub - ELS-RD/transformer-deploy: Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Bpnet sample code error | 13 | 775 | October 11, 2022 | |
EfficientDet in Deepstream Causes a Seg Fault | 15 | 1068 | July 19, 2022 | |
Failed to create .engine File | 33 | 2037 | July 11, 2022 | |
TensorRT8 INT8 (signed char) I/O interface for ONNX model | 4 | 1364 | February 15, 2022 | |
TAO toolkit fails to convert RetinaNet INT8 etlt model to INT8 CUDA engine (calibration cache needs to be deleted?) | 4 | 458 | June 10, 2022 | |
Post-Training Quantization (PTQ) for semantic segmentation model running on Jetson Orin NX | 24 | 219 | March 26, 2025 | |
Is the deepstream_lpr_app supposed to work with DS 7.0? | 7 | 235 | May 17, 2024 | |
Tensorrt fp32 inference slower than pytorch on tesla T4 for groundingDINO | 1 | 554 | January 22, 2024 | |
Low performance when running pipeline with RTX 4090 | 24 | 542 | March 21, 2024 | |
Tensorrt can not speed up well | 7 | 1612 | June 29, 2022 |