| Topic | Replies | Views | Activity |
|---|---|---|---|
| Real-Time Inference on Thor & RTX Pi0.5/GR00T N1.6/1.7 Thor 23 Hz RTX 5090 50-80Hz | 2 | 50 | May 4, 2026 |
| Jetson Thor - INT8 quantization show no performance gain over FP16 (2) | 5 | 294 | March 27, 2026 |
| Nvfp4 Dynamic Quantizer Very Slow with Bias | 3 | 84 | January 29, 2026 |
| Jetson Thor - INT8 quantization show no performance gain over FP16 | 8 | 449 | February 9, 2026 |
| Quantized GeMM using fp32 for Q/DQ layers | 0 | 59 | January 2, 2026 |
| Worse performance after quantization on TensorRT | 1 | 211 | December 23, 2025 |
| Failing to load pruned model yolov7 using tensorrt model opt | 1 | 61 | November 26, 2025 |
| TensorRT Quantization for Jetson Inference | 4 | 433 | October 17, 2025 |
| ConvNeXT inference with int8 quantization slower on tensorRT than fp32/fp16 | 2 | 317 | September 19, 2025 |
| 30% slowdown on ResNet50 with ModelOptimizer INT8 quantization (RTX 4090) | 0 | 122 | September 19, 2025 |
| Holistically-Nested Edge Detection using TensoRT | 8 | 360 | April 9, 2025 |
| Errors with training flux of sparsity with accelerate | 2 | 114 | March 4, 2025 |
| TensorRT examples | 1 | 114 | February 28, 2025 |
| How to use same tensor rt version of Jetson orin nano in desktop PC environment | 4 | 170 | March 26, 2025 |
| INT8 Calibration with DS 6.3 worse than with DS 6.0 | 20 | 411 | March 10, 2025 |
| Is there a plan to support DLA on the next TensorRT version? | 5 | 376 | December 31, 2024 |
| [TRT] jetson agx orion error - CaffeParser: Could not open file device GPU, failed to load networks/Googlenet/bvlc_googlenet.caffemodel | 4 | 139 | October 18, 2024 |
| Improving the speed for fp32 for yolov10x inference from Ultralytics on Jetson AGX Orin 64g devkit | 5 | 224 | September 18, 2024 |
| Converting an ONNX model to TensorRT Engine on a x86/64 PC and then using it on a Jetson | 2 | 175 | August 3, 2024 |
| [New] Discord channel for triton-inference-server, tensorrt, tensorrt-llm, model-optimization | 0 | 235 | July 16, 2024 |
| TensorRT 10.2 is not using FP8 convolution tactics when building a FP8 quantized conv model | 2 | 345 | July 10, 2024 |
| GPUs hang when executing NIM docker container on a 4xA100 | 2 | 201 | June 29, 2024 |
| /TopK_5: K exceeds the maximum value allowed (3840) | 0 | 544 | June 11, 2024 |