Sparse convolution using tensorrt

soohyung.zhang · January 15, 2023, 5:47pm

As I understand sparsity inference using tensorrt we need following process.

Use pytorch to find optimal pth(dense network) results after learning.
Reproduce the pth file through sparse relearning using ASP in apex.
Convert the reproduced pth to onnx.
Convert onnx to tensorrt plan file (.trt).

./workspace/TensorRT/build/out/trtexec \
–onnx=/workspace/TensorRT/model/resnext101_32x8d_pyt_torchvision_sparse.onnx \ –saveEngine=/workspace/TensorRT/model/resnext101_engine.trt
–explicitBatch
–sparsity=enable
–fp16

inference the plan file using tensorrt.

I have a question here.
Does tensorrt inference using 2-bit indices (shown in the figure above) information in addition to sparse matrix data?
I don’t think pth, onnx, trt files are structures containing 2 bit indices information. How can tensorrt use the 2 bit indices information which is shown in the picture above?

spolisetty · January 18, 2023, 11:35am

Hi,

Please refer to the following docs, hope they are helpful. Please let us know if your query is still not answered.

Thank you.

soohyung.zhang · January 20, 2023, 11:14am

Thank you!

system · February 3, 2023, 11:15am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Add support to working with data-dependent tensor shapes TensorRT	2	368	December 31, 2023
2:4 sparsity doesnot improve inference performance on RTX 3090 TensorRT tensorrt	14	3042	September 9, 2022
Sparse tensor math speedup on Ampere TensorRT tensorrt , cuda	1	347	December 20, 2023
Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT Technical Blog	13	2709	June 2, 2023
Sparsity does not provide any speedup for TensorRT on DLA Jetson AGX Orin cudnn	6	749	January 22, 2024
Structure Sparsity not working with BERT large TensorRT	11	1007	July 7, 2022
How does tensorRT behave? Jetson Nano tensorrt	2	336	February 9, 2022
TensorRT python API inference is inconsistent with trtexec inference TensorRT tensorrt	1	936	February 28, 2023
Differences between tensorflow model inference and tensorRT model inference TensorRT tensorrt , tensorflow	6	1564	September 14, 2022
Tensorrt 8.6 GA : C++ Inference gives diffrence results compared to onnx \|\| pt model python inference TensorRT	3	609	September 20, 2023

Sparse convolution using tensorrt

Related topics