int8 fails for group convolutions (depthwise) on Xavier

Hello!

I am trying to get MobileNet-v1 working entirely in INT8 on the Jetson Xavier.

I followed the documentation and called cudnnConvolutionForward with the recommended parameters:

  • algo => _IMPLICIT_PRECOMP_GEMM
  • config => INT8_CONFIG
  • layout => _NHWC

With this setup, normal convolution works well. However, grouped convolution does not (there is no problem when using FP16 or FP32). Isn't grouped INT8 convolution supported?

Surprisingly, if I set the following parameters instead, both normal and grouped convolution work:

  • algo => _IMPLICIT_GEMM
  • config => INT8_CONFIG
  • layout => _NCHW

With this second setup, normal convolution runs as fast as with the first (supposedly correct) one, and grouped convolutions also work, although very slowly. How is it possible that, with the "wrong" layout and algorithm, normal convolutions still work, and even that fast?

Also, I'd like to ask whether there is a function to convert F32 to S8, and whether there is a layout-conversion API as well.

Thanks a lot!
