I am following this guide for YOLOv7 QAT: https://github.com/NVIDIA-AI-IOT/yolo_deepstream/tree/main/yolov7_qat
In this repo the BN layers are fused into the Conv layers before PTQ. I don't want to fuse the BN layers into the Conv layers, so I commented out 2 lines (https://github.com/NVIDIA-AI-IOT/yolo_deepstream/blob/5af35bab7f6dfca7f1f32d44847b2a91786485f4/yolov7_qat/scripts/qat.py#L79), but when I run the script I get the error shown in the traceback below.
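For context, the fusion I disabled is essentially the standard inference-time Conv+BN fold that YOLOv7 performs. This is only an illustrative sketch of that fold written from memory, not the repo's exact code:

```python
import torch
import torch.nn as nn

def fuse_conv_and_bn(conv: nn.Conv2d, bn: nn.BatchNorm2d) -> nn.Conv2d:
    # Standard fold: absorb the BN scale/shift into the Conv weight/bias.
    fused = nn.Conv2d(conv.in_channels, conv.out_channels,
                      kernel_size=conv.kernel_size, stride=conv.stride,
                      padding=conv.padding, dilation=conv.dilation,
                      groups=conv.groups, bias=True).to(conv.weight.device)
    # W_fused = diag(gamma / sqrt(var + eps)) @ W_conv
    w_conv = conv.weight.clone().view(conv.out_channels, -1)
    scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
    fused.weight.data = (torch.diag(scale) @ w_conv).view(fused.weight.shape)
    # b_fused = gamma * (b_conv - running_mean) / sqrt(var + eps) + beta
    b_conv = conv.bias if conv.bias is not None else torch.zeros(
        conv.out_channels, device=conv.weight.device)
    fused.bias.data = scale * (b_conv - bn.running_mean) + bn.bias
    return fused
```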
Traceback (most recent call last):
  File "scripts/qat_BN.py", line 338, in <module>
    cmd_quantize(
  File "scripts/qat_BN.py", line 179, in cmd_quantize
    quantize.apply_custom_rules_to_quantizer(model, export_onnx)
  File "/yolov7_custom_dataset/quantization/quantize.py", line 222, in apply_custom_rules_to_quantizer
    export_onnx(model, "quantization-custom-rules-temp.onnx")
  File "scripts/qat_BN.py", line 138, in export_onnx
    quantize.export_onnx(model, dummy, file, opset_version=13,
  File "/yolov7_custom_dataset/quantization/quantize.py", line 394, in export_onnx
    torch.onnx.export(model, input, file, *args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/torch/onnx/utils.py", line 506, in export
    _export(
  File "/usr/local/lib/python3.8/dist-packages/torch/onnx/utils.py", line 1548, in _export
    graph, params_dict, torch_out = _model_to_graph(
  File "/usr/local/lib/python3.8/dist-packages/torch/onnx/utils.py", line 1180, in _model_to_graph
    params_dict = _C._jit_pass_onnx_constant_fold(
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select)
Do you have any suggestions for me? This happens when converting the model to ONNX.
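My guess is that with BN left unfused, some tensors (for example the BN buffers, or constants created during ONNX constant folding) stay on the CPU while the rest of the model is on cuda:0. Would simply moving everything onto one device before export be the right fix? A minimal, untested sketch; the input size and the input/output names here are my assumptions, not necessarily what the repo's export_onnx uses:

```python
import torch

def export_onnx_on_cpu(model, file, img_size=640):
    # Put the whole model on the CPU so the BN buffers (running_mean/running_var)
    # and the dummy input are on the same device during constant folding.
    model = model.cpu().eval()
    dummy = torch.zeros(1, 3, img_size, img_size, device="cpu")
    torch.onnx.export(model, dummy, file, opset_version=13,
                      input_names=["images"], output_names=["outputs"],
                      do_constant_folding=True)
```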