YOLOv4 Model Pruning - Extreme Prune Ratios and mAP Drop

• Hardware: NVIDIA GeForce RTX 4090
• Network Type: Yolo_v4
• TLT Version: TAO 5.5.0
• Training spec file:
yolo_v4_train_cspdarknet19_kitti_seq.txt (2.6 KB)
yolo_v4_retrain_cspdarknet19_kitti_seq.txt (2.6 KB)

I am using YOLOv4 with my own dataset, and I ran into some issues when pruning my model. After training (before pruning), my model's mAP is around 0.50. However, when I try to prune with different thresholds (the suggested ones and many others), I always get either an extremely low prune ratio (less than 0.008) or an extremely high one (greater than 0.98).

I am unable to achieve a prune ratio in the typically recommended 10-20% range. Additionally, even when I retrain with a prune ratio of 1 (i.e., nothing pruned) or 0.008, my mAP drops to 0 after retraining.
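
For reference, this is roughly the prune command I am running (the model paths and encryption key are placeholders for my actual values):

```bash
tao model yolo_v4 prune \
    -m /workspace/experiments/yolov4_cspdarknet19_epoch_080.hdf5 \
    -o /workspace/experiments/yolov4_cspdarknet19_pruned.hdf5 \
    -k <encryption_key> \
    -pth 0.1
```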

Any insights on what might be causing this or how to resolve it? Thanks in advance.

This does not make sense. You can refer to https://docs.nvidia.com/tao/tao-toolkit/text/cv_finetuning/tensorflow_1/object_detection/yolo_v4.html#pruning-the-model and try more arguments for the pruning experiments.
BTW, as an example, another user was able to get a pruning ratio of 0.57.
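
For example, besides sweeping -pth, you can also try the other arguments documented there, such as the equalization criterion. A rough sketch of such a sweep (paths and key are placeholders):

```bash
for pth in 0.05 0.1 0.3 0.5 0.7; do
    tao model yolo_v4 prune \
        -m /workspace/experiments/yolov4_unpruned.hdf5 \
        -o /workspace/experiments/yolov4_pruned_pth_${pth}.hdf5 \
        -k <encryption_key> \
        -eq intersection \
        -pth ${pth}
done
```

Then check the pruning ratio printed in each log and pick a checkpoint that is neither barely pruned nor almost empty.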

Thanks, Morganh!

I tried different pruning parameters as you suggested, and I was able to achieve different pruning ratios. However, when I retrained the network, I still got 0 mAP.

I found that setting the model_ema parameter to false solved the issue. Interestingly, there is no explanation of model_ema in the YOLOv4 documentation, but in the original retrain spec file it is set to true by default; changing it to false resolved the problem.
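
For anyone who runs into the same problem, this is the relevant part of my retrain spec now (assuming I am reading the spec layout correctly, model_ema sits under training_config; all other fields are unchanged):

```
training_config {
  # ... other training_config fields unchanged ...
  model_ema: false
}
```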

I also have another question: I couldn't find any documentation stating that the pruning ratio should ideally be between 10-20%, as was suggested here. My understanding is that it should be set based on the specific problem, correct? For example, when I set the ratio to 0.14 and retrained the model, it reached the original accuracy in less than 5 epochs, which I guess is because it wasn't pruned enough.

Would love to hear your thoughts on this!

The ideal pruning ratio depends on the specific problem and model architecture; there is no hard rule. Pruning removes less important weights or neurons from a network to reduce its size and computational cost, so retraining after pruning is essential to recover any lost accuracy. If you're getting 0 mAP after retraining, it could mean the pruning was too aggressive or the retraining process wasn't effective. I suggest an iterative approach: prune a bit → retrain → prune a bit → retrain.
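
A rough sketch of one iteration of that loop with the TAO launcher (spec paths, result directories, and the key are placeholders; the retrain spec should point its pruned_model_path at the newly pruned model):

```bash
# 1. Prune gently first: a small -pth removes only low-magnitude channels.
tao model yolo_v4 prune \
    -m /workspace/experiments/yolov4_unpruned.hdf5 \
    -o /workspace/experiments/yolov4_pruned_step1.hdf5 \
    -k <encryption_key> \
    -pth 0.1

# 2. Retrain to recover accuracy.
tao model yolo_v4 train \
    -e /workspace/specs/yolo_v4_retrain_cspdarknet19_kitti_seq.txt \
    -r /workspace/experiments/retrain_step1 \
    -k <encryption_key>

# 3. Prune the retrained model a bit more and retrain again; stop when
#    mAP no longer recovers to an acceptable level.
```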

The ModelEMA class maintains an Exponential Moving Average (EMA) of the model weights, which can help stabilize training and improve model performance. Since setting model_ema to false resolved your issue with pruning and retraining, the EMA mechanism may have been interfering with that process, possibly through how it updates the model weights.
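
To illustrate the mechanism with a generic sketch (this is not TAO's actual ModelEMA code, just the standard EMA update): the EMA keeps a slowly moving shadow copy of the weights, so if evaluation or export uses a shadow copy that is badly out of sync with the live (pruned and retrained) weights, the reported mAP can collapse.

```python
import numpy as np

def ema_update(shadow, weights, decay=0.999):
    """One EMA step: the shadow weights drift slowly toward the live weights."""
    return [decay * s + (1.0 - decay) * w for s, w in zip(shadow, weights)]

# Toy example: the live weights change abruptly (as they can right after
# pruning), but the shadow copy lags far behind for a long time.
live = [np.ones(4)]
shadow = [np.zeros(4)]  # e.g. a stale average from before the change
for _ in range(100):
    shadow = ema_update(shadow, live)
print(shadow[0])  # still ~0.095 after 100 steps, nowhere near 1.0
```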
