Training speed is too low while training

TLT Version → docker_tag: v3.21.08-py3
Network Type → YOLOv4
Config File → spec.txt (2.5 KB)


After running the training command, I observe that the training speed is very low. Training for 120 epochs took 5 hours to complete, and I cannot understand why it takes that long.

I have also attached the configuration file for your reference.

Please use the latest 3.21.11 docker.
In 3.21.08, the settings below are not correct. They are only compatible with the 3.21.11 docker.
loss_loc_weight: 1.0
loss_neg_obj_weights: 1.0
loss_class_weights: 1.0
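For context, these weights live inside the `yolov4_config` block of the spec file in the 3.21.11 release; a minimal sketch (the surrounding fields are elided, and the values shown are the ones from the post, not recommended defaults):

```
yolov4_config {
  # ... anchor shapes, backbone, and other fields go here ...
  loss_loc_weight: 1.0        # weight of the localization loss
  loss_neg_obj_weights: 1.0   # weight of the negative-objectness loss
  loss_class_weights: 1.0     # weight of the classification loss
}
```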

For further improving training speed, please consider:
• Use AMP if your GPU supports it. See more in Optimizing the Training Pipeline — TAO Toolkit 3.21.11 documentation
• Try the tfrecord data loader. In this case, please disable mosaic augmentation. See more in YOLOv4 — TAO Toolkit 3.21.11 documentation
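As a sketch of the second suggestion: with the tfrecord loader, `data_sources` points at the tfrecords rather than raw label files, and mosaic is disabled by setting its probability to zero in `augmentation_config`. Field names follow the 3.21.11 YOLOv4 spec; the paths below are placeholders, not paths from this thread:

```
dataset_config {
  data_sources {
    tfrecords_path: "/workspace/tao-experiments/data/tfrecords/train*"
    image_directory_path: "/workspace/tao-experiments/data/training"
  }
  # ... target class mapping and validation sources go here ...
}
augmentation_config {
  mosaic_prob: 0.0   # disable mosaic when using the tfrecord loader
  # ... other augmentation fields unchanged ...
}
```

For the first suggestion, AMP is typically enabled by passing the `--use_amp` flag to the train command, as described in the Optimizing the Training Pipeline page linked above.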
