Can anyone please explain the result (graph) of this training progress

ART97 · November 14, 2022, 9:35pm

Hi

I recently trained a custom ssd mobilenet model. Below is how the training graph looks like

Capture

As we can see, there are lots of small peaks where the loss increase and then decreased. We can see 6-7 peaks like this. As per my understanding, the loss should keep on decreasing. Is there any reason for such graph?

AastaLLL · November 15, 2022, 2:44am

Hi,

Would you mind using a different step size or optimization approach to see if it helps?

Thanks.

ART97 · November 17, 2022, 11:29am

Hi AastaLLL

Can you please explain how can I define step size or other optimizations? I am only using train_ssd.py to train. Although the model is performing fine but I just wanted to understand the training graph that’s why I posted the question.

dusty_nv · November 17, 2022, 2:58pm

Hi @ART97, there are a bunch of learning rate options and optimizer options that you can set on the command-line to train_ssd.py found here: https://github.com/dusty-nv/pytorch-ssd/blob/21383204c68846bfff95acbbd93d39914a77c707/train_ssd.py#L60

Although admittedly I haven’t messed with these or run this for more than 100 epochs. I believe that may lead to overfitting your model (i.e. attaining the lowest possible loss on your training set doesn’t always generalize the better real-world performance)

system · December 7, 2022, 5:23am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.