I use this line, `optimizer = torch.optim.SGD(model.parameters(), args.lr, momentum=args.momentum, weight_decay=args.weight_decay)`, to apply L2 regularization and prevent overfitting. Conventionally, regularization penalizes only the model's weight parameters W and leaves the bias parameters b unpenalized. However, I have seen posts online claiming that the `weight_decay` argument of the `torch.optim` optimizers applies to every parameter in the network, i.e. it penalizes both the weights W and the biases b at the same time. Is that right?
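Here is a small experiment I sketched to check this myself (the layer and hyperparameter values are arbitrary placeholders, not from my actual training script). With the loss gradients forced to zero, any change in the bias after one step can only come from the decay term; the second half shows how per-parameter groups could exempt biases, if that turned out to be necessary:

```python
import torch

# With zero gradients, any change to the bias after opt.step()
# can only come from weight_decay itself.
torch.manual_seed(0)
lin = torch.nn.Linear(2, 1)
opt = torch.optim.SGD(lin.parameters(), lr=0.1, weight_decay=0.1)

b_before = abs(lin.bias.item())
for p in lin.parameters():
    p.grad = torch.zeros_like(p)  # pretend the loss gradient is zero
opt.step()
b_after = abs(lin.bias.item())

print(b_after < b_before)  # True here would mean the bias is decayed too

# If one wanted to exempt biases, torch.optim supports per-group
# weight_decay; splitting on the parameter name is a common heuristic.
decay, no_decay = [], []
for name, p in lin.named_parameters():
    (no_decay if name.endswith("bias") else decay).append(p)
opt2 = torch.optim.SGD(
    [{"params": decay, "weight_decay": 1e-4},
     {"params": no_decay, "weight_decay": 0.0}],
    lr=0.1, momentum=0.9,
)
```

With zero gradients and decay factor 0.1 at lr 0.1, the bias should shrink by a factor of 0.99 per step if (and only if) `weight_decay` touches it.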