DIGITS 5, Caffe with cuDNN: Why does accuracy start over from 0 when using a pre-trained network?

snarky · May 1, 2017, 3:04am

I am using Digits 5 and CUDA 8, on Ubuntu 14.04, on Amazon K80-based instances.

I trained a network for 30 epochs, and it was doing alright, and accuracy was still climbing when it stopped.

I wanted to “keep training” which I impemented as:

new training job
same data set
pre-trained model
without customization

When I start training, the 0th epoch is showing the accucacy of the incoming model.
However, as soon as training starts, the accuracy dives down to “no better than random” (there are 20 classes; accuracy is 5%)
It takes about as long to start climbing in accuracy again as it did when I originally trained the model.

Something is corrupting all the progress made in the pre-trained model.
What would cause this?

Topic		Replies	Views
DIGITS accuracy not showing in the chart Deep Learning (Training & Inference)	0	419	October 15, 2018
DIGITS: Deep Learning GPU Training System Technical Blog	54	727	January 7, 2025
Detectnet training appears slower CUDA Programming and Performance	0	402	November 3, 2016
AWS K80 Docker Docker and NVIDIA Docker	2	987	June 4, 2020
Easy Multi-GPU Deep Learning with DIGITS 2 Technical Blog	34	670	January 28, 2016
cuDNN starting point / example applications GPU-Accelerated Libraries	0	440	March 13, 2018
cuDNN8: extreamly slow first iteration of CNN training or inference cuDNN	3	1726	December 30, 2021
Setting up Digits 4 Ubuntu Box CUDA Setup and Installation	3	808	August 23, 2016
Loss of accuracy with cuCdiv? CUDA Programming and Performance	1	1024	February 23, 2012
[DIGITS] Retrain custom classes on DetectNet, model doesn't converge. Deep Learning (Training & Inference)	0	511	June 20, 2018

DIGITS 5, Caffe with cuDNN: Why does accuracy start over from 0 when using a pre-trained network?

Related topics