Digits 6.0 train caffe model error

2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.source …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.backend …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.source …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.backend …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.source …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.backend …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.source …
2019-06-13 20:04:15 [20190613-200413-bd2b] [WARNING] Ignoring data_param.backend …
2019-06-13 20:04:15 [20190613-200413-bd2b] [DEBUG] Network sanity check - train
2019-06-13 20:04:15 [20190613-200413-bd2b] [DEBUG] Network sanity check - val
2019-06-13 20:04:15 [20190613-200413-bd2b] [DEBUG] Network sanity check - deploy
2019-06-13 20:04:15 [20190613-200413-bd2b] [INFO ] Train Caffe Model task started.
2019-06-13 20:04:15 [20190613-200413-bd2b] [INFO ] Task subprocess args: “/home/user/caffe-0.15.1/build/tools/caffe train --solver=/home/user/DIGITS-6.0.0/digits/jobs/20190613-200413-bd2b/solver.prototxt --gpu=0 --weights=/home/user/DIGITS-6.0.0/digits/jobs/20190613-193819-b6c8/model.caffemodel”
2019-06-13 20:04:17 [20190613-200413-bd2b] [ERROR] Train Caffe Model task failed with error code 1

I installed digits6.0 with NV-caffe-0.15.1 and follow the tutorial in git to start object detection. I can’t get efficient information from the error message. Can anyone help me?

Thanks!!

minist task can be perfectly finished in digits. What’s the meaning of error code 1?Here is the information shown in local host:

Train Caffe Model Error

Initialized at 08:04:13 PM (1 second)
Running at 08:04:15 PM (2 seconds)
Error at 08:04:17 PM
(Total - 3 seconds)

[b]ERROR: error code 1

Memory required for data: 1274879360
Creating layer bbox_loss
Creating Layer bbox_loss
bbox_loss ← bboxes-obj-masked-norm
bbox_loss ← bbox-obj-label-norm
bbox_loss → loss_bbox
Setting up bbox_loss
Top shape: (1)
with loss weight 2
Memory required for data: 1274879364
Creating layer coverage_loss
Creating Layer coverage_loss
coverage_loss ← coverage_coverage/sig_0_split_0
coverage_loss ← coverage-label_slice-label_4_split_0
coverage_loss → loss_coverage
Setting up coverage_loss
Top shape: (1)
with loss weight 1
Memory required for data: 1274879368
Creating layer cluster
[/b]

I think the cluster layer was not created successfully. Since it’s not clear about your entire environment, can you try to run the same task with container? You can pull one at our NGC website https://ngc.nvidia.com. You don’t have to login, just click the ‘Explore Accelerated Software’ and continue as guest. The DIGITS container has two flavors, tensorflow and nvcaffe. Please pull the nvcaffe one

docker pull nvcr.io/nvidia/digits:19.05-caffe

.