Hi,
We didn’t meet the OpenCV issue.
It seems iter_image_recordio_2.cc can work in our environment.
However, we meet another CUDA related issue.
We are checking this issue internally.
Will share more information with you later.
$ python3 finetune.py
[13:00:20] /home/nvidia/topic_175013/mxnet/src/nnvm/legacy_json_util.cc:209: Loading symbol saved by previous version v0.8.0. Attempting to upgrade...
[13:00:20] /home/nvidia/topic_175013/mxnet/src/nnvm/legacy_json_util.cc:217: Symbol successfully upgraded!
[13:00:20] /home/nvidia/topic_175013/mxnet/src/io/iter_image_recordio_2.cc:178: ImageRecordIOParser2: ./caltech-256-60-train.rec, use 4 threads for decoding..
[13:00:27] /home/nvidia/topic_175013/mxnet/src/io/iter_image_recordio_2.cc:178: ImageRecordIOParser2: ./caltech-256-60-val.rec, use 4 threads for decoding..
Traceback (most recent call last):
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/symbol/symbol.py", line 1938, in simple_bind
ctypes.byref(exe_handle)))
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/base.py", line 246, in check_call
raise get_last_ffi_error()
mxnet.base.MXNetError: Traceback (most recent call last):
File "/home/nvidia/topic_175013/mxnet/src/engine/./../common/cuda_utils.h", line 395
CUDA: Check failed: e == cudaSuccess || e == cudaErrorCudartUnloading: invalid device ordinal
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "finetune.py", line 214, in <module>
mod_score = fit(new_sym, new_args, aux_params, train, val, batch_size, num_gpus)
File "finetune.py", line 196, in fit
eval_metric='acc')
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/module/base_module.py", line 498, in fit
for_training=True, force_rebind=force_rebind)
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/module/module.py", line 429, in bind
state_names=self._state_names)
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/module/executor_group.py", line 280, in __init__
self.bind_exec(data_shapes, label_shapes, shared_group)
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/module/executor_group.py", line 384, in bind_exec
shared_group))
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/module/executor_group.py", line 678, in _bind_ith_exec
shared_buffer=shared_data_arrays, **input_shapes)
File "/home/nvidia/.local/lib/python3.6/site-packages/mxnet/symbol/symbol.py", line 1944, in simple_bind
raise RuntimeError(error_msg)
RuntimeError: simple_bind error. Arguments:
data: (16, 3, 224, 224)
softmax_label: (16,)
Traceback (most recent call last):
File "/home/nvidia/topic_175013/mxnet/src/engine/./../common/cuda_utils.h", line 395
CUDA: Check failed: e == cudaSuccess || e == cudaErrorCudartUnloading: invalid device ordinal
Thanks.