Failed to deserialize model using Imagenet-Console

Hi,

I’m trying to use imagenet-console from jetson-inference on a custom model. It works fine with FP16, but for FP32, I get the following error:

[GIE] network profiling complete, writing code to …caffemodel.1.tensorcache
[GIE] completed writing cache to …caffemodel.1.tensorcache
[GIE] …caffemodel loaded
[GIE] failed to allocate -1 bytes to deserialize model
failed to load …caffemodel
imageNet – failed to initialize
imagenet-console: failed to initialize imageNet
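
For reference, the command I'm running is roughly the following (the paths are placeholders for my actual files, and the blob names are the ones from my deploy.prototxt):

./imagenet-console input.jpg output.jpg --prototxt=/path/to/deploy.prototxt --model=/path/to/model.caffemodel --labels=/path/to/labels.txt --input_blob=data --output_blob=prob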

How can I fix this? Thanks.

Hi,

This issue might come from loading an incorrect FP16 cache file.
Could you delete the …caffemodel.1.tensorcache file and try again?
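
For example, something like this (the path below is just a placeholder; jetson-inference writes the .tensorcache next to your .caffemodel):

rm /path/to/your_model.caffemodel.1.tensorcache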

Thanks.

Hi,

That doesn’t appear to be the issue.

Hi,

Could you test your model with the TensorRT sample app directly?

cp -r /usr/src/tensorrt/ .
cd tensorrt/samples/
make
cd ../bin/
./giexec --deploy=/path/to/prototxt --output=name_of_output_blob
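
If you also want giexec to load your trained weights instead of profiling with random ones, you can additionally pass the caffemodel (if I recall this TensorRT version's flags correctly):

./giexec --deploy=/path/to/prototxt --model=/path/to/caffemodel --output=name_of_output_blob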

Thanks.

When I test the model without using sudo, it gives me the following error:

Input "data": 3x224x224
Output "prob": 1000x1x1
cudnnEngine.cpp (55) - CUDA Error in initializeCommonContext: 4
could not build engine
Engine could not be created
Engine could not be created

However, when I run the same command with sudo, it works fine.

Is the user you are running the program as a member of the "video" group? See:

grep video /etc/group

(this gives access to the GPU)
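
If it isn't, adding the user to that group and logging back in usually fixes this ("your_username" is a placeholder):

sudo usermod -aG video your_username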

Yes, it is.

“strace” has a lot of output, but you might be able to get a useful subset of messages. If your program is “program arg1”, then it would go something like this while running as your regular user and without sudo:

strace program arg1 2>&1 | egrep -i permission
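
For the giexec run above, that could look roughly like this (-f also follows child processes; the pattern is just a guess at the strings worth matching):

strace -f ./giexec --deploy=/path/to/prototxt --output=name_of_output_blob 2>&1 | grep -iE 'permission|denied|eacces'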

Is there a problem if I just run the command with sudo? The main error that I’m trying to fix is the failure to deserialize the model.

The strace is just to see where the permissions go wrong. I'm guessing you shouldn't need to run as root, and if that's the case, seeing where a permission check fails might provide other clues. You can keep running as root if that works for you.

Hi,

Here is a similar topic:
https://devtalk.nvidia.com/default/topic/1010200/gpu-accelerated-libraries/tensorrt-error-could-not-build-engine/

Could you check whether the suggestion in the last comment also works for you?
------------------------------------------------------------
I have the same problem with inference running in docker. I found the problem.
You need to set the right CUDA_ARCH for your GPU.

For example:
export CUDA_ARCH="50 52"
------------------------------------------------------------

Thanks.

What would be the correct CUDA_ARCH for the tx1?

The TX1's GPU is compute capability 5.3, so CUDA_ARCH="53".
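
I'm not sure exactly where jetson-inference picks that variable up, but following the quoted comment, a rough sketch before rebuilding would be:

export CUDA_ARCH="53"
cd jetson-inference/build
cmake ../
make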

Unfortunately, setting CUDA_ARCH didn’t work either.

Hi,

We haven't seen this issue before.
Could you try re-flashing your device with JetPack to check whether the issue still occurs?

Thanks.

Hi rtk1357,

Have you resolved the problem? Is there any result you can share?

Thanks

Hi,

I’ve decided to put this issue aside for now to work on other things.