JP4.4 production release and PyTorch 1.6rc2 issue

I inadvertently updated my machine from JP4.4-dp “developer preview” to JP4.4-pr “production release”.

This requires me to use PyTorch 1.6rc2 (release candidate), which breaks my application (Mozilla-TTS) with the error message:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!

The same application with the same configuration and data on JP4.4-dp and PyTorch 1.5 did not show this error.

Is there any way to a) roll my machine back to JP4.4-dp or b) compile/install PyTorch 1.5 for JP4.4-pr?

When will the PyTorch 1.6 final release be published, and will it address my issue?

Hi @dkreutz, I believe PyTorch 1.6-final is expected sometime next week or the week after; of course, it depends on when the PyTorch maintainers release it.

I’m not sure, you may have to check the PyTorch Issues on GitHub or file an issue with them. It’s also unclear if this is a bug or is actually the result of a bug fix.

If Mozilla-TTS is an upstream project, you may want to file an issue with them to test against PyTorch 1.6. That message seems to mean that you called tensor.cuda() on an input but did not call net.cuda() on the model (or some variation of that, perhaps with multiple tensors). You may want to go through and make sure all the tensors/models are on the GPU.
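As a rough illustration of that check, here is a minimal sketch in Python. TinyNet, pick_device and tensors_off_device are hypothetical names made up for this example and are not part of Mozilla-TTS or PyTorch; the sketch assumes the model is an ordinary torch.nn.Module and falls back to the CPU when CUDA is not available.

```python
import torch
import torch.nn as nn


def pick_device():
    # Use the first GPU when CUDA is usable, otherwise fall back to the CPU.
    return torch.device("cuda:0" if torch.cuda.is_available() else "cpu")


def tensors_off_device(model, device):
    # Hypothetical helper: names of parameters and buffers not on `device`.
    named = list(model.named_parameters()) + list(model.named_buffers())
    return [name for name, t in named if t.device != device]


class TinyNet(nn.Module):
    # Hypothetical stand-in model, only here to make the sketch runnable.
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 2)
        # Registered buffers follow the module when .to()/.cuda() is called.
        self.register_buffer("scale", torch.ones(2))

    def forward(self, x):
        return self.linear(x) * self.scale


device = pick_device()
model = TinyNet().to(device)            # moves parameters and buffers
inputs = torch.randn(3, 4).to(device)   # inputs must be on the same device
outputs = model(inputs)

stray = tensors_off_device(model, device)
print("device:", device, "stray tensors:", stray)
```

If the list of stray tensors is empty and the error still appears, one possible culprit is a tensor created inside forward() without a device argument, since such tensors are created on the CPU by default and are not visible to this check.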

Unfortunately I don’t believe so, you would probably need to re-flash with the DP (L4T R32.4.2) release.

I wasn’t able to build PyTorch prior to 1.6 for JP 4.4-pr (L4T R32.4.3), because there were cuDNN errors that needed to be patched. Otherwise I would have provided the 1.5 wheels for JP 4.4-pr as well.

Thanks @dusty_nv for answering. In the meantime I have installed and successfully run the same application code, configuration and dataset (Mozilla TTS) on my Xavier-NX, which is still on JP4.4dp and PyTorch 1.5. So I conclude there might be an application issue with PyTorch 1.6.

Is PyTorch 1.6 available for JP4.4dp? I can’t find the PyTorch announcement message in this forum any more…

Here’s the link to the PyTorch topic: https://forums.developer.nvidia.com/t/pytorch-for-jetson-nano-version-1-5-0-now-available/72048

You could build it from source for JP 4.4 DP; I don’t personally plan on building more PyTorch wheels for the DP release. You may want to post your issue to the PyTorch GitHub and ask about changes in 1.6 that may have led to this change in behavior.

Thanks, will try both…

I built PyTorch 1.6rc2 from source on JP4.4 DP and see the same error.

PyTorch 1.6rc3 has been available for a few days. I am building it right now on both JP4.4 DP and GA and will report back later (the build takes 8-10h)…