ARM64 does not support NUMA - returning NUMA node zero

I realize there are similar posts, but none seem to quite fit. The most similar post:

indicates a “known issue” and suggests reverting from TensorFlow v2.6.0+nv21.9 to v2.5.0+nv21.8. That post is from October 2021, so it is hard to believe the issue was not fixed before our release.
TensorFlow Version: 2.7.0
NVIDIA TensorFlow Container: 22.01
JetPack Version: 4.6.1
Please advise…

Hi,

This should be a harmless warning only.
Do you get any errors or incorrect results from this?

Thanks.

From what I gathered online, this message means it is not using the GPU? The code runs and completes, and the results are correct, but it is quite a bit slower than expected. Running CPU-only on my desktop is 25% faster. Perhaps there is some other configuration issue, if this message does not mean the GPU is being ignored?
So I missed this output later on… it seems to indicate the GPU may be in use, but surprisingly slow:
2022-07-06 10:33:32.139760: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1525] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 23535 MB memory: -> device: 0, name: Xavier, pci bus id: 0000:00:00.0, compute capability: 7.2
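A quick way to confirm whether TensorFlow is actually placing work on the GPU, independent of the NUMA warning, is to list the visible devices and turn on device-placement logging. This is only a minimal sketch using the standard tf.config API; the exact output will vary by setup:

import tensorflow as tf

# An empty list here means TensorFlow will silently fall back to the CPU.
print(tf.config.list_physical_devices('GPU'))

# Log where each op runs so GPU placement can be confirmed in the console output.
tf.debugging.set_log_device_placement(True)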

Hi,

No. The NUMA warning is unrelated to GPU usage; it only means the platform does not report NUMA nodes, so node zero is assumed.

For performance, could you first try maximizing the device clocks?

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

Thanks.

Yes, I did both of those: the first after install, and the second yesterday after my post. Although it improved the performance, it is still pretty “bad” comparatively. I can run a CPU-intensive, ordinary C++ app on both machines, and in that case I get better results on the Xavier (CPU only). But running model predictions with TF/Keras in Python on the Xavier using the GPU, the results are disappointing.

The prediction inputs are fairly large; there is quite a bit of data and many models in a single call, so perhaps GPU memory is constraining performance? I ran trials reducing both, and the performance gets much better, but in no case did it reach the performance of my desktop (Windows, no less) without a GPU. When I reduce the load, it is a bit more than a factor of 2 slower. For larger loads the two systems even out; perhaps both are reaching their limits?

My GPU experience is limited, so I am looking into what else I might do to improve the performance. Are there any other parameters I might tweak related to the GPU? I am also considering how I might break up the data sets and merge the results… but that is work :)
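One detail worth noting for the memory question: on Jetson the CPU and GPU share the same physical memory, and by default TensorFlow reserves most of it up front, which can leave little headroom for the rest of the pipeline. A minimal sketch of restricting that allocation, assuming TF 2.x and a single GPU (the 4096 MB cap is only an illustrative value):

import tensorflow as tf

gpus = tf.config.list_physical_devices('GPU')
if gpus:
    # Grow the allocation on demand instead of claiming nearly all memory at startup.
    tf.config.experimental.set_memory_growth(gpus[0], True)
    # Alternatively, cap it at a fixed size (cannot be combined with memory growth):
    # tf.config.set_logical_device_configuration(
    #     gpus[0],
    #     [tf.config.LogicalDeviceConfiguration(memory_limit=4096)])

Either call has to run before the first model is built; otherwise TensorFlow raises an error.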

Hi,

We recommend users try our TensorRT inference engine for performance.

We are not sure what kind of model you want to use.
Below are some TensorRT benchmark results for your reference:

Thanks.
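For a model that is already a TensorFlow SavedModel, TF-TRT lets you try TensorRT without leaving Python. A rough sketch, assuming a TF 2.x build with TensorRT support (as in the NVIDIA containers); the directory names are placeholders and the exact converter arguments vary slightly between TF versions:

from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Rewrite supported subgraphs of the SavedModel to run through TensorRT.
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir='saved_model_dir',    # placeholder path
    precision_mode=trt.TrtPrecisionMode.FP16)   # FP16 generally suits Xavier well
converter.convert()
converter.save('trt_model_dir')                 # placeholder path

The saved result loads back with tf.saved_model.load() like any other SavedModel.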
