JetsonAGX Orin: System-level Cache

rnaza005 · November 21, 2024, 1:32am

Hello,

In the Jetson AGX Orin Technical brief document, I see that the 4MB system-level cache is accessible from both the CPU and the GPU according to Figure 2:

However, when we see the figure 4. the GPU has no access to the system-level cache:

Figure 8 shows that the system-level cache is part of the CPU complex:

Then we come to Figure 9 and see that the GPU has direct access to the system-level cache and memory controller interface:

I saw one old question (the linkorin-system-cache) where it turns out that the system-level cache is actually L4 for the CPU. But what about the GPU-side? Is it L3 for the GPU?

Another question: If we allocate memory using cudaMalloc(), whenever accessing this memory from GPU (in CUDA application), do we go through GPU L2 → LPDDR5 or GPU L2 → system-level cache → LPDDR5?
Is there any way to do so if not going through the system-level cache?

I would be happy if someone with knowledge clarified these.

Best. Thanks in advance.

carolyuu · November 21, 2024, 2:00am

Hi,
Here are some suggestions for the common issues:

1. Performance

Please run the below command before benchmarking deep learning use case:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

2. Installation

Installation guide of deep learning frameworks on Jetson:

TensorFlow: Installing TensorFlow for Jetson Platform - NVIDIA Docs
PyTorch: Installing PyTorch for Jetson Platform - NVIDIA Docs
We also have containers that have frameworks preinstalled:
Data Science, Machine Learning, AI, HPC Containers | NVIDIA NGC

3. Tutorial

Startup deep learning tutorial:

Jetson-inference: Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson
TensorRT sample: Jetson/L4T/TRT Customized Example - eLinux.org

4. Report issue

If these suggestions don’t help and you want to report an issue to us, please attach the model, command/step, and the customized app (if any) with us to reproduce locally.

Thanks!

rnaza005 · November 21, 2024, 2:02am

Hi,

I don’t think this answer is related.

Best.
Regards.

rnaza005 · November 22, 2024, 7:07am

Any help?

AastaLLL · December 4, 2024, 5:44am

Hi,

Sorry for the late update.

You can find the memory management info in the link below:

The memory allocated by cudaMalloc is cached on the GPU side.
However, we cannot disclose more details about cache handling here.

Thanks.

system · December 31, 2024, 7:13am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
GPU Compute and memory benchmarks for Jetson AGX Orin Jetson AGX Orin performance	7	94	December 12, 2024
Orin System Cache Jetson AGX Orin documentation	4	1752	August 30, 2022
Question: AGX orin SOM CPU and GPU Memory assignment Jetson AGX Orin	2	529	July 17, 2023
Confused about memory bandwidth Jetson Orin NX cuda , kernel	5	2231	May 3, 2023
Programmatically limiting GPU access Jetson Orin NX gpu-computing	4	43	July 29, 2024
Memory bandwidth on Orin Jetson AGX Orin	9	972	March 15, 2024
Integrated GPU cache coherence on Orin Jetson Orin Nano cuda	4	927	August 23, 2023
Jetson AGX Orin CUDA IPC Support Jetson AGX Orin cuda , docker , pytorch	7	2042	July 29, 2022
Graphics memory related issues Jetson AGX Orin ai-training	3	752	January 18, 2024
Memory Architecture Differences in x86 and SoC GPUs Jetson Orin Nano cuda , kernel	2	1113	July 25, 2023

JetsonAGX Orin: System-level Cache

1. Performance

2. Installation

3. Tutorial

4. Report issue

Related topics