Hello everyone,
I am working on a project where I have two Jetson AGX Orin devices, and I want to combine them in such a way that they can share the GPU workload for computational tasks. Specifically, I would like to use both devices together to pool their GPU resources, allowing for more efficient processing of demanding workloads, such as AI inference or heavy computational tasks.
Is it possible to use both Jetson AGX Orin devices to share GPU resources directly for a single application or task? I understand that the devices have separate GPUs, but is there any way to have them work together to balance the GPU load?
I no longer have an orin nano to confirm this works, but 6 months back I believe the following worked for me as proof of concept test. I used python.
Create hosts file in your project directory, entering your true device IP addresses
192.168.1.10 slots=1 # IP of Jetson AGX Orin
192.168.1.11 slots=1 # IP of Jetson Orin Nano
on both Orins install:
sudo apt-get install -y libopenmpi-dev openmpi-bin
pip3 install cupy-cuda12x mpi4py
Set up passwordless SSH between the devices.
Try the test to have mpirun can run using both gpus.