RL Performance drop as the number of environment increases

dong-jin.kim · March 22, 2023, 4:47pm

Hello,
I’ve been working with this custom environment I created.
When I test this environment with Stable Baselines3 or even on IsaacGym with a single instance of the environment, the optimal policy is found and it quickly converges to the end goal.

However, when I run multiple environments distributed over multiple gpus, the performance drops significantly in terms of actor/critic convergence, and the rewards it collects.
I am not even talking about 1000+ environments, even with 3-6 parallel environments on 3 GPU nodes, the policy does not converge and reward it accumulates is roughly only a half of what the single environment agent collects.

I’ve created SAC agent with multi gpu support (pytorch distributed), and currently the model params are synchronized before the optimizer.step() is called and after the loss backward is calculated. The params are synced via reduced sum ops.

Or does the reward function defined for single environment need to be modified to suit for multi environments? I’ve tested with reward function too, and it does change the learning curves but again no convergence with multiple parallel enviromnents.

Any similar experiences and insights will be really appreciated! Thanks

vmakoviychuk · April 9, 2023, 10:57pm

Hello @dong-jin.kim ,

It’s hard to say what could be wrong in your case or how to debug it as it could be related to SB3 implementation details. I have one general comment - there is no need in multi-gpu training if you are running less than 1K env per GPU. If you are running only 3-6 envs per GPU across 3 GPUs it might make sense to debug first on a single GPU with 9-18 envs or more. Also you could find useful to look into SAC training examples in isaacgymenvs.

Topic		Replies	Views
Based on Custom RL Example using Stable Baselines , multiple envs wrong! Isaac Gym	7	2014	October 5, 2023
Unable to train multi environment robot Isaac Sim isaacsim , gym	8	2929	December 28, 2022
Issue with Multi-GPU Support in IsaacGymEnvs+SKRL Isaac Gym	0	453	October 24, 2023
Possible to running multiple gym environments in parallel? Isaac Sim	2	692	April 5, 2024
Poor performance of Soft Actor Critic (SAC) in OmniverseIsaacGym Isaac Sim	1	822	June 1, 2024
How to train multiple environments with RL in one scene at the same time? Isaac Sim	3	480	April 5, 2024
Run Isaac gym on multiple machines' GPUs in parallel Isaac Gym	3	1055	June 7, 2022
How parallel RL process works in ISAAC gym? Isaac Gym	0	523	December 19, 2022
Multiple isaac-sim containers on one GPU fails with CUDA illegal memory access in [omni.physx.tensors.plugin] Isaac Sim	5	2375	November 17, 2023
Can Isaac Gym support multi-agent reinforcement learning? Isaac Gym	12	3231	December 19, 2023

RL Performance drop as the number of environment increases

Related topics