How parallel RL process works in ISAAC gym?

Hello,
I wanted to get an idea on how parallelism works with PPO?
Does each environment running in parallel update the policy sequentially?

1 Like