Trouble with reinforcement learning task in Isaac Lab

pezzzas.work · August 21, 2024, 9:12pm

Hello,

Recently I managed to train neural networks to balance a double pendulum (including the swing up) using a naive and very simple evolutionary algorithm. I now want to compare training speed and results with more modern and robust RL algorithms such as the ones provided with Isaac Lab on the exact same task. I modified the cartpole example to turn it into a double pendulum and modified the reward terms accordingly.

However after a few hours of training no solution was found (using rl_games and skrl with 8192 envs).
I then tried to use a easier setup by using a very low gravity and high rotational damping on the poles but it was also unsuccessful.

I am currently unsure if the problem comes from

the reward function
the setup
the task in itself being too challenging
or if I just need to increase the number of envs or training time

Any insight would be very helpful!

Thanks,
Pezzza

ToddT · August 22, 2024, 7:09am

Hi @pezzzas.work ,

Thank you for your post.
Could you please share your modification/patch about the cartpole so we can try it on our end?

Thanks,
-Todd

pezzzas.work · August 22, 2024, 5:58pm

Thank you for your quick response.

I created a fork of the repo and put my changes in a branch called “double_pendulum_older” that you can checkout here https://github.com/johnBuffer/IsaacLab/tree/double_pendulum_older

I run the training using this command (replacing skrl with rl_games depending on the algorithm to test):

isaaclab.bat -p source\standalone\workflows\skrl\train.py --task Isaac-Cartpole-v0 --num_envs 4096 --headless

If you need any additional information don’t hesitate to ask!

ToddT · August 27, 2024, 2:32pm

Hi @pezzzas.work ,

The good news, we’ll have double pendulum task in the next release of isaac lab.
You can compare the implementation then.

Thanks,
-Todd

pezzzas.work · August 27, 2024, 2:35pm

Hi Todd,

A few questions regarding this information:

Do you know if this includes the swing up?
Do you know when this next release will be available?

Thanks,
Jean

ToddT · September 2, 2024, 2:15am

Hi @pezzzas.work ,

Do you know if this includes the swing up?

No, it doesn’t include swing up.

Do you know when this next release will be available?

It’s about the 2nd of September.

Thanks,
-Todd

pezzzas.work · September 2, 2024, 7:27am

Hi Todd,

Thank you for your response. I will try the new release when it will be out and see if I can manage to add the swing up then.

Thanks,
Jean

system · September 16, 2024, 7:27am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Use different learning algorithms than PPO Isaac Gym	5	3124	February 20, 2022
IsaacSim 2021.2.1 RL example Isaac Sim rl	2	737	April 5, 2024
Open AI Gym + ISAAC SIM Isaac Sim	10	3192	April 5, 2024
IsaacGymEnvs: Cartpole task reward edit Isaac Gym	4	1346	March 22, 2022
Isaac sim reinforcement learning Isaac Sim	15	2778	December 1, 2023
Multi-agent RL in isaacgym Isaac Gym	0	627	February 27, 2022
App_isaacsim/rl_samples.html - RL sample link is broken General Discussion	2	581	January 4, 2022
Is it possible to do Reinforcement learning research with isacc sim? Isaac Gym	1	469	December 1, 2021
Unable to train multi environment robot Isaac Sim isaacsim , gym	8	2730	December 28, 2022
Applying DQN or any DRL algorithm to kuka_bin.py or franka_cube_ik.py Isaac Gym pytorch , python , reinforcement-learning , gym	0	775	February 9, 2022

Trouble with reinforcement learning task in Isaac Lab

Related topics