Using RL to set starting position of a body in the environment

ljkelm · January 27, 2022, 8:51pm

Good afternoon,

I am starting to sink my teeth into using Isaac Gym for reinforcement learning. I have been looking at the cartpole and other examples, and I have a question about what the program can manipulate in order to increase the reward. Rather than having the RL adjust the forces applied to the cart to balance the pole, I would like it to figure out the starting position of the pole that keeps it balanced, selecting a value between 0 and pi in increments of 0.1.

The simulation has a num_actions value of 1, which seems to correspond to the force applied to the cart, and the RL appears to create/control the actions tensor associated with this force throughout the simulation. Is there a way to have the RL create/control a position at simulation reset rather than applying forces/setting position targets during the simulation?

kellyg · January 28, 2022, 3:41pm

Hi there,

Yes, this should be possible. In your pre_physics_step function, you can pass your actions into the reset function and set the position targets there instead of applying them as forces. You can also modify the num_actions value accordingly depending on the dimension of actions you require.
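The flow described above can be sketched roughly as follows. This is a toy stand-in in plain Python, not the Isaac Gym API: the class `ToyResetActionTask`, the `pole_angle` list and the `actions` argument added to `reset_idx` are all hypothetical names chosen for illustration. It only shows the idea of routing the actions from `pre_physics_step` into the reset instead of turning them into forces.

```python
class ToyResetActionTask:
    """Hypothetical stand-in for a vectorised task (not the Isaac Gym API)."""

    def __init__(self, num_envs):
        self.num_envs = num_envs
        self.num_actions = 1                  # one value per env: the start angle
        self.pole_angle = [0.0] * num_envs    # stand-in for the pole DOF state
        self.reset_buf = [True] * num_envs    # envs currently flagged for reset

    def pre_physics_step(self, actions):
        # Instead of converting actions into cart forces, forward them
        # to the reset for every env that is flagged for reset.
        env_ids = [i for i, flagged in enumerate(self.reset_buf) if flagged]
        if env_ids:
            self.reset_idx(env_ids, actions)

    def reset_idx(self, env_ids, actions):
        # Use the policy output as the starting pole angle of each reset env.
        for i in env_ids:
            self.pole_angle[i] = float(actions[i])
            self.reset_buf[i] = False


task = ToyResetActionTask(num_envs=3)
task.pre_physics_step([0.4, 1.5, 2.9])   # consumed: all envs were flagged
task.pre_physics_step([0.0, 0.0, 0.0])   # ignored: no env is flagged any more
```

In a real task the lists would be tensors and the reset would write the DOF state through the simulator; that part is left out here on purpose.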

ljkelm · January 31, 2022, 4:53pm

Ok, I did this by creating a global variable within the pre_physics_step function from the actions variable, which I believe is what the AI interfaces with. I rounded it to the nearest 0.1 using torch.round and used this global variable within the reset functions, as you had directed. The AI found the correct position with the rewards that were set, and by the end of 1000 iterations all of the poles were spawning in the correct position to stay upright.
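As a rough illustration of the rounding step, the sketch below snaps a value to the 0.1 grid between 0 and pi. It is written in plain Python so it runs on its own; the function name `snap_start_angle` is hypothetical, and it assumes the raw value is already expressed in radians. Since torch.round rounds to whole numbers by default, the equivalent tensor arithmetic would presumably be something like `torch.round(x / 0.1) * 0.1` followed by a clamp.

```python
import math


def snap_start_angle(raw_angle, step=0.1, upper=math.pi):
    """Snap raw_angle to the nearest multiple of `step` within [0, upper]."""
    n_max = math.floor(upper / step)   # largest grid index that stays <= upper
    n = round(raw_angle / step)        # nearest grid index (ties go to even)
    n = max(0, min(n_max, n))          # keep the index inside the allowed range
    return n * step


# A value slightly off the grid is snapped; out-of-range values are clamped.
start = snap_start_angle(1.57)
below = snap_start_angle(-0.3)
above = snap_start_angle(5.0)
```

Working with integer grid indices rather than the angle itself keeps the clamp exact, so the result never overshoots pi because of rounding.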

As a follow-up question, it appears that pre_physics_step generates multiple tensors between each reset, as it is intended to be used to move the cart back and forth to balance the pole. Is there a place where you set the number of times this step is repeated between resets? It seems like I am generating a lot of data that isn’t going to be used anywhere, which is wasteful.
