Question about Orbit RL environment

berternats · April 28, 2023, 10:48am

The following picture is code from lift_env.py (Orbit/lift_env.py at 8e42f0574792f971a44459a061dd3704cac38bb1 · NVIDIA-Omniverse/Orbit · GitHub).

I have three questions regarding the highlighted part:

How do we know the robot’s joints reach the target position? Should not we do an if else statement to check if it reaches the target position before proceeding to the next step like computing reward?
What is the meaning of the loop here?
What is the meaning of decimation? In the documentation, it says it is the Number of control action updates @ sim dt per policy dt, but I do not really understand it.

Thank you.

mmittal · May 2, 2023, 1:45pm

Hi @berternats

The IK employed here is a differential inverse kinematics solver which provides delta joint positions for the arm to move to. Given the nature of this method, these delta joint positions are relatively small and it should be reasonable to track them under small number of simulation steps. The same solver has been used priori in the Factory work and it seems to be sufficient.

(2) and (3) kind of go together. The idea is that different controllers run at different frequencies. In this case there are three different controllers:

Learning policy (outermost) — X Hz
IK solver (middle) — Y Hz
Joint level controller (low-level) — Z Hz
Physics simulation — Z Hz (usually)

Control decimation is the formal way of saying how many steps of low-level per step of high-level. Typically the physics simulation and joint control are set to the same frequency so we don’t consider that here.

For instance, let’s say the simulation dt is 1 / 100 s (Z=100 Hz). The low-level joint control typically happens at this frequency. However, the IK control happens at a lower frequency (to ensure tracking), i.e. (Y=Z / (decimation)). If decimation is 2 then IK is happening at 50 Hz. In this particular environment, we have learned policy working at IK frequency (i.e. X = Y) so you don’t see another for-loop at an outer level.

I hope this clarifies the doubt.

berternats · May 3, 2023, 9:14am

Thank you so much for the detailed explanation.

So, normally, if the action delta joint positions are big such as randomly picking joint positions, we should make sure they reach the target positions before proceeding to compute reward, am I right?

system · October 12, 2023, 4:56pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Question about Orbit RL environment Isaac Sim	2	316	July 11, 2023
Position is not correct in Isaac Sim Isaac Sim isaacsim , articulation	4	939	April 5, 2024
End_effector position can not converged to desired position when i control robot by InverseKinematicsSolver with RL reach task Isaac Sim	4	334	February 2, 2024
Simulation speed Orbit RL Environment Isaac Sim	0	232	March 1, 2024
Simulation disrespecting joint limits Isaac Sim physx , joints , articulation	17	1216	April 10, 2024
Updating OmniKit when using OpenAI Gym and Isaac Sim Isaac Sim rl , gym	2	1311	April 5, 2024
Orbit DifferentialIKController for velocity control Isaac Sim python	0	592	January 29, 2024
Assistance Needed with Sequential Movement in Robot Simulation Isaac Sim python	4	367	March 4, 2024
OmniIsaacGymEnvs : skrl for iiwa reaching task Isaac Sim	3	386	July 10, 2023
The issue of abnormal joint movement in the legged robot in Isaac Lab Isaac Sim python , isaac-sim-v4-2-0	2	30	December 20, 2024

Question about Orbit RL environment

Related topics