Severe slowdown of play time in grasping task


I am training a grasping task with the ASE approach. A hand should learn to grasp an object.
The play time increases over time, probably because the hand learns to make contact with the object, which produces more contacts to simulate.
But then the play time suddenly spikes to 3h+ per epoch, making training impossible.

What could be the reasons for this? Maybe unstable simulation parameters?
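
One way to narrow this down might be to record the play time of every epoch and find the exact epoch where the spike starts, so it can be compared against contact counts or simulation settings at that point. Below is a minimal sketch in plain Python. The function name `find_spikes`, the `window` and `factor` parameters, and the sample timings are hypothetical and are not part of ASE or the simulator.

```python
from statistics import median


def find_spikes(epoch_times, window=5, factor=5.0):
    """Return indices of epochs whose play time is far above the recent baseline.

    epoch_times: per-epoch play times in seconds (hypothetical logged values).
    window:      number of preceding epochs used as the baseline.
    factor:      how many times the baseline median counts as a spike.
    """
    spikes = []
    for i in range(window, len(epoch_times)):
        # Median of the previous `window` epochs is robust to gradual growth.
        baseline = median(epoch_times[i - window:i])
        if epoch_times[i] > factor * baseline:
            spikes.append(i)
    return spikes


# Made-up timings: slow growth, then a jump to 3 hours (10800 s).
example_times = [60, 62, 65, 70, 75, 80, 10800]
spike_epochs = find_spikes(example_times)
print(spike_epochs)
```

A median baseline is used here so that the slow, expected growth in play time is not flagged, only the sudden jump.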