I’m trying to follow a standalone Reinforcement Learning example, but I ran into a problem.
I launched the Python files as the documentation describes, but my results differ from the ones it shows.
The figures below show the training parameters, and I think the RL training is not running properly.
I ran the training 5 times on two different workstation environments, but in every run the agent behaves as in this video.
How can I fix this issue and get my agent to learn successfully?
Thank you!