Training on BipedalWalkerHardcore seems to result in a negative reward #7

kirk86 · 2018-10-09T16:57:33Z

Hi and thanks for sharing the code.
I tried running the training process on a different environment, BipedalWalkerHardcore-v2, but the agent does not seem to learn anything. I also tried different shift values, as noted in the code comments, but I still end up with a negative reward. Should we train for longer, or are there any hyperparameters that we are missing?

ar8372 · 2022-06-25T07:11:49Z

Hey @kirk86, I am having a similar issue. Did you solve it?
Please take a look at this thread for my exact issue.
