Actor network output increases to 1, TORCS, TF 1.0.0 #11

Amir-Ramezani · 2017-04-03T07:53:46Z

Hi,

Thanks for your code.

I tried to use it for training TORCS, however, my result are not good and to be specific after a few steps, actions generated by Actor network increases to 1. and stay there. Similar to the following (for the top 10 for example):

[[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]
[ 1. 1. 1.]]

Gradients for that set:
[[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]
[ 4.80426752e-05 1.51122265e-04 -1.96302353e-05]]

I suspect the problem is some where around the following line:

Combine the gradients here

self.actor_gradients = tf.gradients(self.scaled_out, self.network_params, -self.action_gradient)

Could you tell me what do you think is the problem?

I am using tf 1.0.0 CPU version.

Thanks

RICEVAGUE · 2019-03-28T07:40:34Z

Hi!
I am very interested in this issue.
so, could you tell me the details of your solution?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Actor network output increases to 1, TORCS, TF 1.0.0 #11

Actor network output increases to 1, TORCS, TF 1.0.0 #11

Amir-Ramezani commented Apr 3, 2017

RICEVAGUE commented Mar 28, 2019

Actor network output increases to 1, TORCS, TF 1.0.0 #11

Actor network output increases to 1, TORCS, TF 1.0.0 #11

Comments

Amir-Ramezani commented Apr 3, 2017

Combine the gradients here

RICEVAGUE commented Mar 28, 2019