Releases: yycho0108/CartPole_tf
Releases · yycho0108/CartPole_tf
Stable working version
First stable-ish working version of DDRQN with engineered reward,
but with state penalty (no velocity information given).
Auto-generated parameters:
- TAU : 0.001
- N_TRACE : 8
- N_X : 4
- DESCRIPTION :
- HS : [32, 64]
- LR_DECAY_STEPS : 10000000
- N_BATCH : 32
- EPS_MIN : 0.05
- EPS_ANNEAL : 2000000
- N_TEST : 200
- N_PRE : 100000
- N_MEM : 10000
- LR_MAX : 0.0001
- N_H : 64
- LR_MIN : 1e-05
- U_FREQ : 64
- EPS_DECAY : 0.999998502135
- EPS_INIT : 1.0
- N_A : 2
- GAMMA : 0.99
- N_EPOCH : inf