- 실습 교육 내용:
- DQN
- 실습 내용:
- Prioritized Experience Replay (DQN+) [Schaul et al. ICLR 2016]
- Dueling network architectures for deep reinforcement learning (DQN++) [Wang et al. ICML 2016]
- 실습 교육 내용:
- REINFORCE
- Vanilla Policy Gradient (VPG) in the RLLAB library
- 실습 내용:
- Deep Deterministic Policy Gradient (DDPG) [Lillicrap et al. ICLR 2016]
- 실습 교육 내용:
- Generative Adversarial Imitation Learning (GAIL) [Ho et al. NIPS 2016]
- 실습 내용:
- Hyperparameter 조절해가면서 직접 실험.