Deep-Q-Learning

Implementation of Deep Q-Learning and Double Deep Q-Learning algorithms for the Highway-env Gym environment.

Uses a Conv Net to approximate the Q-function. Supports stacking of frames to capture temporal information with an LSTM layer. In both algorithms, the model is trained using a replay buffer and target network to stabilize learning.

Files

dqn.ipynb: Deep Q-Learning notebook
ddqn.ipynb: Double Deep Q-Learning notebook

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
ddqn.ipynb		ddqn.ipynb
dqn.ipynb		dqn.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep-Q-Learning

Files

About

Releases

Packages

Languages

rodrigo-pedro/Deep-Q-Learning

Folders and files

Latest commit

History

Repository files navigation

Deep-Q-Learning

Files

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages