Othello Player. Reinforcement learning with q-Table

In Train function:

A model playes itself, (always as black using state inversion). Epsilon Greedy with a decay function ensures sufficient exploration outside the previously learnt policy. A Heuristic evaluation function is used after every move to update the q-table.

In Evaluate function:

The model, using a specified q-table, playes against a random player.
Win rate and win/loss ratio are recorded.

Flask

A flask app presents a web page game where the user can play a model, using a predetermined Q-table. The baord is updated with AJAX.

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
ZZZ		ZZZ
qtables		qtables
static		static
templates		templates
test		test
+FETCH VARS		+FETCH VARS
+TODO.MD		+TODO.MD
.gitignore		.gitignore
README.md		README.md
_evaluate.py		_evaluate.py
_playterminal.py		_playterminal.py
_runs.py		_runs.py
_train.py		_train.py
app.py		app.py
evaluate.py		evaluate.py
othello.py		othello.py
requirements.txt		requirements.txt
restart-othello.sh		restart-othello.sh
train fn steps.md		train fn steps.md
train.py		train.py
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Othello Player. Reinforcement learning with q-Table

In Train function:

In Evaluate function:

Flask

About

Releases

Packages

Languages

nimchimpski/Othello-Q-learning

Folders and files

Latest commit

History

Repository files navigation

Othello Player. Reinforcement learning with q-Table

In Train function:

In Evaluate function:

Flask

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages