Frontend

RL agent for gridworld problem

JavaScript implementation of a temporal-difference (TD) reinforcement learning agent that learns optimal paths on a gridworld. Inspired by the Reinforcement Learning Specialization.

To generate a new random gridworld, click "apply".

The number of bombs scales with the size of the gridworld.
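
As a rough illustration of how a bomb count could scale with grid size, here is a minimal sketch. The function name `bombCount` and the 15% density are assumptions made for this example; the actual ratio used by the project is not stated here.

```javascript
// Hypothetical helper: derive the number of bombs from the grid dimensions.
// The default density (15% of cells) is an assumed value, not the project's.
function bombCount(rows, cols, density = 0.15) {
  // Round to the nearest whole bomb and always place at least one.
  return Math.max(1, Math.round(rows * cols * density));
}
```

With this assumed density, a larger grid simply gets proportionally more bombs, so the difficulty stays roughly comparable across sizes.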

To train the agent, click "run RL". Training might take some time on slower devices or larger grids.

Implemented a gridworld problem with some obstacles (bombs are bad for the agent).
Implemented a SARSA agent with these parameters:
- ε-greedy policy (ε starts at 0.5 and decays over time)
- 1000 episodes
- no reward discounting (γ = 1)
- step size of 0.1
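
The parameters listed above map onto the core of a tabular SARSA agent roughly as follows. This is a minimal sketch, not the project's actual code: the function names, the Q-table layout (`q[state][action]`), and the linear decay schedule are all assumptions made for illustration. Only the starting ε of 0.5, the 1000 episodes, the step size of 0.1, and the absence of discounting come from the list.

```javascript
// Hypothetical schedule: the decay shape is not specified in this README,
// so a linear decay from 0.5 towards 0 over all episodes is assumed.
function decayedEpsilon(episode, totalEpisodes = 1000, epsilonStart = 0.5) {
  return epsilonStart * (1 - episode / totalEpisodes);
}

// ε-greedy action selection over one row of the Q-table.
// `rand` is injectable so the behaviour can be tested deterministically.
function epsilonGreedy(qRow, epsilon, rand = Math.random) {
  if (rand() < epsilon) {
    // Explore: pick a uniformly random action index.
    return Math.floor(rand() * qRow.length);
  }
  // Exploit: pick the action with the highest estimated value.
  let best = 0;
  for (let a = 1; a < qRow.length; a++) {
    if (qRow[a] > qRow[best]) best = a;
  }
  return best;
}

// One SARSA update:
//   Q(s, a) <- Q(s, a) + alpha * (r + gamma * Q(s', a') - Q(s, a))
// alpha = 0.1 is the step size; gamma = 1 means no discounting.
function sarsaUpdate(q, s, a, reward, sNext, aNext, terminal, alpha = 0.1, gamma = 1.0) {
  // A terminal transition has no bootstrap term.
  const target = terminal ? reward : reward + gamma * q[sNext][aNext];
  q[s][a] += alpha * (target - q[s][a]);
  return q[s][a];
}
```

SARSA is on-policy: the next action `aNext` used in the update is the one the ε-greedy policy actually selects, so the learned values account for the exploration the agent still performs near bombs.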

The Open Web Components library is used for the frontend. No specific reason for it, just wanted to give it a try :)
