Nokia original game:
- Max: Score / Max_Score
  - Eat the maximum number of apples
  - Until the game ends or is won (max apples = rows * cols - snake.init_len)
- Min: Steps / Score
  - Reach each apple in the minimum number of time-steps
This amounts to finding the shortest path in a time-dependent graph.
Partly stochastic: the next apple's location is unknown and random.
- State space: TODO
- Action space: TODO
- Transitions: deterministic (vs. probabilistic), i.e. P(s, a, Succ(s, a)) = 1 (see the sketch after this list)
- Rewards: TODO
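A minimal sketch of such a deterministic transition, assuming an illustrative `State` type and grid actions; collision and apple-respawn handling are omitted, and none of these names are the repo's actual API:

```python
from dataclasses import dataclass
from typing import Tuple

ACTIONS = {"UP": (-1, 0), "DOWN": (1, 0), "LEFT": (0, -1), "RIGHT": (0, 1)}

@dataclass(frozen=True)
class State:
    snake: Tuple[Tuple[int, int], ...]  # body cells, head first
    apple: Tuple[int, int]

def succ(state: State, action: str) -> State:
    """Deterministic successor: P(s, a, Succ(s, a)) = 1. The head advances one
    cell; the tail cell is kept only when the apple is eaten (growth). Apple
    respawn, the only random element, happens outside this function."""
    dr, dc = ACTIONS[action]
    head = (state.snake[0][0] + dr, state.snake[0][1] + dc)
    grew = head == state.apple
    body = state.snake if grew else state.snake[:-1]
    return State(snake=(head,) + body, apple=state.apple)
```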
The game can be modified for the competition:
| Parameter | Nokia | Snake Competition |
|---|---|---|
| Board size | _ rows x 15 cols | 30 rows x 30 cols |
| Snake starting length | 3 body parts | 1 body part |
| Snake starting position | Top center | Random |
| Snake starting direction | Top-down | n/a |
- The Game Completion Score (GCS) (see the computation sketch after this list)
  - Average score over the maximum score (score = number of apples eaten)
  - Objective: maximise (up to 100%)
- The Game Over Rate (GOR)
  - Number of game-overs / total games played
  - Objective: minimise
- The Performance Rate (PR)
  - Average (number of steps / score)
  - Objective: minimise
  - Max: n^2
- The Performance Rate at 0% and at 1% (PR0 and PR1)
  - PR0: the Performance Rate with a 0% game-over rate (all games won)
  - PR1: the Performance Rate with a game-over rate < 1%
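A hedged sketch of computing GCS, GOR, and PR from per-episode records; the record field names (`score`, `steps`, `game_over`) are assumptions, not the repo's data format:

```python
from statistics import mean

def competition_metrics(episodes, max_score):
    """episodes: list of dicts with keys "score", "steps", "game_over"."""
    gcs = mean(e["score"] / max_score for e in episodes)         # maximise
    gor = sum(e["game_over"] for e in episodes) / len(episodes)  # minimise
    # PR is only defined over episodes that scored at least one apple.
    pr = mean(e["steps"] / e["score"] for e in episodes if e["score"] > 0)
    return {"GCS": gcs, "GOR": gor, "PR": pr}
```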
- A. Operational Research
  - A1. Search
    - Pathfinding algorithms, such as Dijkstra and A* (see the sketch after this list)
    - To research: pathfinding in time-dependent graphs
  - A2. Optimization
    - Method: Linear Programming
  - A3. Genetic algorithms
    - NEAT algorithm with neural networks
- B. Machine Learning
  - B1. Supervised Learning / Deep Learning
    - Methods: Neural Networks (NN), Deep Neural Networks (DNN), Convolutional Neural Networks (CNN)
  - B2. Reinforcement Learning (RL)
    - Q-value iteration
    - Deep Q-Network (DQN)
    - Further: Advantage Actor-Critic (A2C), Proximal Policy Optimization (PPO), Monte Carlo Tree Search
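A minimal A* sketch on the snake grid with a Manhattan-distance heuristic, treating the current snake body as blocked cells. All names here are illustrative; a time-dependent variant would additionally model tail cells freeing up as the snake advances:

```python
import heapq

def a_star(start, goal, blocked, rows=30, cols=30):
    """Shortest path from start to goal avoiding `blocked` cells. Returns the
    list of cells from start to goal, or None if the goal is unreachable."""
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan heuristic
    frontier = [(h(start), 0, start, [start])]  # (f, g, position, path)
    seen = set()
    while frontier:
        _, g, pos, path = heapq.heappop(frontier)
        if pos == goal:
            return path
        if pos in seen:
            continue
        seen.add(pos)
        r, c = pos
        for nxt in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if (0 <= nxt[0] < rows and 0 <= nxt[1] < cols
                    and nxt not in blocked and nxt not in seen):
                heapq.heappush(frontier, (g + 1 + h(nxt), g + 1, nxt, path + [nxt]))
    return None
```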
Game features:
- General
  - Basic game logic
  - Human agent (keyboard)
  - Create GUI using pygame
  - Refactor code using OOP
  - Separate Engine and Agent logic
  - Refactor Session/Engine class for the OpenAI Gym interface Env() (see the sketch after this list)
  - Refactor Agent
- History
  - Record Episodes
  - Replay and Tests
  - Save env states and solutions
  - [ ]
- Benchmarking
  - [ ]
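A skeleton of the Env() refactor target, following the classic OpenAI Gym reset/step interface. The class name, board encoding, and delegation points are assumptions, not the repo's code:

```python
import gym
import numpy as np
from gym import spaces

class SnakeEnv(gym.Env):
    """Skeleton only: game logic would be delegated to the existing
    Session/Engine class."""

    def __init__(self, rows=30, cols=30):
        super().__init__()
        self.rows, self.cols = rows, cols
        self.action_space = spaces.Discrete(4)  # up, down, left, right
        # Cell encoding assumption: 0 = empty, 1 = snake, 2 = apple
        self.observation_space = spaces.Box(0, 2, (rows, cols), dtype=np.uint8)

    def reset(self):
        self.board = np.zeros((self.rows, self.cols), dtype=np.uint8)
        return self.board

    def step(self, action):
        # Delegate to the game engine here; placeholder returns a no-op.
        reward, done, info = 0.0, False, {}
        return self.board, reward, done, info
```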
A1. Pathfinding Algorithms:
- Add Greedy algorithm / agent
- Add A* algorithm / agent
- Add modes: Play and Benchmark
- Logging history
- Create snapshots of the game configuration/situation/..
- Replay of snapshots
- Rewind steps (manual) (see the history sketch after this list)
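One possible shape for snapshots and manual rewind: a stack of deep-copied states. Class and method names are illustrative assumptions:

```python
import copy

class History:
    """Stack of game-state snapshots supporting replay and manual rewind."""

    def __init__(self):
        self._stack = []

    def snapshot(self, state):
        self._stack.append(copy.deepcopy(state))

    def rewind(self, steps=1):
        """Drop the last `steps` snapshots; return the state now on top."""
        del self._stack[-steps:]
        return self._stack[-1] if self._stack else None
```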
A3. Evolution/Genetic algorithms selecting NNs:
- [ ]
B1. Supervised Learning / Deep Learning:
- Generating training data (see the sketch below)
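A hedged sketch of one way to generate training data: roll out an expert agent (e.g. the A* agent above) and record (observation, action) pairs. `generate_dataset`, `expert`, and the env interface are assumptions:

```python
def generate_dataset(env, expert, episodes=100, max_steps=2_000):
    """Roll out `expert` in `env`, recording (observation, action) pairs."""
    X, y = [], []
    for _ in range(episodes):
        obs = env.reset()
        for _ in range(max_steps):  # cap rollout length
            action = expert(obs)    # e.g. next move along the A* path
            X.append(obs.copy())
            y.append(action)
            obs, _, done, _ = env.step(action)
            if done:
                break
    return X, y
```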
B2. Reinforcement Learning:
- Q-Learning (see the sketch after this list)
  - [ ]
- Deep Q-Learning
  - [ ]
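A minimal tabular Q-learning sketch against the Gym-style env above. Hashing the full flattened board as the state is intractable for a 30x30 grid, so a real agent would need a compact feature encoding; hyper-parameters and names are assumptions:

```python
import random
from collections import defaultdict

def q_learning(env, episodes=1000, alpha=0.1, gamma=0.99, eps=0.1,
               max_steps=10_000):
    """Tabular Q-learning; states are hashed as flattened board tuples."""
    n = env.action_space.n
    Q = defaultdict(lambda: [0.0] * n)
    for _ in range(episodes):
        s = tuple(env.reset().flatten())
        for _ in range(max_steps):
            if random.random() < eps:
                a = random.randrange(n)                   # explore
            else:
                a = max(range(n), key=lambda i: Q[s][i])  # exploit
            obs, r, done, _ = env.step(a)
            s2 = tuple(obs.flatten())
            # Standard Q-learning update rule
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = s2
            if done:
                break
    return Q
```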