Spring 2021 Machine Learning (CS 181) Homework 6
Solutions contained in the
personal-solutions
folder
- "Explaining Away" effect in Bayesian Network Inference
- Policy Iteration and Value Iteration in solving a Markov Decision Process (MDP)
- Reinforcement Learning and implementing a Q-learning agent
Implementation contained in the
code
folder
- Referred to as
T6_P2.py
in the specifications - Implements Policy Iteration and Value Iteration to find the optimal solution to a Markov Decision Process (MDP)
- Referred to as
stub.py
in the specifications - Implement a Q-learning agent to play a game that is similar to Flappy Bird