Here is a quick summary of my work exploring Board Game Geek's data set.
For a more detailed report check out final_report.pdf
!
I found clear trends in both quantity and quality
I modeled the number of games produced each year as a Poisson with an underlying rate that changes over time as an exponential. Here it is the likelihood function for the data that I used in a stochastic search to fit the model.
I modeled the shifting proportion of high scoring games over time as a binomial with an underlying probability that is changing over time. I modeled the changing probabilities / proportions using an exponential. Below is the likelihood function that was used to fit the model in a stochastic search.
- Themes do tend to have significantly distinct ratings distributions. Warefare had the highest average score, and knowledge based games performed the worst
- Mechanics also had distinct distributions for ratings, and games with mechanics to discourage player interaction performed best
- Games that were funded by a Kickstarter campaign outperformed those which were not