This GitHub repository hosts the code and resources for a project that analyzes Steam user behavior data and builds a recommendation system using various techniques. The project is designed to explore and visualize the dataset, generate recommendations, and evaluate the effectiveness of different recommendation methods.
- Loads and preprocesses a dataset of Steam user behaviors.
- Cleans missing values and removes irrelevant columns.
- Provides summary statistics and insights.
Bar plots of:
- The top 10 most played games.
- Games with the highest average playtime.
Implements three recommendation approaches:
- Collaborative Filtering: Recommends games based on cosine similarity of user behaviors.
- Association Rule Mining: Identifies frequent itemsets using Apriori and computes association rules.
- Matrix Factorization: Applies Singular Value Decomposition (SVD) to recommend games.
Evaluates recommendation methods using:
- Precision
- Recall
- Mean Average Precision (MAP)
- Python 3.12.4 or higher.
- Jupyter Notebook for running the .ipynb file.
The project uses the steam-200k.csv dataset, which contains Steam user behavior data, including user interactions such as purchases and playtime for various games. The dataset is loaded using pandas.
Dataset File: steam-200k.csv
Delimiter: ;
- Clean and explore the data.
- Visualize game statistics.
- Build and evaluate recommendation systems.
Visualizations:
- Top 10 most played games.
- Top 10 games with the highest average playtime.
Recommendations for users and games using three techniques.
Evaluation metrics (Precision, Recall, MAP) for all methods.