Movie Recommendation System

A movie recommendation system using collaborative filtering with Surprise and PySpark and a content-based kNN system. The collaborative filtering models recommend movies based on user preferences, leveraging past ratings to find similar users and suggest unseen movies. The kNN model uses keywords and genres to find movies similar to a user’s selected titles, offering quick suggestions based on content. The system includes a Streamlit frontend for movie search, selection, and recommendations.

Both approaches have distinct advantages:

Collaborative Filtering excels at personalization, using user behavior patterns to make recommendations that may surprise the user but fit their tastes.

kNN shines in cold-start situations, where no historical user data is available. It bases recommendations on the content of the movies themselves, ensuring users get relevant results even with no prior behavior data.

Features

Collaborative Filtering:
- SVD (Singular Value Decomposition) for collaborative filtering based on user ratings.
- ALS (Alternating Least Squares) for collaborative filtering, leveraging user ratings for recommendations.
Content-based kNN:
- Utilizes TF-IDF vectorization on movie genres and keywords.
- Implements k-Nearest Neighbors (kNN) to recommend similar movies based on user-selected movies.
Streamlit Frontend:
- Interactive user interface for movie search and selection.
- Displays selected movie tags and provides recommendations based on user inputs.
- Allows dynamic addition and removal of movie tags.

Installation

Clone the repository
Install the required packages:
```
pip install -r requirements.txt
```

Usage

Run the Streamlit App

To start the Streamlit frontend, use the following command:

streamlit run app.py

Streamlit interface

Data

The dataset used for this project is the MovieLens dataset which includes movie ratings and metadata. The following files are utilized:

ratings_small.csv: Contains user ratings for movies.
movies_metadata.csv: Contains metadata about movies.
links.csv: Maps movies to their TMDB identifiers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Movie Recommendation System

Features

Installation

Usage

Run the Streamlit App

Streamlit interface

Data

Files

README.md

Latest commit

History

README.md

File metadata and controls

Movie Recommendation System

Features

Installation

Usage

Run the Streamlit App

Streamlit interface

Data