Detecting Fake Reviews using Semi-Supervised Learning from the Yelp Restaurant Reviews Dataset
-
Code
- main.py --> Main Python File containing the code for the entire project
- FakeReviewDetection.sh --> Script File to Run main.py.
(This file installs all the libraries required for running the project using pip3 and runs the code using python3. If using a different version, please change the command from pip3 to pip/pip2 and python3 to python depending on your version.)
-
Data
- df.csv --> This is a cleaned, pre-processed and feature engineered version of the original dataset. (Provided for reference)
-
Evals --> Contains Screenshots of various outputs and test runs of both the ML models.
- Clone the github repository onto your desktop.
- Download the original dataset from the the following link (936MB). (Unable to upload this file as Github does not support uploading large files.)
- Copy the dataset .db file in the following directory.
Fake-Review-Detection/Data
- Open terminal inside the Code directory.
- Enter the following command to give permissions to FakeReviewDetection.sh file.
chmod 777 FakeReviewDetection.sh
- Run the script file using the following command.
./FakeReviewDetection.sh