Sentiment Analysis of Tokopedia Tweets using Multinomial Naive Bayes & TF-IDF

This repository contains a Jupyter Notebook program for classifying tweet data from Twitter using the Multinomial Naive Bayes algorithm and TF-IDF, utilizing the Scikit-learn library.

📂 Folder Structure

📁 `Kode` (Code)

This folder contains the scripts used for my thesis, with the following workflow:

Data Crawling – Collecting tweet data
Data Preprocessing & Visualization – Cleaning and analyzing data
NaN Detection (Optional) – Handling missing values caused by formatting issues (e.g., missing commas)
Multinomial Naive Bayes Classification – Training and evaluating the model

📁 `Data` (Dataset)

This folder includes three datasets:

Dataset – The raw data, manually labeled
Preprocessed Dataset – Cleaned data, without stemming
Preprocessed + Stemmed Dataset – Cleaned data with stemming applied

🚀 Feel free to explore and contribute!

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Projek Skripsi Upload		Projek Skripsi Upload
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis of Tokopedia Tweets using Multinomial Naive Bayes & TF-IDF

📂 Folder Structure

📁 `Kode` (Code)

📁 `Data` (Dataset)

About

Releases

Packages

Languages

dewiidda/Sentimen-Analysis-Tokopedia-Multinomial-Naive-Bayes-TFIDF

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis of Tokopedia Tweets using Multinomial Naive Bayes & TF-IDF

📂 Folder Structure

📁 Kode (Code)

📁 Data (Dataset)

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

📁 `Kode` (Code)

📁 `Data` (Dataset)

Packages