Wellness-Watch

Machine learning models to perform sentiment analysis for mental health statuses.

Training and testing data can be found in kaggle-sentiment-data.csv.

Baseline Model

The original model created by Muhammad Faizan, a researcher on Kaggle, can be found in sentiment-analysis-for-mental-health-nlp.ipynb.

This model utilizes logistic regression classification. However, this model was discovered to have data leakage into the test set due to data augmentation occuring before the data was split into train and test sets. A new baseline model was created to move data augmentation after the data is split into train and test sets in order to prevent data leakage into the test set. This model can be found in baseline_sentiment-analysis-for-mental-health-nlp.ipynb.

Students' Models

Logistic Regression (improved)

An improved model can be found in logistic_regression.ipynb.

This model improves upon the baseline model by removing data augmentation, changing hyperparameters, stratifying the status in train-test split, and making slight modifications to preprocessing.

Random Forest

The model can be found in random_forest.ipynb.

Naive Bayes Classifier

The model can be found in naive_bayes_classifier.ipynb.

Gradient Boosting Machine

The model can be found in gradient_boosting_machine.ipynb which runs GridSearchCV to find the best parameters for the model.

The model found in gradient_boosting_machine_best_params.ipynb uses the parameters obtained from gradient_boosting_machine.ipynb to get the best accuracy for this model.

Support Vector Machine

The model can be found in support_vector_machine.ipynb which runs GridSearchCV to find the best parameters for the model.

The model found in support_vector_machines_best_params.ipynb uses the parameters obtained from support_vector_machine.ipynb to get the best accuracy for this model.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
Wellness-Watch-Final-Report		Wellness-Watch-Final-Report
Wellness-Watch_pngs_Midterm-Report		Wellness-Watch_pngs_Midterm-Report
.DS_Store		.DS_Store
README.md		README.md
bert_test.ipynb		bert_test.ipynb
processed_data.csv		processed_data.csv
sentiment-analysis-for-mental-health-nlp_Midterm_Report.html		sentiment-analysis-for-mental-health-nlp_Midterm_Report.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wellness-Watch

Baseline Model

Students' Models

Logistic Regression (improved)

Random Forest

Naive Bayes Classifier

Gradient Boosting Machine

Support Vector Machine

About

Releases

Packages

Contributors 3

Languages

The-Girlies/Wellness-Watch

Folders and files

Latest commit

History

Repository files navigation

Wellness-Watch

Baseline Model

Students' Models

Logistic Regression (improved)

Random Forest

Naive Bayes Classifier

Gradient Boosting Machine

Support Vector Machine

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages