Skip to content

A spam classification model which uses a sms spam classification dataset.

Notifications You must be signed in to change notification settings

harshit-saraswat/NLP-Spam-Classification

Repository files navigation

NLP-Spam-Classification

A spam classification model which uses a sms spam classification dataset from https://archive.ics.uci.edu/ml/datasets/sms+spam+collection

The model displays the use of nlp techniques such as Porter Stemming, WordNet Lemmatization, Tokenization, Stopwords Removal, Bag of Words Model and TF-IDF Model.

The model has pretty good accuracy though due to imbalanced dataset there might be some errors. We can fix this by upsampling or downsampling the dataset in further edits.

The model uses a Multinomial Naive Bayes Classifer for making predictions.

Thanks.

About

A spam classification model which uses a sms spam classification dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published