In this project we selected a heart stroke dataset and an email spam dataset and developed machine learning prediction systems for each, using a number of well-known data mining techniques: data preprocessing, data cleaning, and classification with predictors such as linear regression, KNN, and Random Forests, as well as more advanced ones such as neural networks. In the heart stroke dataset, we detected an imbalance in the prediction target, the class named "stroke": patients who did not have a stroke far outnumbered those who did. For that reason we used an oversampling technique named SMOTE to balance the dataset for that class, so that we could produce a trustworthy and useful prediction of stroke likelihood, which would otherwise have been impossible.
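To illustrate the idea behind SMOTE, here is a minimal from-scratch sketch in plain Python. It is not the code used in this project and not the implementation from any library (in practice the `imbalanced-learn` package provides a ready-made `SMOTE` class). The function name `smote_oversample`, its parameters, and the toy patient values are all assumptions made for illustration. The sketch creates synthetic minority samples by interpolating between a minority sample and one of its nearest minority neighbours.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority
    samples and their k nearest minority neighbours (the core SMOTE idea)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Pick a random point on the segment between base and neighbour.
        gap = rng.random()
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: two numeric features per patient (age, glucose level).
no_stroke = [(30.0, 85.0), (42.0, 90.0), (35.0, 100.0), (50.0, 95.0),
             (28.0, 80.0), (61.0, 110.0), (47.0, 105.0), (39.0, 92.0)]
stroke = [(67.0, 220.0), (72.0, 190.0), (80.0, 170.0)]

# Generate just enough synthetic "stroke" samples to match the majority class.
new_points = smote_oversample(stroke, n_new=len(no_stroke) - len(stroke), k=2)
balanced_stroke = stroke + new_points
print(len(no_stroke), len(balanced_stroke))  # prints: 8 8
```

Oversampling of this kind is normally applied only to the training split, so that the test set still reflects the original class distribution.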
Data Mining Techniques for Stroke and Email Spam Prediction
OperaDevelop07/Data-Mining-Techiques-for-Heart-Stroke-and-Email-Spam-Prediction