Katerinafomkina/Machine-Learning-Summer-Training-Hackathon.-Classification-Problem

Machine-Learning-Summer-Training-Hackathon

This is a repository for the Machine Learning Summer Training Hackathon 2022 from Analytics Vidhya.

Summer Hackathon

The data come from the Machine Learning Summer Training Hackathon. Steps of my solution:

  1. Univariate analysis of features: check for normal distribution and outliers.
  2. Bivariate analysis of features: check for correlation between the features and the target variable.
  3. Check the correlation matrix.
  4. Impute missing values in 'education'.
  5. One-hot encoding for 'proof_submitted'.
  6. Label encoding for 'education'.
  7. Modelling. Models tried: Logistic Regression, Decision Tree Classifier, Random Forest Classifier, Bagging Classifier, XGBoost. My final model is the Random Forest Classifier, with a final macro f1_score of 0.524564 on the test data.
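
Steps 4 to 7 can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code. Only the column names 'education' and 'proof_submitted' come from this README. The 'age' and 'target' columns, the synthetic values, the most-frequent-value imputation, and the hyperparameters are all assumptions made so the example runs on its own.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder

# Synthetic stand-in data. Only 'education' and 'proof_submitted' are named in
# the README; 'age', 'target', and every value here are hypothetical.
rng = np.random.default_rng(0)
n = 200
education = pd.Series(rng.choice(["school", "graduate", "postgraduate"], size=n))
education[rng.random(n) < 0.1] = np.nan  # inject some missing values
df = pd.DataFrame({
    "age": rng.integers(18, 60, size=n),
    "education": education,
    "proof_submitted": rng.choice(["passport", "license", "other"], size=n),
    "target": rng.integers(0, 2, size=n),
})

# Step 4: impute missing 'education' values.
# Assumption: fill with the most frequent category (the README does not say how).
df["education"] = df["education"].fillna(df["education"].mode()[0])

# Step 5: one-hot encode 'proof_submitted' into indicator columns.
df = pd.get_dummies(df, columns=["proof_submitted"])

# Step 6: label-encode 'education' into integer codes.
df["education"] = LabelEncoder().fit_transform(df["education"])

# Step 7: fit a Random Forest and score it with macro-averaged F1.
X = df.drop(columns="target")
y = df["target"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)
macro_f1 = f1_score(y_test, model.predict(X_test), average="macro")
print(f"macro F1 on the held-out split: {macro_f1:.3f}")
```

Macro averaging computes F1 per class and takes the unweighted mean, so a minority class counts as much as the majority class. Because the labels here are random, the printed score says nothing about the hackathon data.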