Black Friday Dataset: EDA and Feature Engineering

This project focuses on conducting Exploratory Data Analysis (EDA) and Feature Engineering on the Black Friday dataset. As a beginner, I aimed to apply foundational data science techniques, from cleaning and understanding the dataset to preparing it for model training.

Project Overview

The Black Friday dataset contains customer purchase details. The main objective of this project is to:

Explore the dataset to identify patterns and insights.
Engineer features that could help in building predictive models.
Prepare the data for training machine learning models.

Key Steps in the Project

Data Loading and Preprocessing:
- The dataset was loaded using pandas.
- Missing values were handled by separating rows with missing purchase values into test and train sets.
Exploratory Data Analysis (EDA):
- Descriptive statistics were performed to understand the dataset.
- Visualizations such as bar plots were used to explore purchase trends and customer demographics.
Feature Engineering:
- Unnecessary columns, like Product_ID, were dropped from the feature set.
- Numerical features were scaled using StandardScaler to prepare for machine learning models.
Train-Test Split:
- The data was split into training and test sets using train_test_split from sklearn.
- The target variable was the Purchase column, and all other columns were used as features.
Feature Scaling:
- Implemented scaling of the features to standardize the data, which is essential for certain machine learning models.

Next Steps

The dataset is ready to be used for training machine learning models.

Libraries Used

pandas
numpy
matplotlib
seaborn
scikit-learn

How to Run the Project

Clone the repository:

git clone https://github.com/YashsTiwari/BlackFriday-EDA-and-Feature-Engineering.git

Install the required libraries:
```
pip install -r requirements.txt
```
Run the Jupyter notebook to view the analysis.

Name	Name	Last commit message	Last commit date
Latest commit YashsTiwari Updated README.md Sep 5, 2024 23ef3df · Sep 5, 2024 History 3 Commits
BlackFriday EDA and Feature Engineering.ipynb	BlackFriday EDA and Feature Engineering.ipynb	Jupyter notebook and dataset	Sep 5, 2024
README.md	README.md	Updated README.md	Sep 5, 2024
blackFriday_test.csv	blackFriday_test.csv	Jupyter notebook and dataset	Sep 5, 2024
blackFriday_train.csv	blackFriday_train.csv	Jupyter notebook and dataset	Sep 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Black Friday Dataset: EDA and Feature Engineering

Project Overview

Key Steps in the Project

Next Steps

Libraries Used

How to Run the Project

About

Releases

Packages

Languages

YashsTiwari/BlackFriday-EDA-and-Feature-Engineering

Folders and files

Latest commit

History

Repository files navigation

Black Friday Dataset: EDA and Feature Engineering

Project Overview

Key Steps in the Project

Next Steps

Libraries Used

How to Run the Project

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages