Skip to content

Krisanth-21/Python-SQL-Data-Analysis-Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python-SQL Data Analysis Pipeline

DESCRIPTION:

This project demonstrates a comprehensive data analysis pipeline using Python and SQL. 
It leverages the Kaggle API to fetch a retail orders dataset, performs data cleaning and transformation with Pandas,
and loads the processed data into SQL Server for in-depth analysis.

KEY FEATURES:

Data Acquisition:

Utilizes the Kaggle API to download the required dataset.

Data Cleaning and Preparation:

Employs Pandas to handle missing values, inconsistencies, and perform data transformations.

Data Loading:

Populates SQL Server with the cleaned and prepared data for efficient analysis.

Data Analysis: Executes SQL queries to address specific business questions, such as:

Identifying top-selling and revenue-generating products.
Analyzing regional sales trends.
Comparing sales performance across different time periods.
Evaluating profit growth for various product categories.

PREREQUISITES:

Python version above 4
Pandas
SQL Server
Kaggle API credentials

Feel free to contribute to this project by:

Enhancing the data analysis queries.
Adding new features or visualizations.
Improving the project's documentation.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published