This project demonstrates a comprehensive data analysis pipeline using Python and SQL.
It leverages the Kaggle API to fetch a retail orders dataset, performs data cleaning and transformation with Pandas,
and loads the processed data into SQL Server for in-depth analysis.
Utilizes the Kaggle API to download the required dataset.
Employs Pandas to handle missing values, inconsistencies, and perform data transformations.
Populates SQL Server with the cleaned and prepared data for efficient analysis.
Identifying top-selling and revenue-generating products.
Analyzing regional sales trends.
Comparing sales performance across different time periods.
Evaluating profit growth for various product categories.
Python version above 4
Pandas
SQL Server
Kaggle API credentials
Enhancing the data analysis queries.
Adding new features or visualizations.
Improving the project's documentation.