Skip to content
#

azure-data-lake-storage-gen2

Here are 4 public repositories matching this topic...

Language: All
Filter by language

In this project, I've created an end-to-end ETL pipeline and subsequently developed a machine learning model to predict the price of Amazon products based on several product-related features.

  • Updated Nov 26, 2024
  • Python

End-to-end backend and data hub architecture on Azure, integrating Databricks and a suite of Azure services for seamless data processing, analytics, and deployment.

  • Updated Feb 1, 2025
  • Jupyter Notebook

This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.

  • Updated Nov 14, 2024
  • Python

Improve this page

Add a description, image, and links to the azure-data-lake-storage-gen2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the azure-data-lake-storage-gen2 topic, visit your repo's landing page and select "manage topics."

Learn more