Pinned
- US-Immigration-Study-ETL (Public)
  Demonstrates the setup of an ETL pipeline with Spark on an AWS EMR cluster, using a Jupyter Notebook.
  Language: Jupyter Notebook
- DevOps-IaC-CloudFormation (Public)
  Infrastructure as Code to deploy a web application through AWS CloudFormation.
  Language: Shell
- Data-Pipelines-with-Airflow (Public)
  An ETL pipeline between S3 and AWS Redshift using Apache Airflow.
  Language: Jupyter Notebook
- Data-Lake-with-Spark (Public)
  An ETL pipeline for a Data Lake, built with Spark and running on a cluster in AWS. The data is loaded from S3, processed into analytics tables using Spark, and loaded back into S3.
  Language: Jupyter Notebook