- KrakĂłw, Poland
-
08:57
- 1h ahead - https://stephen137.github.io/
- in/sjbarrie
-
Microsoft_Azure_ETL_Pipeline Public
Automated Medallion architecture ETL data pipeline, leveraging ADLSGen2, Azure Data Factory, Azure Synapse, Databricks and Spark to store, connect and move a variety of data to and from different s…
-
databricks_project Public
Data access via API requests, creation of automated data pipeline using Spark and SQL with orchestration via Azure Data Factory
Jupyter Notebook UpdatedFeb 24, 2025 -
web-scrape-project Public
Scrape coinmarketcap.com and create a real-time crypto portfolio valuation based on user selected holdings
Jupyter Notebook Apache License 2.0 UpdatedJan 18, 2025 -
-
Lost-and-Found Public
Web scraping using Beautiful Soup, data clean up using pandas, visualisation with Tableau
Jupyter Notebook UpdatedJan 7, 2025 -
-
tidytuesday Public
Forked from rfordatascience/tidytuesdayOfficial repo for the #tidytuesday project
HTML Creative Commons Zero v1.0 Universal UpdatedNov 26, 2024 -
R-graph-gallery Public
Forked from holtzy/R-graph-galleryA website that displays hundreds of R charts with their code
HTML MIT License UpdatedNov 21, 2024 -
end_to_end_data_pipeline Public
Fully automated csv to dashboard pipeline using Terraform, Google Cloud Storage, BigQuery, dbt, Prefect and Looker Studio. Peer ranked 9 of 298. Explore the dashboard below
-
data-science Public
Forked from ossu/data-scienceđź“Š Path to a free self-taught education in Data Science!
Other UpdatedSep 30, 2024 -
-
stock-markets-analytics-zoomcamp Public
Forked from DataTalksClub/stock-markets-analytics-zoomcampCourse Materials for Analytics in Stock Markets Zoomcamp
Jupyter Notebook UpdatedMay 27, 2024 -
ETL : Python script, csv, clean with pandas, save to SQLITE database. NLP ML pipeline: tokenizes raw text and classifies. Visualization in Flask app
Jupyter Notebook Apache License 2.0 UpdatedMar 26, 2024 -
Agri_Dashboard Public
Dashboard creating using data from World Bank API,. Front-end Plotly, Bootstrap. Back-end Flask. Deployment Heroku
Python UpdatedMar 8, 2024 -
Detailed analysis of the Stack Overflow Developer Survey 2023. Link to the blog post below :
Jupyter Notebook UpdatedFeb 24, 2024 -
stackoverflow Public
Forked from jjrunner/stackoverflowFindings from Stackoverflow 2017
Jupyter Notebook UpdatedFeb 20, 2024 -
Bayesian-inference Public
A/B testing, decision analysis, and linear regression modeling using a Bayesian approach
Jupyter Notebook UpdatedFeb 10, 2024 -
pymc Public
Forked from pymc-devs/pymcBayesian Modeling and Probabilistic Programming in Python
Python Other UpdatedFeb 9, 2024 -
-
statistical_inference Public
Moving from descriptive statistics to inferential statistics, leveraging parametric and non-parametric using SciPy to measure strength, and bootstrapping to address imbalanced datasets
Jupyter Notebook UpdatedFeb 6, 2024 -
pymc-marketing Public
Forked from pymc-labs/pymc-marketingBayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
Python Apache License 2.0 UpdatedJan 31, 2024 -
fraud_detection Public
Rebalance dataset using sampling techniques (SMOTE), leveraging ensemble, K-means, and text mining LDA (Latent Dirichlet Allocation) models to detect fraud
Jupyter Notebook UpdatedJan 31, 2024 -
customer_lifetime_value Public
Transformation of complex Google Merchandise Store BigQuery raw dataset, into business insights and actionable marketing outcomes using SQL and pandas. Baseline XGBoost regression and classificatio…
-
A curated collection of on-demand courses, labs, and skill badges that provide real-world, hands-on experience using Google Cloud technologies essential to the ML Engineer role.
-
Advent_of_Code_2023 Public
Advent of Code is an Advent calendar of programming puzzles created by Eric Wastl
Jupyter Notebook UpdatedDec 19, 2023 -
Hypotheses_Testing Public
Statistical analysis of a sample of patients who were evaluated for heart disease at the Cleveland Clinic Foundation. The data was downloaded from the UCI Machine Learning Repository below, and the…
UpdatedDec 7, 2023 -
urban_accessibility Public
Geolocation project to estimate amenity accessibility. Combines amenity data harvested from OSM, population data sourced from WorldPop Hub and leverages Uber's H3 grid system to provide location in…
-
Imagery-in-Action Public
Learn how to use ArcGIS Pro, ArcGIS Online, and other apps to visualize, analyze, and derive information from imagery and remote sensing.
Jupyter Notebook UpdatedOct 25, 2023 -
Edinburgh_Airbnb Public
Perform a spatial join and create an interactive Felt choropleth map of Airbnb density per km2 at data zone granular level.
-
ESRI-Spatial-Data-Science Public
Exploring the application of spatial data science to uncover hidden patterns and improve predictive modeling, using powerful analytical tools in Esri's ArcGIS software and learning how to integrate…