
ydata-profiling

Here are 15 public repositories matching this topic...

This ETL (Extract, Transform, Load) project employs several Python libraries, including Airflow, Soda, Polars, YData Profiling, DuckDB, Requests, Loguru, and Google Cloud to streamline the extraction, transformation, and loading of CSV datasets from the U.S. government's data repository at https://catalog.data.gov.

  • Updated Dec 10, 2023
  • HTML

The Exploratory Data Analysis (EDA) App is a Streamlit-based web application that allows users to perform comprehensive exploratory data analysis on their datasets. This app provides an intuitive and user-friendly interface for uploading CSV files, visualizing the input data, and generating an interactive profiling report.

  • Updated Mar 12, 2025
  • Python
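
For readers unfamiliar with what a profiling report summarizes, the following is a minimal standard-library sketch of a per-column summary of an uploaded CSV. It is not the app's code and not the ydata-profiling API. The function name `profile_csv`, the chosen statistics, and the sample data are all assumptions made for illustration.

```python
import csv
import io
from statistics import mean


def profile_csv(text: str) -> dict:
    """Summarize each column of a CSV string (hypothetical helper).

    For every column this reports the row count, the number of missing
    (empty) cells, the number of distinct non-empty values, and, when
    every non-empty value parses as a number, the mean.
    """
    reader = csv.DictReader(io.StringIO(text))
    columns = {name: [] for name in (reader.fieldnames or [])}
    for row in reader:
        for name in columns:
            # Short rows yield None for absent fields; treat them as empty.
            columns[name].append((row.get(name) or "").strip())

    summary = {}
    for name, values in columns.items():
        present = [v for v in values if v != ""]
        stats = {
            "count": len(values),
            "missing": len(values) - len(present),
            "distinct": len(set(present)),
        }
        try:
            numbers = [float(v) for v in present]
        except ValueError:
            numbers = []  # at least one value is non-numeric
        if numbers:
            stats["mean"] = mean(numbers)
        summary[name] = stats
    return summary


# Made-up sample data standing in for an uploaded file.
SAMPLE = "city,population\nAustin,100\nBoston,\nAustin,300\n"
report = profile_csv(SAMPLE)
```

A full profiling report goes well beyond this (distributions, correlations, interactive HTML output); the sketch only shows the kind of per-column statistics such a report starts from.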

A data warehousing project covering: data profiling with ydata-profiling; data staging (staging tables); Talend for ETL jobs; MySQL validations; a dimensional model (target tables) with facts and dimensions; a stage-to-target mapping document explaining each source column name and the target column it finally maps to; and documentation of all transformations, if any.

  • Updated Dec 6, 2024
  • HTML

Data Sweeper Pro+ is an advanced data cleaning and transformation platform built with Streamlit. It allows users to upload datasets, clean them, analyze them with interactive profiling reports, and export the cleaned data in multiple formats. The app is designed for both technical and non-technical users.

  • Updated Feb 21, 2025
  • Python
