Skip to content
#

data-pipelines

Here are 262 public repositories matching this topic...

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

  • Updated May 12, 2025
  • HTML
fluvio
preswald

Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

  • Updated May 14, 2025
  • Python
elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

  • Updated May 13, 2025
  • HTML
odd-platform

First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

  • Updated Feb 19, 2025
  • Java

Improve this page

Add a description, image, and links to the data-pipelines topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-pipelines topic, visit your repo's landing page and select "manage topics."

Learn more