Skip to content
This repository has been archived by the owner on Oct 21, 2020. It is now read-only.

Releases: data-mill-cloud/data-mill

Third release

25 Apr 15:13
Compare
Choose a tag to compare
  • Master status at 2019-04-25
  • A few new components: Druidi, Superset, Nifi
  • Bug fixing

Second release

23 Feb 16:14
Compare
Choose a tag to compare
  1. Added installation script to ease initial setup
  2. Made run callable from anywhere (/usr/local/bin)
  3. Configuration yaml for components and k8s can be centralised (all-in-one flavour file) or distributed (per component)
  4. Shortened preamble (loading of component variables and configs) with new utils functions

Second release

12 Feb 13:43
Compare
Choose a tag to compare
Second release Pre-release
Pre-release

Added components for:

  • Cassandra
  • Flink
  • Elasticsearch
  • Kibana
  • Grafana
  • Argo
  • Metallb
  • Ambassador
  • Traefik

First release

12 Feb 13:38
73291cd
Compare
Choose a tag to compare
  1. Kubernetes setup
    • local using minikube, as well as microk8s and multipass+microk8s
    • remote, using sample scripts in KOPS for the setup on AWS and GKE
  2. Setup of common components
    • Ingestion (e.g. kafka, RabbitMQ)
    • Persistent storage (e.g. s3, ArangoDB, InfluxDB)
    • Data Versioning (e.g. Pachyderm)
    • Processing (e.g. dask, spark)
    • Exploration Environment (e.g. JupyterHub)
    • BI Dashboarding (e.g. superset)
    • ML model versioning and benchmarking, as well as project management (e.g. mlflow)
    • ML model serving (e.g. Seldon-core)
    • Monitoring (e.g. prometheus, Grafana)
  3. Data Science environments
    • Scientific Python Environment
    • PySpark Environment
    • Keras/Tensorflow Environment
    • Keras/Tensorflow GPU Environment
  4. Example code
    • notebooks