Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Updated Mar 16, 2024 - Jupyter Notebook
Our own development branch of the well-known WPF document docking library
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Azure Databricks - Advent of 2020 Blogposts
SparkTDA is a package for Apache Spark providing Topological Data Analysis functionality.
This repository contains Spark, MLlib, PySpark and DataFrames projects
Spark-Transformers: Library for exporting Apache Spark MLlib models to use them in any Java application with no other dependencies.
Comprehensive walkthrough of the basic algorithms in Spark MLlib, the machine learning library of the Spark big data framework, with a complete set of test files
Visualizes the Random Forest debug string from Spark MLlib using D3.js
dllib is a distributed deep learning library running on Apache Spark
Spark (Scala and Python)
Implementation of the model from the paper "Inferring Networks of Substitutable and Complementary Products"
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker
Bayesian hyperparameter tuning for Spark MLlib
Example from Spark MLlib (in Python)