Apache Sedona (incubating) is a cluster computing system for processing large-scale spatial data. Sedona extends Apache Spark and SparkSQL with out-of-the-box Spatial Resilient Distributed Datasets (SRDDs) and SpatialSQL, which efficiently load, process, and analyze large-scale spatial data across machines.
Name | API | Spark compatibility | Introduction |
---|---|---|---|
Core | RDD | Spark 2.X/1.X | SpatialRDDs and Query Operators. |
SQL | SQL/DataFrame | SparkSQL 2.1+ | SQL interfaces for Sedona core. |
Viz | RDD, SQL/DataFrame | RDD - Spark 2.X/1.X, SQL - Spark 2.1+ | Visualization for Spatial RDD and DataFrame. |
Zeppelin | Apache Zeppelin | Spark 2.1+, Zeppelin 0.8.1+ | Plugin for Apache Zeppelin. |
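
To give a feel for what a spatial query operator does, here is a minimal, single-machine sketch in plain Python. It does not use Sedona or Spark, and it is not Sedona's API: the function names `grid_partition` and `range_query`, the sample points, and the fixed-size grid are all assumptions made for illustration. The idea it shows is only the general one behind partitioned spatial processing: group points by grid cell, then answer a rectangular range query by scanning just the cells that overlap the query window.

```python
from collections import defaultdict

# Hypothetical toy data: (x, y) points, not taken from the Sedona documentation.
POINTS = [(1.0, 1.0), (2.5, 3.0), (7.0, 8.0), (4.0, 4.5), (9.0, 0.5)]


def grid_partition(points, cell_size):
    """Group points by the grid cell that contains them (a stand-in for spatial partitioning)."""
    cells = defaultdict(list)
    for x, y in points:
        cells[(int(x // cell_size), int(y // cell_size))].append((x, y))
    return cells


def range_query(cells, cell_size, min_x, min_y, max_x, max_y):
    """Return the points inside the query rectangle, scanning only cells that overlap it."""
    hits = []
    for cx in range(int(min_x // cell_size), int(max_x // cell_size) + 1):
        for cy in range(int(min_y // cell_size), int(max_y // cell_size) + 1):
            # .get avoids creating empty cells in the defaultdict
            for x, y in cells.get((cx, cy), []):
                if min_x <= x <= max_x and min_y <= y <= max_y:
                    hits.append((x, y))
    return sorted(hits)


cells = grid_partition(POINTS, cell_size=5.0)
result = range_query(cells, 5.0, 0.0, 0.0, 5.0, 5.0)
print(result)
```

In a cluster setting the cells would be spread across machines, which is the part this sketch leaves out.
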

- Twitter: Sedona@Twitter
- Sedona JIRA: bugs, pull requests, and other similar issues
- issues@sedona.apache.org: general questions or tutorials
- dev@sedona.apache.org: project development