
# aws-tileserver - Developer documentation

This page gives a detailed overview of all techniques used in this project.

## AWS Architecture

### Overall architecture

*(Figure: overall architecture diagram)*

### Creating the postgis-client Docker image

1. A user pushes a new commit to GitHub.
2. GitHub triggers a webhook for AWS CodeBuild.
3. CodeBuild builds a new Docker image and pushes it to ECR.

### Serving static content

1. The client sends an HTTP request to the URL assigned to the CloudFront distribution.
2. CloudFront reads/caches static content from the public S3 bucket.
3. The HTTP response is sent to the client.

### Serving vector tiles

1. The client sends an HTTP request to the URL assigned to the CloudFront distribution.
2. CloudFront reads/caches vector tiles from the S3 bucket.
3. If the requested vector tile is not available on S3, a temporary redirect (307) to the tileserver is returned.
4. The tileserver generates the vector tile with data from the RDS Postgres instance, sends it to the client, and also stores it on S3 (see the sketch below).
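
The generate-and-store step in the last item could look roughly like the following. This is a minimal sketch assuming a Lambda handler behind API Gateway; the helper `getTileFromDatabase` and the `TILE_BUCKET` variable are hypothetical, not the actual aws-tileserver code.

```typescript
import { S3Client, PutObjectCommand } from "@aws-sdk/client-s3";

const s3 = new S3Client({});

// Hypothetical helper that runs the ST_AsMVT query described further below
// against the RDS Postgres instance and returns the raw tile buffer.
declare function getTileFromDatabase(z: number, x: number, y: number): Promise<Buffer>;

export async function handler(event: { z: number; x: number; y: number }) {
  const { z, x, y } = event;
  const tile = await getTileFromDatabase(z, x, y);

  // Store the tile on S3 so CloudFront can serve it directly next time.
  await s3.send(new PutObjectCommand({
    Bucket: process.env.TILE_BUCKET,
    Key: `local/${z}/${x}/${y}.mvt`,
    Body: tile,
    ContentType: "application/vnd.mapbox-vector-tile",
  }));

  // Return the tile to the client (binary responses go out base64-encoded).
  return {
    statusCode: 200,
    headers: { "Content-Type": "application/vnd.mapbox-vector-tile" },
    body: tile.toString("base64"),
    isBase64Encoded: true,
  };
}
```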

## Lambda Implementation Details

### Database

The database instance MUST support at least PostGIS 2.4.0 (the release that introduced `ST_AsMVT`); otherwise, vector tiles can't be created.
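
A startup check along these lines can fail fast when that requirement is not met. This is a minimal sketch using the `pg` client; the function is illustrative and not part of the project code.

```typescript
import { Client } from "pg";

// Sketch: ST_AsMVT/ST_AsMVTGeom require PostGIS >= 2.4.0, so abort early
// if the connected instance is older.
async function assertPostgisVersion(client: Client): Promise<void> {
  const res = await client.query("SELECT postgis_lib_version() AS version");
  const version: string = res.rows[0].version;
  const [major, minor] = version.split(".").map(Number);
  if (major < 2 || (major === 2 && minor < 4)) {
    throw new Error(`PostGIS >= 2.4.0 required, found ${version}`);
  }
}
```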

### Vector Tiles

aws-tileserver supports configurable REST endpoints for vector tiles according to the Vector Tile Specification 2.1. Each endpoint provides access to a vector tile with configurable layers.
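
An endpoint address like `/local/14/8691/5677.mvt` (endpoint/z/x/y) maps to a Web Mercator bounding box, which the query in the next section receives as `${bbox}`. A minimal sketch of that mapping, assuming the standard XYZ tiling scheme (illustrative, not the project's actual code):

```typescript
// Half the width of the Web Mercator (EPSG:3857) world in meters.
const EXTENT = 20037508.342789244;

// Convert a tile address z/x/y into an ST_MakeEnvelope() fragment
// covering that tile in EPSG:3857.
function tileToBBox(z: number, x: number, y: number): string {
  const size = (2 * EXTENT) / 2 ** z;  // edge length of one tile in meters
  const xmin = -EXTENT + x * size;
  const ymax = EXTENT - y * size;      // XYZ tiles count y from the north
  return `ST_MakeEnvelope(${xmin}, ${ymax - size}, ${xmin + size}, ${ymax}, 3857)`;
}
```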

### SQL Query

Each layer is resolved to the following query:

```sql
(SELECT ST_AsMVT(q, '${layer.name}', ${layerExtend}, 'geom') as data FROM
    (SELECT ${prefix}ST_AsMvtGeom(
        ${geom},
        ${bbox},
        ${layerExtend},
        ${buffer},
        ${clip_geom}
        ) AS geom${keys}
    FROM ${layer.table} WHERE (${geom} && ${bbox})${where}${postfix}) as q)
```

All resulting layers are merged into one SQL query:

```sql
SELECT ( [${layer1} [|| ${layer2} [|| ...]]] ) as data
```
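
In TypeScript, that substitution could be assembled roughly as follows. This is a sketch assuming a hypothetical `LayerConfig` shape; the actual aws-tileserver configuration format and defaults may differ.

```typescript
// Hypothetical layer configuration; field names mirror the placeholders above.
interface LayerConfig {
  name: string;          // ${layer.name}
  table: string;         // ${layer.table}
  geom: string;          // ${geom}: geometry column
  extend: number;        // ${layerExtend}: tile extent in internal units
  buffer: number;        // ${buffer}: buffer around the tile
  clipGeom: boolean;     // ${clip_geom}
  keys: string[];        // ${keys}: extra attribute columns
  where?: string;        // ${where}: optional additional filter
}

function buildLayerQuery(layer: LayerConfig, bbox: string): string {
  const keys = layer.keys.length ? `, ${layer.keys.join(", ")}` : "";
  const where = layer.where ? ` AND (${layer.where})` : "";
  return `(SELECT ST_AsMVT(q, '${layer.name}', ${layer.extend}, 'geom') as data FROM
    (SELECT ST_AsMvtGeom(${layer.geom}, ${bbox}, ${layer.extend}, ${layer.buffer}, ${layer.clipGeom}
      ) AS geom${keys}
    FROM ${layer.table} WHERE (${layer.geom} && ${bbox})${where}) as q)`;
}

// Merge all per-layer sub-queries into one statement.
function buildTileQuery(layers: LayerConfig[], bbox: string): string {
  return `SELECT (${layers.map((l) => buildLayerQuery(l, bbox)).join(" || ")}) as data`;
}
```

Concatenating the per-layer `ST_AsMVT` buffers with `||` works because a vector tile is simply a sequence of encoded layers, so the merged `bytea` value is itself a valid tile.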

## Performance, Benchmarks & Timing

*(Figure: timing stack)*

### Setup

1. 990 (+5 for warm-up) HTTP/2 requests (IPv4) were made to https://tileserver.cyclemap.link/local/14/8691/5677.mvt.
2. Everything is deployed to eu-central-1.
3. Client timing was collected with curl (see tools/benchmark.sh and the sketch after this list).
4. Lambda durations were collected from CloudWatch.
5. The raw results can be found in docs/benchmark.ods.
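
The project's measurements were taken with curl; as a sketch of the same idea in TypeScript (not the script the project used), repeated requests could be timed like this:

```typescript
// Sketch only: time N sequential requests against the tile endpoint and
// report the mean; the real benchmark used curl (see tools/benchmark.sh).
const TILE_URL = "https://tileserver.cyclemap.link/local/14/8691/5677.mvt";

async function bench(n: number, warmup: number): Promise<void> {
  const timings: number[] = [];
  for (let i = 0; i < warmup + n; i++) {
    const start = performance.now();
    const res = await fetch(TILE_URL);
    await res.arrayBuffer();               // drain the body before stopping the clock
    if (i >= warmup) timings.push(performance.now() - start); // discard warm-up requests
  }
  const mean = timings.reduce((a, b) => a + b, 0) / timings.length;
  console.log(`mean over ${n} requests: ${mean.toFixed(1)} ms`);
}

bench(990, 5).catch(console.error);
```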

### Update 2023

While upgrading to Node.js 18, I repeated the benchmarks on 2023-09-06 07:00 UTC. Interestingly, the timing has changed in general:

| Node version | Benchmark | Lambda timing |
| ------------ | --------- | ------------- |
| Node.js 12   | 2019      | 303 ms        |
| Node.js 12   | 2023      | 867 ms        |
| Node.js 18   | 2023      | 670 ms        |

Further investigation is needed to determine the root cause. Maybe the database has grown bigger (and therefore slower).

## Next Steps

- Move the database to Serverless Aurora PostgreSQL to reduce monthly costs. Won't do: resuming after a pause takes 30+ seconds, and keeping 2 ACUs hot at all times is too expensive.
- Evaluate the Data API for Aurora Serverless. Won't do: see above. According to this review, performance also seems bad compared to direct API calls.
- Move the terraform state to an S3 bucket. Done!
- Security review of the Lambda code (e.g. SQL injection, ...).
- Change all scripts to use the Postgres environment variables (PGUSER, ...). Only relevant for database processing, so out of scope here.
- Omit Postgres credentials altogether and use an IAM role instead.
- Move the Lambda function out of the VPC to reduce cold-start time. Not needed anymore; see https://aws.amazon.com/blogs/compute/announcing-improved-vpc-networking-for-aws-lambda-functions/
- Add a raster endpoint with node-mapbox-gl-native to serve pre-rendered raster images.
- Check how blue-green deployments could be realized with API Gateway and Lambda.
