BON in a Box 2.0

Mapping Post-2020 Global Biodiversity Framework indicators and their uncertainty.

A GEO BON project, born from a collaboration between Microsoft, McGill, the Humboldt Institute, Université de Sherbrooke, Université Concordia and Université de Montréal.

Running the servers locally

Prerequisites:

  • Git
  • Linux: Docker with Docker Compose installed
  • Windows/Mac: Docker Desktop
  • At least 6 GB of free space (this includes the installation of Docker Desktop)

To run:

  1. Clone the repository (Windows users: do not clone it in a folder under OneDrive).
  2. Using a terminal, navigate to the top-level folder.
  3. docker compose pull
  • This needs to be re-run every time the server code changes, or after a git pull if you are not certain.
  • The first execution will be long. Subsequent ones will be shorter or immediate, depending on the changes.
  • Network problems may cause the process to fail. If this happens, first try running the command again: intermediate states are saved, so not everything is redone after a failure.
  4. Provide an environment file (.env) in the root folder with the following keys:
# Access the planetary computer APIs
JUPYTERHUB_API_TOKEN=
DASK_GATEWAY__AUTH__TYPE=
DASK_GATEWAY__CLUSTER__OPTIONS__IMAGE=
DASK_GATEWAY__ADDRESS=
DASK_GATEWAY__PROXY_ADDRESS=

# Access GBIF API
GBIF_USER=
GBIF_PWD=
GBIF_EMAIL=
  5. docker compose up -d
  6. Open the UI in a browser.
  7. docker compose down (to stop the server when done)

Servers do not need to be restarted when modifying scripts in the /scripts folder:

  • When modifying an existing script, simply re-run the script from the UI and the new version will be executed.
  • When adding/renaming/removing scripts, refresh the browser page.

Scripts

The scripts perform the actual work behind the scenes. They are located in the /scripts folder.

Currently supported:

  • R v4.1.2
  • Julia v1.8.1
  • Python3 v3.10.6
  • sh

Script lifecycle:

  1. Script launched with output folder as a parameter.
  2. Script reads input.json to get execution parameters (e.g. species, area, data source, etc.)
  3. Script performs its task
  4. Script generates output.json, containing links to result files, or native values (number, string, etc.)

See the empty R script for a minimal example of the script lifecycle.
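For illustration, here is a minimal R sketch of that lifecycle. It is an assumption-laden example, not the project's reference implementation: it supposes the runner passes the output folder as the first command-line argument and places input.json in it, it uses the rjson package, and the input/output keys (species_count, area_km2, density) are purely hypothetical.

# 1. The script is launched with the output folder as a parameter.
library(rjson)
outputFolder <- commandArgs(trailingOnly = TRUE)[1]

# 2. Read execution parameters from input.json.
input <- fromJSON(file = file.path(outputFolder, "input.json"))

# 3. Perform the task (here, a trivial computation).
density <- input$species_count / input$area_km2

# 4. Write output.json, containing links to result files or native values.
output <- list(density = density)
writeLines(toJSON(output), file.path(outputFolder, "output.json"))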

Describing a script

The script description is in a .yml file next to the script. It is necessary for the script to be found and connected to other scripts in a pipeline.

Here is an empty commented sample:

script: # script file with extension, such as "myScript.py".
description: # Targeted to those who will interpret pipeline results and edit pipelines.
external_link: # Optional, link to a separate project, github repo, etc.

inputs: # 0 to many
  key: # replace the word "key" by a snake case identifier for this input
    label: # Human-readable version of the name
    description: # Targeted to those who will interpret pipeline results and edit pipelines.
    type: # see below
    example: # will also be used as default value

outputs: # 1 to many
  key:
    label: 
    description:
    type:
    example: # optional, for documentation purpose only

references: # 0 to many
  - text: # plain text reference
    doi: # link

See the example script description in the repository.

Each input and output must declare a type, in lowercase. The following file types are accepted:

File type | MIME type to use in the yaml | UI rendering
--- | --- | ---
CSV | text/csv | HTML table (partial content)
GeoPackage | application/geopackage+sqlite3 | Link
GeoTIFF | image/tiff;application=geotiff | Map widget (leaflet)
JPG | image/jpg | HTML img tag
Shapefile | application/dbf | Link
Text | text/plain | Plain text
TSV | text/tab-separated-values | HTML table (partial content)
(any unknown type) | | Plain text or link
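For file-type outputs, a script typically writes the file into the output folder and exposes its path in output.json. A hedged R sketch, reusing the hypothetical conventions of the lifecycle example above (the suitability_map key and the commented terra call are illustrative, not prescribed):

# Sketch: returning a GeoTIFF result by path under a declared output key.
library(rjson)
outputFolder <- commandArgs(trailingOnly = TRUE)[1]
result_path <- file.path(outputFolder, "suitability.tif")
# terra::writeRaster(prediction, result_path)   # hypothetical raster-writing step
output <- list(suitability_map = result_path)
writeLines(toJSON(output), file.path(outputFolder, "output.json"))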

The following primitive types are accepted:

"type" attribute in the yaml | UI rendering
--- | ---
boolean | Plain text
float, float[] | Plain text
int, int[] | Plain text
options * | Plain text
text, text[] | Plain text
(any unknown type) | Plain text

* The options type requires an additional options attribute listing the available options.

options_example:
  label: Options example
  description: The user has to select between a fixed number of text options. Also called select or enum. The script receives the selected option as text.
  type: options
  options:
    - first option
    - second option
    - third option
  example: third option
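On the script side, the selected option arrives as plain text under the input's key. A minimal R sketch, again using the hypothetical conventions of the lifecycle example:

# Sketch: reading the option selected by the user.
outputFolder <- commandArgs(trailingOnly = TRUE)[1]
input <- rjson::fromJSON(file = file.path(outputFolder, "input.json"))
selected <- input$options_example   # plain text, e.g. "third option"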

Reporting problems

The output keys warning and error can be used to report problems during script execution. They do not need to be declared in the outputs section of the description, and both receive special display treatment in the UI.
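For example, a script might fill these keys as follows. This is only a sketch: the warning and error key names come from the convention above, while the occurrences variable and the messages are made up.

# Sketch: reporting problems through the reserved warning and error keys.
library(rjson)
outputFolder <- commandArgs(trailingOnly = TRUE)[1]

occurrences <- data.frame()   # hypothetical result of an earlier step
output <- list()
if (nrow(occurrences) == 0) {
  # Non-fatal problem: results may still be produced.
  output$warning <- "No occurrences found; the resulting map may be empty."
}
# A fatal problem would be reported the same way, under the error key:
# output$error <- "Missing study area polygon; cannot proceed."
writeLines(toJSON(output), file.path(outputFolder, "output.json"))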

Script dependencies

Scripts can install their own dependencies directly (install.packages in R, Pkg.add in Julia, etc.). However, dependencies installed this way will need to be reinstalled whenever the server is deployed on another computer or server.

To pre-compile a dependency into the image, add it to runners/r-dockerfile or runners/julia-dockerfile. When the pull request is merged to main, a new image containing the added dependencies will be available through docker compose pull.

Pipelines

Each script becomes a pipeline step. Pipelines support the same input and output types and UI rendering as individual scripts.

Pipeline editor

The pipeline editor allows you to create pipelines by plugging steps together.

The left pane shows the available steps, the right pane shows the canvas.

To add a step: drag and drop from the left pane to the canvas.

To connect steps: drag and drop from one output handle to an input handle. Input handles are on the left, and output handles are on the right.

To add a constant value: double-click on any input to add a constant value linked to this input. It is pre-filled with the example value.

To delete a step or a pipe: select it and press the Delete key on your keyboard.

To make an array out of single-value outputs: if several outputs of the same type are connected to the same input, the script receives them as an array.

A single value can also be combined with an array of the same type, to produce a single array.
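From the script's point of view, such an input is simply a JSON array in input.json. In R (continuing the hypothetical conventions above, with a made-up raster_layers key) it can be read as a vector or list:

# Sketch: an input fed by several outputs of the same type arrives as an array.
outputFolder <- commandArgs(trailingOnly = TRUE)[1]
input <- rjson::fromJSON(file = file.path(outputFolder, "input.json"))
layer_paths <- input$raster_layers   # character vector of file paths
message(length(layer_paths), " layers received")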

User inputs: To provide inputs at runtime, simply leave them unconnected in the pipeline editor. They will be added to the input.json sample file when running the pipeline.

Pipeline inputs and outputs

Any input with no constant value assigned will be considered a pipeline input, and the user will have to provide its value.

Drag and drop an output node and link it to a step output to mark it as an output of the pipeline. All other unmarked step outputs will still be available as intermediate results in the UI.

Saving and loading

The editor does not allow you to edit files live on the server. Files need to be committed to the GitHub repository using git.

To load an existing pipeline:

  1. Make sure you are up to date (e.g. git pull --rebase).
  2. Click "Load from file"
  3. Browse to the file on your computer and open it.

To save your modifications:

  1. Click save: the content is copied to your clipboard.
  2. Make sure you are up to date (e.g. git pull --rebase).
  3. Remove all the content of the target file.
  4. Paste content and save.

To share your modifications, commit and push on a branch using git. Then, create a pull request for that branch through the GitHub UI.

Developer documentation

The developer documentation is intended for those developing the microservice infrastructure supporting the pipelines.
