This repository contains the functionality to standardize the rattenapp data of the Flanders Environment Agency (VMM) to a Darwin Core Archive that can be harvested by GBIF.
source data → Darwin Core mapping script using SQL → generated Darwin Core files
The repository structure is based on Cookiecutter Data Science and the Checklist recipe. Files and directories indicated with GENERATED
should not be edited manually.
├── README.md : Description of this repository
├── LICENSE : Repository license
├── vmm-rattenapp-occurrences.Rproj : RStudio project file
├── .gitignore : Files and directories to be ignored by git
│
├── .github
│ ├── PULL_REQUEST_TEMPLATE.md : Pull request template
│ └── workflows
│ │ ├── fetch-data.yaml : GitHub action to fetch raw data
│ │ └── mapping_and_testing.yaml : GitHub action to map data to DwC and perform some tests on the Dwc output
|
├── src
│ ├── fetch_data.Rmd : Fetchin data script
│ ├── dwc_mapping.Rmd : Darwin Core mapping script
│ ├── run_fetch_data.R : R script to run code in fetch_data.Rmd in an automatic way within a GitHub action
│ ├── run_dwc_mapping.R : R script to run code in dcw_mapping.Rmd in an automatic way within a GitHub action
│ └── install_packages.R : R script to install all needed packages
|
├── sql : Darwin Core transformations
│ └── dwc_occurrence.sql
│
└── data
│ ├── raw : Source data to fetch
│ ├── external : External data used during the mapping
│ ├── interim : All fetched data, input for mapping script
│ └── processed : Darwin Core output of mapping script GENERATED
- Clone this repository to your computer
- Open the RStudio project file
- Run
install_packages.R
to install any required packages - Open
fetch_data.Rmd
R Markdown file in RStudio to fetch data manually - Open the
dwc_mapping.Rmd
R Markdown file in RStudio to map data to DwC manually - Click
Run > Run All
to generate the processed data
MIT License for the code and documentation in this repository. The included data is released under another license.