Skip to content

Commit

Permalink
Merge pull request #5 from alexsla/master
Browse files Browse the repository at this point in the history
New stuff in info
  • Loading branch information
alexsla authored Aug 24, 2023
2 parents 202de89 + 9078053 commit 112937a
Show file tree
Hide file tree
Showing 6 changed files with 11 additions and 17 deletions.
Binary file added kde.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added methodology.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
6 changes: 3 additions & 3 deletions modelling_info.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -35,15 +35,15 @@ To model species presence-only data with statistical models a random sample of b

For each species we created a spatial kernel density estimate around its occurrence records, projected onto a raster masked to only include countries in which the species is known to occur. We then generated 1000 background samples for each species by randomly sampling from the masked raster, with the sampling probability for each cell weighted by the spatial kernel density estimate.

**PLOT TBA**
![](methodology.png)

### Species distribution modelling

There are only few records per species, so we fitted a logistic binary regressions using a Hierarchical Generalized Additive Model (**HGAM**; Pedersen et al. 2019) with Point Process weighting (Fithian & Hastie 2013) to increase the predictive power of the model.

We downloaded global layers of 19 bioclimatic variables representing the mean, extremes, and variation in temperature and precipitation from WorldClim v2.1 (Fick et al. 2017), and a layer of the Landsat Enhanced Vegetation Index (EVI; Masek et al. 2006, Vermote et al. 2016). We then used Principal Coordinate Analysis (PCA) to convert the layers into orthogonal Principal Components (PCs) to reduce collinearity in the predictor variables. We used the 4 first PCs, which collectively explain **XX%** of the variation in the bioclimatic variables, as predictor variables for the HGAM model.
We downloaded global layers of 19 bioclimatic variables representing the mean, extremes, and variation in temperature and precipitation from WorldClim v2.1 (Fick et al. 2017), and a layer of the Landsat Enhanced Vegetation Index (EVI; Masek et al. 2006, Vermote et al. 2016). We then used Principal Coordinate Analysis (PCA) to convert the layers into orthogonal Principal Components (PCs) to reduce collinearity in the predictor variables. We used the 4 first PCs, which collectively explain 91.8% of the variation in the bioclimatic variables, as predictor variables for the HGAM model.

The model was trained on a random sample of 80% of the data, and model performance was evaluated by making predictions on the remaining 20% and calculating the area under the receiver operating characteristic curve (AUC). The model achieved good predictive performance, with an AUC score of **XX**.
The model was trained on a random sample of 80% of the data, and model performance was evaluated by making predictions on the remaining 20% and calculating the area under the receiver operating characteristic curve (AUC). The model achieved good predictive performance, with an AUC score of 0.85.

We then used to model to make predictions for all cells in Australia and projected the resultant habitat suitability scores onto a map of Australia.

Expand Down
22 changes: 8 additions & 14 deletions modelling_info.html

Large diffs are not rendered by default.

Binary file added occ_1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added occ_all.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 112937a

Please sign in to comment.